CN112927033A

CN112927033A - Data processing method and device, electronic equipment and storage medium

Info

Publication number: CN112927033A
Application number: CN202110113230.9A
Authority: CN
Inventors: 秦泽民; 薛朱梅; 张子隆; 苏伟
Original assignee: Shanghai Sensetime Intelligent Technology Co Ltd
Current assignee: Shanghai Sensetime Intelligent Technology Co Ltd
Priority date: 2021-01-27
Filing date: 2021-01-27
Publication date: 2021-06-08

Abstract

The present disclosure relates to a data processing method and apparatus, an electronic device, and a storage medium, the method including: acquiring a video image, wherein the video image is at least one frame of image in a video stream obtained by shooting a service area through image acquisition equipment; performing target detection based on the video image; responding to the detection result representation of the target detection to detect a target object, and acquiring user information of the target object; and acquiring response information corresponding to the user information, and outputting the response information through electronic equipment. The embodiment of the disclosure can save labor cost, improve user service handling efficiency and improve user experience.

Description

Data processing method and device, electronic equipment and storage medium

Technical Field

The present disclosure relates to the field of computer technologies, and in particular, to a data processing method and apparatus, an electronic device, and a storage medium.

Background

When a user transacts business, most of the users are used for receiving and conducting guide service by staff, so that the labor cost is consumed, and the problem that the users are more and the number of the users is limited because the staff are limited, so that part of the users receive no people or cannot provide the guide service in time is solved, and the business transaction efficiency of the users is reduced.

Disclosure of Invention

The present disclosure proposes a technical solution for data processing.

According to an aspect of the present disclosure, there is provided a data processing method including:

acquiring a video image, wherein the video image is at least one frame of image in a video stream obtained by shooting a service area through image acquisition equipment;

performing target detection based on the video image;

responding to the detection result representation of the target detection to detect a target object, and acquiring user information of the target object;

and acquiring response information corresponding to the user information, and outputting the response information through electronic equipment.

According to the data processing method provided by the embodiment of the disclosure, the user can be received through the response information output by the electronic equipment, and the user is guided to perform related business handling, so that the labor cost can be saved, the business handling efficiency of the user is improved, and the user experience is improved.

In one possible implementation, the response information includes at least one of first response information and second response information; the first response information comprises product recommendation information for the target object; the second response message includes response message for the user message.

The data processing method provided by the embodiment of the disclosure can be used for carrying out targeted product recommendation, can be used for carrying out marketing recommendation more accurately, and can improve the user experience while improving the marketing accuracy.

In a possible implementation manner, the obtaining response information corresponding to the user information includes:

determining the identity information of the target object according to the face image;

and acquiring first response information according to the identity information of the target object.

The data processing method provided by the embodiment of the disclosure can perform product recommendation pertinently according to the identity information of the target object, can perform marketing recommendation more accurately, and can improve user experience while improving marketing accuracy.

In a possible implementation manner, the obtaining first response information according to the identity information of the target object includes:

in the case that the identity information is first identity information, querying reserved attribute information of the target object based on the first identity information;

and determining the first response information according to the reserved attribute information.

According to the data processing method provided by the embodiment of the disclosure, the first recommended product to be recommended can be determined according to the reserved attribute information of the target object, and the product recommendation information of the first recommended product is used as the first response information, so that marketing recommendation can be performed more accurately, marketing accuracy is improved, and user experience is improved.

under the condition that the identity information is second identity information, performing attribute identification on the face image of the target object to obtain attribute information of the face image;

and determining the first response information according to the attribute information of the face image.

According to the data processing method provided by the embodiment of the disclosure, the first response information can be determined according to the attribute information of the face image of the target object, marketing recommendation can be performed more accurately, and the user experience can be improved while the marketing accuracy is improved.

determining whether preset keywords exist in the voice information;

and under the condition that the preset keywords exist in the voice information, acquiring second response information corresponding to the preset keywords in an intranet response library.

The data processing method provided by the embodiment of the disclosure can quickly and accurately acquire the second response information aiming at the voice information by presetting the keywords, and can improve the response efficiency and precision.

In a possible implementation manner, the preset keywords include a primary preset keyword and a secondary preset keyword associated with the primary preset keyword, the primary preset keyword is used to indicate a service to be awakened, and the secondary preset keyword is used to indicate a service to be awakened.

According to the data processing method provided by the embodiment of the disclosure, the requirements of the user can be determined more accurately in a cascade setting mode, the corresponding second response information is obtained, and the response efficiency and precision can be improved.

In a possible implementation manner, the obtaining, in the intranet response library, second response information corresponding to the preset keyword includes:

under the condition that the voice information comprises the primary preset keywords, acquiring secondary preset keywords related to the primary preset keywords from the intranet response library;

and generating corresponding second response information according to the secondary preset keywords.

The data processing method provided by the embodiment of the disclosure can quickly and accurately acquire the second response information for the voice information in a cascade setting manner, and can improve the response efficiency and precision.

In a possible implementation manner, the obtaining, in the intranet response library, second response information corresponding to the preset keyword further includes:

generating corresponding service confirmation information according to the secondary preset keywords, wherein the service confirmation information is used for confirming whether to awaken the service corresponding to the secondary preset keywords or not;

responding to the confirmation operation aiming at the service confirmation information, and acquiring corresponding service information according to the service type of the secondary preset keyword;

and generating corresponding second response information according to the business service information.

According to the data processing method provided by the embodiment of the disclosure, the business service can be accurately called through the secondary preset keywords, the second response information of the voice information is obtained according to the business service information, and the user is received and guided to conduct business handling through the second response information, so that the response efficiency and precision can be improved, the business handling efficiency of the user can be improved, and the user experience is improved.

In a possible implementation manner, the first-level preset keyword includes a queuing and calling service, the second-level preset keyword associated with the first-level preset keyword includes a service related to the queuing and calling service, and the second response information includes queuing result information.

According to the data processing method provided by the embodiment of the disclosure, the queuing and calling service can be called through the cascaded preset keywords so as to obtain the queuing and calling result, the queuing and calling efficiency of the user can be improved, and further the user experience is improved.

In a possible implementation manner, the obtaining response information corresponding to the user information further includes:

acquiring semantic information of the voice information under the condition that the preset keywords do not exist in the voice information;

and acquiring corresponding second response information in an external network response library according to the semantic information.

The data processing method provided by the embodiment of the disclosure can provide response information irrelevant to the service for the user through the external network response library, so as to ensure the security of the service data while realizing interaction with the user.

In one possible implementation manner, the outputting, by the electronic device, the response information includes:

outputting second response information through the electronic equipment;

and outputting the first response information through the electronic equipment when the display condition of the first response information is met.

The data processing method provided by the embodiment of the disclosure can select a proper channel and output the response information in time based on the occupation conditions of different output channels of the electronic equipment under the condition of not influencing the output of contents such as other response information, so that the target object can quickly receive the response information.

In one possible implementation manner, the response message includes at least one of voice message, text message, picture message, video message, and animation message.

According to an aspect of the present disclosure, there is provided a data processing apparatus including:

the system comprises a first acquisition module, a second acquisition module and a third acquisition module, wherein the first acquisition module is used for acquiring a video image, and the video image is at least one frame of image in a video stream obtained by shooting a service area through image acquisition equipment;

the detection module is used for carrying out target detection based on the video image;

the second acquisition module is used for responding to the detection result representation of the target detection to detect the target object and acquiring the user information of the target object;

and the output module is used for acquiring the response information corresponding to the user information and outputting the response information through the electronic equipment.

In a possible implementation manner, the user information includes a face image of the target object, and the output module is further configured to:

In a possible implementation manner, the output module is further configured to:

In a possible implementation manner, the user information includes voice information of the target object, and the output module is further configured to:

determining whether preset keywords exist in the voice information;

In a possible implementation manner, the user information includes a face image of the target object and speech information of the target object, and the output module is further configured to:

outputting second response information through the electronic equipment;

According to an aspect of the present disclosure, there is provided an electronic device including: a processor; a memory for storing processor-executable instructions; wherein the processor is configured to invoke the memory-stored instructions to perform the above-described method.

According to an aspect of the present disclosure, there is provided a computer readable storage medium having stored thereon computer program instructions which, when executed by a processor, implement the above-described method.

In the embodiment of the present disclosure, a video image may be acquired, and target detection may be performed based on the video image. Furthermore, the target object can be detected in response to the detection result representation of the target detection, the user information of the target object is obtained, the response information corresponding to the user information is obtained, and the response information is displayed through the output of the electronic equipment. According to the data processing method and device, the electronic device and the storage medium, the user can be received through the response information output by the electronic device, and the user is guided to conduct related business transaction, so that the labor cost can be saved, the business transaction efficiency of the user is improved, and the user experience is improved.

It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure. Other features and aspects of the present disclosure will become apparent from the following detailed description of exemplary embodiments, which proceeds with reference to the accompanying drawings.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and, together with the description, serve to explain the principles of the disclosure.

FIG. 1 shows a flow diagram of a data processing method according to an embodiment of the present disclosure;

FIG. 2 shows a schematic diagram of a data processing method according to an embodiment of the present disclosure;

FIG. 3 shows a schematic diagram of a data processing method according to an embodiment of the present disclosure;

FIG. 4 shows a schematic diagram of a data processing method according to an embodiment of the present disclosure;

FIG. 5 shows a schematic diagram of a data processing method according to an embodiment of the present disclosure;

FIG. 6 shows a schematic diagram of a data processing method according to an embodiment of the present disclosure;

FIG. 7 shows a schematic diagram of a data processing method according to an embodiment of the present disclosure;

FIG. 8 shows a schematic diagram of a data processing method according to an embodiment of the present disclosure;

FIG. 9 shows a block diagram of a data processing apparatus according to an embodiment of the present disclosure;

FIG. 10 shows a block diagram of an electronic device 1000 in accordance with an embodiment of the disclosure;

fig. 11 shows a block diagram of an electronic device 1900 according to an embodiment of the disclosure.

Detailed Description

Various exemplary embodiments, features and aspects of the present disclosure will be described in detail below with reference to the accompanying drawings. In the drawings, like reference numbers can indicate functionally identical or similar elements. While the various aspects of the embodiments are presented in drawings, the drawings are not necessarily drawn to scale unless specifically indicated.

The word "exemplary" is used exclusively herein to mean "serving as an example, embodiment, or illustration. Any embodiment described herein as "exemplary" is not necessarily to be construed as preferred or advantageous over other embodiments.

The term "and/or" herein is merely an association describing an associated object, meaning that three relationships may exist, e.g., a and/or B, may mean: a exists alone, A and B exist simultaneously, and B exists alone. In addition, the term "at least one" herein means any one of a plurality or any combination of at least two of a plurality, for example, including at least one of A, B, C, and may mean including any one or more elements selected from the group consisting of A, B and C.

Furthermore, in the following detailed description, numerous specific details are set forth in order to provide a better understanding of the present disclosure. It will be understood by those skilled in the art that the present disclosure may be practiced without some of these specific details. In some instances, methods, means, elements and circuits that are well known to those skilled in the art have not been described in detail so as not to obscure the present disclosure.

The embodiment of the disclosure provides a data processing method, which includes acquiring a video image through an image acquisition device, and acquiring user information of a target object under the condition that a target object is detected by a target detection result representation of the video image, wherein the user information may include face image information of the target object and/or voice information of the target object. The response information corresponding to the user information can be acquired according to the user information of the target object, and the response information is output through the electronic equipment so as to receive the target object and guide the target object to conduct business handling.

For example, in the application scenario of a bank, the electronic device may be placed in a service area of the bank. When a user walks into an image acquisition area of the electronic equipment, the video image acquired by the image acquisition equipment comprises the user, and then when the video image is subjected to target detection, a target detection result represents that a target object is detected. The electronic device may obtain user information of the target object, for example: acquiring a face image and/or voice information of a user, and acquiring corresponding response information according to the user information, for example: and acquiring product recommendation information aiming at the user according to the face image of the user, or acquiring response information aiming at the voice information of the user according to the voice information of the user. The response information is output through the electronic equipment so as to receive the user and guide the user to handle related services, so that the labor cost can be saved, the service handling efficiency of the user is improved, and the user experience is improved.

Fig. 1 shows a flowchart of a data processing method according to an embodiment of the present disclosure, which may be performed by an electronic device such as a terminal device or a server, the terminal device may be a User Equipment (UE), a mobile device, a User terminal, a cellular phone, a cordless phone, a Personal Digital Assistant (PDA), a handheld device, a computing device, a vehicle-mounted device, a wearable device, or the like, and the method may be implemented by a processor calling a computer-readable instruction stored in a memory. Alternatively, the method may be performed by a server.

As shown in fig. 1, the data processing method may include:

in step S11, a video image is obtained, where the video image is at least one frame of image in a video stream obtained by shooting a service area through an image capturing device;

in step S12, object detection is performed based on the video image;

in step S13, in response to that the detection result of the target detection indicates that a target object is detected, acquiring user information of the target object;

in step S14, response information corresponding to the user information is acquired, and the response information is output by the electronic device.

For example, the service area may be a preset area, an electronic device may be set in the area, a video stream of the service area is acquired in real time by an image acquisition device, and at least one frame of image is obtained from the video stream as a video image by performing operations such as frame extraction on the video stream. Target detection can be performed on the video image to detect whether a human face exists in the video image. When a face or a human body is detected in the video image, the obtained detection result can be used for representing a detected target object, and the target object is a user corresponding to the detected face or human body. In the case that the human face and the human body are not detected in the video image, a prompt message "please stand in the middle of the screen" may be displayed in a prompt message display area in a display interface of the electronic device, as shown in fig. 2.

When the detection result represents that the target object is detected, user information of the target object may be acquired, where the user information may include information that can distinguish the target object from other objects, such as a face image of the target object and/or voice information of the target object. For example: the face image can be intercepted from the video image to serve as the user information of the target object, and/or the voice information of the target object can be collected through the audio equipment to serve as the user information of the target object. The image capturing device and the audio device may be integrated in an electronic device, or may be an external device of the electronic device, which is not specifically limited in this disclosure.

After the user information of the target object is obtained, response information corresponding to the user information can be obtained, and the response information can include at least one of information in multiple expression forms such as voice information, character information, picture information, video information, animation information and the like.

In one possible implementation, the response information includes at least one of first response information and second response information; the first response information includes product recommendation information for the target object, and the second response information includes response information for the user information.

For example, the identity information of the target object may be determined according to the user information, and then the product recommendation information for the target object is obtained according to the identity information of the target object, where the product recommendation information is the first response information, or the corresponding response information may be obtained from the response library according to a keyword included in the user information, where the response information is the second response information, for example: when the user information is voice information for consulting, second response information corresponding to the consulting content can be obtained in the response library according to the keywords in the voice information.

After the response information corresponding to the user information is obtained, the response information can be output through the electronic device, and the method includes: and displaying the response information through a display interface of the electronic equipment and/or playing the response information through the audio equipment. For example: the response information comprises text information, picture information and voice information, so that the text information and the picture information can be displayed through a display interface of the electronic equipment, and the voice information can be played through the audio equipment. The corresponding output mode can be selected according to the expression form of the response information, so that the response information can be output together in different output modes, and the target object can receive the response information through various output channels.

In a possible implementation manner, after the response information is obtained, part of the content of the response information may be selected for display according to the occupation situation of the electronic device resource. For example: under the condition that both a display interface and audio equipment of the electronic equipment are idle, response information can be output through the display interface and the audio equipment; under the condition that a display interface of the electronic equipment is occupied and the audio equipment is idle, the voice information in the response information can be played through the audio equipment; under the condition that a display interface of the electronic equipment is idle and the audio equipment is occupied, character information, picture information, video information, animation information and the like in the response information can be displayed through the display interface; under the condition that the display interface and the audio equipment of the electronic equipment are occupied, the response information can be output in a suspending way until the display interface and/or the audio equipment of the electronic equipment are idle, and then the response information is output. Namely, under the condition of not influencing the output of contents such as other response information, the appropriate channel is selected based on the occupation conditions of different output channels of the electronic equipment, and the response information is output in time, so that the target object can quickly receive the response information.

The embodiment of the disclosure can acquire a video image and perform target detection based on the video image. Furthermore, the target object can be detected in response to the detection result representation of the target detection, the user information of the target object is obtained, the response information corresponding to the user information is obtained, and the response information is displayed through the output of the electronic equipment. According to the data processing method provided by the embodiment of the disclosure, the user can be received through the response information output by the electronic equipment, and the user is guided to perform related business handling, so that the labor cost can be saved, the business handling efficiency of the user is improved, and the user experience is improved.

In a possible implementation manner, the user information may include a face image of the target object, and the obtaining of the corresponding response information according to the user information may include:

For example, the user information may include a face image of the target object, and face recognition may be performed according to the face image to determine the identity information of the target object. The identity information may represent the identity of the target object, for example, when the identity information is the first identity information, the target object may be represented as a historical user, or when the identity information is the second identity information, the target object may be represented as a new user.

After obtaining the identity information of the target object, corresponding first response information may be obtained according to the identity information of the target object, for example: for a historical user, the related information (for example, user identity information, historical transaction information, etc.) reserved in the system by the historical user can be obtained. The method comprises the steps of determining a first recommended product suitable for a historical user and/or higher in acceptance of the historical user by analyzing related information reserved by the historical user, and further obtaining product recommendation information of the first recommended product as first response information. Or, for the new user, if there is no reserved related information in the system, the analysis result may be obtained by analyzing the face image of the new user (the analysis result may include attribute information of the new user, and the attribute information may include, but is not limited to, age, gender, character, and the like), and then the second recommended product suitable for the new user may be determined according to the analysis result, and the product recommendation information of the second recommended product is obtained as the first response information, or a third recommended product may be preset, and after the target object is determined to be the new user, the product recommendation information of the third recommended product may be directly obtained and used as the first response information.

In order that those skilled in the art will better understand the embodiments of the present disclosure, the embodiments of the present disclosure are described below by way of specific examples. For example, referring to fig. 2, after the facial image is acquired by the image acquisition device, quality evaluation may be performed on the facial image, including but not limited to evaluating the definition of the facial image, the integrity of the face in the facial image, and the like, and marketing service may be performed on the facial image whose quality meets the requirement.

In the marketing service, the identity information of the target object can be determined according to the face image, and then first response information matched with the identity information of the target object can be acquired in an intranet response library through a marketing service interface, wherein the intranet response library can comprise a product to be recommended and product recommendation information of the product to be recommended, and the product recommendation information can include and is not limited to at least one of a product marketing picture, a product marketing animation, a product marketing video, a product marketing tactical text and a product detail two-dimensional code.

For example, the first response information may be output by the electronic device to display and/or play the first response information to the target object. For example: taking the example of showing the response information to the target object, referring to fig. 2 and 3, a pop-up window bubble may be shown in the display interface of the electronic device, and first response information may be shown in the pop-up window bubble (the first response information in fig. 3 may include a product marketing text and a product detail two-dimensional code (content shown in a square area in the pop-up window bubble in fig. 3), where the product marketing text is "mr. you are good and you recommend a xxx product"), and the target object may scan the product detail two-dimensional code through the terminal device to enter the detail interface of the product to be recommended, so as to obtain the detail information of the product to be recommended. Or the first response information can also comprise product detail keywords, and the target object can directly carry out dialogue with the electronic equipment through the detail keywords to obtain the detail information of the product to be recommended; or the product marketing words text in the first response message can be converted into corresponding voice messages and then played through the electronic equipment.

In fig. 3, the content displayed in the upper left corner of the display interface of the electronic device may be used to prompt the user (i.e., the target object) to perform a corresponding operation. For example, in the display interface example on the left side of fig. 3, the user is prompted to "please stand in the middle of the screen" by text presentation, that is, the user is prompted to make the user projection be displayed in the middle area (i.e., the black human-shaped area) of the display interface by adjusting the distance, the orientation, and the like between the user and the electronic device. For another example, in the display interface example on the right side of fig. 3, the user is prompted to "please talk" through text display, that is, the user is prompted to input a password or the like to the electronic device through speaking/playing audio, so as to implement a conversation with the electronic device. It should be noted that the manner for prompting the user to execute the corresponding operation may include, but is not limited to, text output, voice output, and the like, and may be specifically adjusted according to actual scene needs and the like, which is not limited herein.

Therefore, the product recommendation can be performed according to the identity information of the target object, the marketing recommendation can be performed more accurately, and the user experience can be improved while the marketing accuracy is improved.

In a possible implementation manner, the obtaining first response information according to the identity information of the target object may include:

For example, the historical user information base may store reserved attribute information of the historical user and a facial image of the historical user. The face matching can be performed in the historical user information base according to the face image of the target object, when the historical user face image matched with the face image of the target object exists in the historical user information base, the identity information of the target object can be determined to be the first identity information, and under the condition that the identity information of the target object is the first identity information, the target object can be determined to be the historical user. When the similarity between the face image of the target object and the face image of the historical user is greater than the similarity threshold value, the face image of the target object can be determined to be matched with the face image of the historical user.

For example, the reserved attribute information of the target object may be obtained from the historical user face image matched with the face image of the target object in the historical user information base, wherein the reserved attribute information may include, but is not limited to, user name, age, gender, transacted business condition (which may include deposit amount, loan amount, purchased product condition, etc.), risk tolerance, liveness, credit rating, and other information. Further, the reserved attribute information of the target object and each product can be analyzed, so that a first recommended product suitable for the target object is determined, and the product recommendation information of the first recommended product is acquired as first response information for the target object.

For example: after the information of the business condition, the risk tolerance, the liveness, the credit level and the like of the target object is obtained, a product similar to the purchased product of the target object can be determined as a first recommended product, or a product with the risk within the range of the risk tolerance of the target object and matched with the credit level, the deposit amount and the credit amount of the user can be determined as the first recommended product according to the credit level, the deposit amount, the credit amount and the risk tolerance of the target object.

It should be noted that, in the embodiment of the present disclosure, the manner of determining the first recommended product according to the reserved attribute information of the target object is only an example of the embodiment of the present disclosure, and is not to be understood as a limitation on the manner of determining the first recommended product, and the embodiment of the present disclosure does not make any limitation on the policy of determining the first recommended product according to the reserved attribute information.

Therefore, the first recommended product to be recommended can be determined according to the reserved attribute information of the target object, the product recommendation information of the first recommended product is used as the first response information, marketing recommendation can be performed more accurately, marketing accuracy is improved, and user experience can be improved.

For example, face matching may be performed in the historical user information base according to the face image of the target object, and when the historical user face image matched with the face image of the target object does not exist in the historical user information base, it may be determined that the identity information of the target object is the second identity information.

For example, when the identity information of the target object is the second identity information, it may be determined that the target object is a new user, that is, the reserved attribute information of the target object is not stored in the historical user information base, and the product recommendation cannot be performed on the target object through the reserved attribute information. The face image of the target object can be identified to obtain the attribute information of the face image, a second recommended product suitable for the target object is analyzed according to the attribute information of the target object, and the product recommendation information of the second recommended product is obtained to serve as first response information so as to carry out product marketing recommendation on the target object. The attribute information of the face image may include, but is not limited to, age, sex, expression, and the like of the target object.

For example: according to the embodiment of the disclosure, information such as age range, gender and the like of a user purchasing each product to be marketed can be counted and analyzed in advance according to historical selling information and historical consulting information of each product to be marketed, and the age range, the gender and the like suitable for each product to be marketed are preset according to a counting and analyzing result, and after the information such as age, gender and the like of a target object is identified, the product to be marketed matched with the information such as age, gender and the like of the target object can be determined to be a second recommended product; or, the information such as the risk tolerance of the target object may be analyzed according to the information such as the age, sex, and expression of the user, and then the second recommended product may be determined according to the risk tolerance of the target object, for example: the attribute information identifying the target object includes: if the age is over 50 years old, the gender is female, and the expression is serious, the conclusion that the user is cautious in nature and low in risk tolerance can be obtained by analyzing the attribute information of the target object, so that a product with low risk can be used as a second recommended product.

It should be noted that, the manner of determining the second recommended product according to the attribute information of the face image in the embodiment of the present disclosure is only an example of the embodiment of the present disclosure, and is not to be understood as a limitation on the manner of determining the second recommended product in the embodiment of the present disclosure, and the embodiment of the present disclosure does not make any limitation on the policy of determining the second recommended product according to the attribute information of the face image.

Therefore, the first response information can be determined according to the attribute information of the face image of the target object, marketing recommendation can be performed more accurately, marketing accuracy is improved, and user experience can be improved.

In a possible implementation manner, the user information may include voice information of the target object, and the obtaining of the corresponding response information according to the user information may include:

determining whether preset keywords exist in the voice information;

For example, the voice information of the target object may be collected in real time through the audio device, when the voice information is collected, voice recognition may be performed on the voice information, and after the voice information is recognized as text information, it is determined whether a preset keyword exists in the text information. The preset keywords comprise preset keywords related to services or services, and the intranet response library can store the preset keywords and second response information corresponding to the preset keywords.

And under the condition that the preset keywords exist in the text information, second response information corresponding to the preset keywords can be obtained in the intranet response library. For example: when the keyword is a large amount deposit, the corresponding second response information may be a large amount deposit process, or when the keyword is a purchase financing process, the corresponding second response information may be a purchase financing process, and the like. Therefore, the second response information aiming at the voice information can be quickly and accurately acquired through the preset keywords, and the response efficiency and precision can be improved.

In a possible implementation manner, the preset keywords may include a primary preset keyword and a secondary preset keyword associated with the primary preset keyword, where the primary preset keyword is used to indicate a service to be woken up, and the secondary preset keyword is used to indicate a service to be woken up.

For example, the target object may wake up a corresponding service of the related service through voice information, for example: and performing card opening and numbering, performing deposit and numbering, and the like, cascade setting can be performed on the preset keywords, specifically, the keywords corresponding to the service to be awakened can be set as first-level preset keywords, and the keywords corresponding to the service to be awakened related to the service to be awakened are set as second-level preset keywords associated with the first-level preset keywords. For example, a keyword corresponding to the queuing and calling service may be set as a primary keyword, and a keyword related to a service of the queuing and calling service may be set as a secondary keyword. Therefore, the requirements of the user can be determined more accurately in a cascading setting mode, the corresponding second response information is obtained, and the response efficiency and precision can be improved.

In a possible implementation manner, the obtaining, in the intranet response library, the second response information corresponding to the preset keyword may include:

For example, in the case where the preset keyword is included in the voice information, it may be determined whether the preset keyword is a primary preset keyword or a secondary preset keyword. Under the condition that the voice information comprises the primary preset keywords, the secondary preset keywords related to the primary preset keywords can be obtained from the intranet answer library. The intranet response library can be used for storing information related to the service, and the information includes a first-level preset keyword and a second-level preset keyword associated with the first-level preset keyword.

Further, corresponding second response information may be generated according to a secondary preset keyword associated with the primary preset keyword, where the second response information may be used to guide the target object to wake up a service corresponding to the secondary preset keyword. Illustratively, the second response message may include a query sentence and query content, and the query content may include a secondary preset keyword. When a plurality of secondary preset keywords are associated with the primary preset keyword, the corresponding second response information comprises a plurality of secondary preset keywords.

For example: in the example of the display interface on the left side of fig. 4, the user is prompted to "please stand in the middle of the screen" by text presentation, that is, the user is prompted to project a screen that can be displayed in the middle area (i.e., the black human-shaped area) of the display interface by adjusting the distance, the orientation, and the like between the user and the electronic device. After the target object is identified, guidance information for guiding the user to perform related operations may be output through the electronic device, for example, in the display interface example in the middle of fig. 4, the guidance information may be displayed through characters, and the guidance information may include information such as "help arrange numbers, how to purchase money, how to get money", and the like. When the collected voice information of the target object is 'help arrange number', the voice information of the target object can be determined to comprise a primary preset keyword 'queue', a secondary preset keyword 'deposit, withdrawal and open card' associated with the primary preset keyword can be obtained, second response information can be generated according to the secondary preset keyword, and the second response information is output through the electronic equipment. For example, in the display interface example on the right side of fig. 4, the second response message displayed by text may include: a query sentence (ask what business you want to transact.

In a possible implementation manner, the obtaining, in the intranet response library, the second response information corresponding to the preset keyword may further include:

For example, when the voice information includes the second-level preset keyword, the service confirmation information may be generated according to the second-level preset keyword, where the service confirmation information may include the second-level preset keyword, and the service confirmation information may be used to confirm whether to wake up a service corresponding to the second-level preset keyword.

For example: the voice message includes a secondary preset keyword "deposit", and then a corresponding service confirmation message "do you need to handle deposit service? ", and may be presented via an electronic device output. Further, in response to a confirmation operation of the target object for the service confirmation information (for example, receiving a confirmation voice instruction sent by the target object or recognizing a confirmation gesture of the target object), a service corresponding to the primary preset keyword associated with the secondary preset keyword may be awakened for the service corresponding to the secondary preset keyword to obtain corresponding service information, and corresponding second response information may be generated according to the service information. The service information may be a service result obtained after the corresponding service is executed for the corresponding service.

Illustratively, a service system corresponding to the secondary preset keyword may be called according to the secondary preset keyword, corresponding service information may be obtained after the service system executes corresponding service, and corresponding second response information may be generated according to the service information, where the second response information may include the service information.

Therefore, the data processing method provided by the embodiment of the disclosure can accurately call the business service through the secondary preset keywords, further obtain the second response information of the voice information according to the business service information, receive and guide the user to conduct business transaction through the second response information, and not only can improve the response efficiency and precision, but also can improve the business transaction efficiency of the user and improve the user experience.

For example, the primary preset keyword may include a queuing service, and the queuing service may correspond to a plurality of primary preset keywords, for example: the service to be awakened indicated by the primary preset keywords such as number calling, number arranging, queuing and the like is a queuing number calling service. The second preset keywords associated with the first preset keywords include services related to queuing and number calling services, such as: the second response information may include queuing result information, which may include information such as a queuing number, a service transaction window, a predicted waiting duration, and the like of the current service transaction. The first-level preset keywords and the second-level preset keywords associated with the first-level preset keywords can be set according to requirements, and specific contents of the first-level preset keywords and the second-level preset keywords associated with the first-level preset keywords are not limited in the embodiment of the disclosure.

And under the condition that the voice information of the target object hits the queuing and calling service, the service related to the queuing and calling service and related to the queuing and calling service can be obtained, and after second response information is generated according to the service related to the queuing and calling service, the second response information can be output through the electronic equipment. For example: referring to fig. 5, in the display interface example on the left side of fig. 5, in the case that the target object hits the queuing number-calling service through the voice information "help queuing number", the "asking which service to transact? Meanwhile, services (deposit, withdrawal, card transaction and the like) related to the queuing and calling service are displayed on a display interface so as to guide the target object to further output voice information including the service to be handled, and further the queuing and calling service of the service can be carried out according to the voice information including the service to be handled.

For another example, in the display interface example in the middle of fig. 5, the user is prompted to "please talk" through text display, that is, the user is prompted to input a password or the like to the electronic device through speaking/playing audio, so as to implement a conversation with the electronic device. When receiving 'deposit' of a password input by a target object, determining that the voice information of the target object comprises a secondary preset keyword 'deposit', and generating corresponding service confirmation information 'do you need to handle deposit service'. The service interface of the queuing and calling service of the service (deposit in the example) corresponding to the secondary preset keyword can be called to perform queuing and calling in response to the confirmation operation of the target object for the service information, the service information is generated according to the queuing result information, and the corresponding second response information is generated according to the service information. Referring to an example of a display interface on the right side of fig. 5, the service information is a queuing number of 198, 10 people in front are queuing, the estimated queuing time is 30 minutes, the generated and displayed second response information is "hello, the current queuing number of your is 198, 10 people in front are queuing, the estimated queuing time is 30 minutes, and please scan a code to obtain a queuing voucher".

And outputting and displaying the second response information through the electronic equipment, including displaying the second response information in a display interface, performing voice synthesis on the second response information to obtain voice response information corresponding to the second response information, and playing the voice response information through the audio equipment. And the two-dimensional code information corresponding to the queuing result information can be displayed, and the target object can scan the two-dimensional code information through the terminal equipment to obtain the corresponding queuing result information.

Illustratively, the voice information of the target object is collected in real time, and if new voice information of the target object is received during the period of displaying the second response information, the new voice information of the target object can be processed, and response information aiming at the new voice information is output through the electronic equipment. After the second response information is displayed for the first preset time, or the target object is not detected within the second preset time, the queuing process can be ended, and the second response information is not displayed any more.

Therefore, the data processing method provided by the embodiment of the disclosure can call the queuing and calling service through the cascaded preset keywords to obtain the queuing and calling result, can improve the queuing and calling efficiency of the user, and further improves the user experience.

In a possible implementation manner, the obtaining the corresponding response information according to the user information may further include:

For example, referring to fig. 6 and 7, in a case that a preset keyword does not exist in the voice information, semantic analysis may be performed on the voice information to obtain semantic information corresponding to the voice information, and second response information corresponding to the semantic information may be searched in an extranet response library through an extranet interface (token verification may be performed in a process of searching for the second response information, which is not described much in the embodiment of the present disclosure), where the extranet response library may be used to store response information unrelated to a service. For example: the target object may ask "how is the weather today? "," what restaurants are in the vicinity? "," how to walk at the nearest bus stop in the vicinity? "or the target object may propose a suggestion or give an evaluation to the service, for example," suggest a bank air conditioner temperature is increased a little, "" service is very good, "thank you" and so on, and may search the corresponding response information in the external network response library according to the semantic information.

After the second response information is obtained, whether the current second response information is used for indicating to queue and call numbers or not can be judged, and the second response information can be output and displayed through the electronic equipment under the condition that the current second response information is not used for indicating to queue and call numbers. For example, as shown in fig. 6, the image information, the animation information, the video information, and the like in the second response information are displayed through the display interface of the electronic device, the text information in the second response information is converted into the voice information through the voice synthesis interface, and then the text information is played through the audio device of the electronic device, or the motion to be displayed of the electronic device may be determined through the voice information, and the motion to be displayed is displayed by the electronic device while the voice information is played.

When the corresponding response information is not found in the extranet response library, guidance information for guiding the user to perform the relevant operation may be output through the electronic device (specifically, refer to fig. 4).

And under the condition that the current second response information is used for indicating queuing and calling, and the second response information comprises at least one second preset keyword, the at least one second preset keyword can be displayed, a service interface of a service corresponding to the second preset keyword is called to perform a calling system in response to the second preset keyword hit by the target object, so that queuing result information is obtained, corresponding two-dimensional code information can be generated according to the queuing result information, new second response information is generated according to the queuing result information and the two-dimensional code information, and the new second response information is output and displayed through the electronic equipment.

Therefore, the data processing method provided by the embodiment of the disclosure can provide response information irrelevant to the service for the user through the extranet response library, so as to ensure the security of the service data while realizing interaction with the user.

In a possible implementation manner, the user information includes a face image of the target object and voice information of the target object, and the outputting the response information by the electronic device may include:

outputting, by the electronic device, the second response information;

For example, since the second response information includes response information for the target object and the first response information includes product recommendation information for the target object, the second response information may be preferentially output by the electronic device and the first response information may be preferentially output, so as to improve user experience.

For example, after the second response information is output, the time for displaying the first response information may be determined according to the occupation of the output resource of the electronic device by the second response information. For example: and under the condition that the second response information does not occupy all output resources of the electronic equipment or the second response information releases the output resources of the electronic equipment, determining that the display condition of the first response information is met, and outputting and displaying part or all of the first response information through the idle output resources of the electronic equipment.

Exemplarily, in the case that the second response information only occupies the audio device of the electronic device, a part of information that can be displayed through the display interface in the first response information can be displayed through the display interface; or, under the condition that the second response information only occupies the display interface of the electronic device, outputting part of information which can be displayed through the audio device in the first response information through the audio device; alternatively, when the second response message occupies all the output resources of the electronic device, the output to the first electronic device may be suspended until the second response message releases all or part of the output resources of the electronic device, and all or part of the first response message is output through all or part of the output resources of the electronic device.

It should be noted that, in the process of displaying the first response information, no influence is generated on other business operations, the target object may continue to perform operations such as inquiry, number calling, and the like, and when there is new second response information to be output, the display of the first response information may be suspended, the second response information is preferentially displayed, and when the display condition of the first response information is satisfied, the first response information is output through the electronic device. Therefore, the data processing method provided by the embodiment of the disclosure can select a proper channel and output the response information in time based on the occupation conditions of different output channels of the electronic equipment under the condition of not influencing the output of contents such as other response information, so that the target object can quickly receive the response information.

In order that those skilled in the art will better understand the embodiments of the present disclosure, the embodiments of the present disclosure are described below by way of specific examples.

Illustratively, referring to fig. 8, when the target object reaches the service area, a face image of the video target object may be captured by the image capturing device and voice information of the target object may be captured in real time by the audio device. Under the condition that the face image of the target object is collected, the identity information of the target object can be determined according to the face image of the target object, and first response information is obtained according to the identity information of the target object, wherein the first response information comprises product recommendation information aiming at the target object. The first response information may be output by the electronic device to present the first response information to the target object. And under the condition that the target object wants to know the product recommendation information details, the product recommendation information details can be acquired through voice interaction.

Under the condition that the voice information of the target object is collected, the voice information can be subjected to voice recognition, the voice information is converted into text information, and whether preset keywords exist in the text information or not is recognized. In the case that the preset keyword exists in the text information, the second response information corresponding to the preset keyword may be acquired, for example: and under the condition that a second-level preset keyword associated with the queuing and calling service exists in the text information, the queuing and calling service of the service to be awakened and indicated by the second-level preset keyword can be performed, after a queuing and calling result is obtained, second response information is generated according to the queuing and calling result, and the second response information is output and displayed through the electronic equipment, so that the queuing and calling process of the target object is completed.

Or, under the condition that the preset keyword corresponding to the product to be recommended exists in the text information, the product recommendation information details of the product to be recommended can be acquired according to the preset keyword to serve as second response information, and the second response information is output and displayed through the electronic equipment, so that the target object can know the product to be recommended.

The embodiment of the disclosure can be applied to the scenes that a user needs to be accepted, the user is guided to carry out business operation and the product marketing is carried out on the user, and by carrying out business integration design, each basic business is connected in series based on two main flows of vision and voice, the business switching is natural and reasonable, and the use experience is smooth.

Taking a bank scene as an example, the embodiment of the disclosure can be applied to a bank outlet for reception and marketing, and in terms of reception and service guidance, the embodiment of the disclosure can replace the original reception service mode of manual reception, manual number taking or single-function number calling machine, gives a user a reception guidance experience with more scientific and technological sense, and further simplifies personnel while improving service type service experience. In the aspect of marketing, the user identity can be identified by combining a face identification technology, the multimedia form recommendation of products is carried out by combining historical reserved attribute information and face attributes of users, related products can be recommended more vividly and accurately, meanwhile, the users can acquire product information in modes of code scanning interaction, voice interaction and the like, the accuracy of product touch and the sales conversion rate can be effectively improved by matching network operators, and the lightweight and intelligent transformation of the off-line bank network can be promoted.

It is understood that the above-mentioned method embodiments of the present disclosure can be combined with each other to form a combined embodiment without departing from the logic of the principle, which is limited by the space, and the detailed description of the present disclosure is omitted. Those skilled in the art will appreciate that in the above methods of the specific embodiments, the specific order of execution of the steps should be determined by their function and possibly their inherent logic.

In addition, the present disclosure also provides a data processing apparatus, an electronic device, a computer-readable storage medium, and a program, which can be used to implement any data processing method provided by the present disclosure, and the corresponding technical solutions and descriptions and corresponding descriptions in the method section are not repeated.

Fig. 9 shows a block diagram of a data processing apparatus according to an embodiment of the present disclosure, which, as shown in fig. 9, includes:

the first obtaining module 91 may be configured to obtain a video image, where the video image is at least one frame of image in a video stream obtained by shooting a service area through an image capturing device;

a detection module 92, operable to perform object detection based on the video image;

a second obtaining module 93, configured to obtain user information of a target object in response to a detection result representing that the target object is detected;

the output module 94 may be configured to obtain response information corresponding to the user information, and output the response information through an electronic device.

In the embodiment of the present disclosure, a video image may be acquired, and target detection may be performed based on the video image. Furthermore, the target object can be detected in response to the detection result representation of the target detection, the user information of the target object is obtained, the response information corresponding to the user information is obtained, and the response information is displayed through the output of the electronic equipment. The data processing device provided by the embodiment of the disclosure can receive the user through the response information output by the electronic equipment and guide the user to perform related business handling, so that the labor cost can be saved, the business handling efficiency of the user can be improved, and the user experience can be improved.

In one possible implementation, the response information may include at least one of first response information and second response information; the first response information may include product recommendation information for the target object; the second response message includes response message for the user message.

In a possible implementation manner, the user information may include a face image of the target object, and the output module 94 is further configured to:

determining the identity information of the target object according to the face image; and acquiring first response information according to the identity information of the target object.

In a possible implementation manner, the output module 94 may be further configured to:

in the case that the identity information is first identity information, querying reserved attribute information of the target object based on the first identity information; and determining the first response information according to the reserved attribute information.

under the condition that the identity information is second identity information, performing attribute identification on the face image of the target object to obtain attribute information of the face image; and determining the first response information according to the attribute information of the face image.

In a possible implementation manner, the user information includes voice information of the target object, and the output module 94 is further configured to:

determining whether preset keywords exist in the voice information;

In a possible implementation manner, the preset keywords may include a primary preset keyword and a secondary preset keyword associated with the primary preset keyword, the primary preset keyword may be used to indicate a service to be awakened, and the secondary preset keyword may be used to indicate a service to be awakened.

In a possible implementation manner, the primary preset keyword may include a queuing and calling service, the secondary preset keyword associated with the primary preset keyword may include a service related to the queuing and calling service, and the second response information may include queuing result information.

In a possible implementation manner, the user information may include a face image of the target object and voice information of the target object, and the output module 94 is further configured to:

outputting second response information through the electronic equipment;

In one possible implementation, the response message may include at least one of voice message, text message, picture message, video message, and animation message. In some embodiments, functions of or modules included in the apparatus provided in the embodiments of the present disclosure may be used to execute the method described in the above method embodiments, and specific implementation thereof may refer to the description of the above method embodiments, and for brevity, will not be described again here.

Embodiments of the present disclosure also provide a computer-readable storage medium having stored thereon computer program instructions, which when executed by a processor, implement the above-mentioned method. The computer readable storage medium may be a non-volatile computer readable storage medium.

An embodiment of the present disclosure further provides an electronic device, including: a processor; a memory for storing processor-executable instructions; wherein the processor is configured to invoke the memory-stored instructions to perform the above-described method.

The embodiments of the present disclosure also provide a computer program product, which includes computer readable code, and when the computer readable code runs on a device, a processor in the device executes instructions for implementing the data processing method provided in any one of the above embodiments.

The embodiments of the present disclosure also provide another computer program product for storing computer readable instructions, which when executed cause a computer to perform the operations of the data processing method provided in any of the above embodiments.

The electronic device may be provided as a terminal, server, or other form of device.

Fig. 10 shows a block diagram of an electronic device 1000 according to an embodiment of the disclosure. For example, the electronic device 1000 may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, or the like terminal.

Referring to fig. 10, electronic device 1000 may include one or more of the following components: processing component 1002, memory 1004, power component 1006, multimedia component 1008, audio component 1010, input/output (I/O) interface 1012, sensor component 1014, and communications component 1016.

The processing component 1002 generally controls overall operation of the electronic device 1000, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing components 1002 may include one or more processors 1020 to execute instructions to perform all or a portion of the steps of the methods described above. Further, processing component 1002 may include one or more modules that facilitate interaction between processing component 1002 and other components. For example, the processing component 1002 may include a multimedia module to facilitate interaction between the multimedia component 1008 and the processing component 1002.

The memory 1004 is configured to store various types of data to support operations at the electronic device 1000. Examples of such data include instructions for any application or method operating on the electronic device 1000, contact data, phonebook data, messages, pictures, videos, and so forth. The memory 1004 may be implemented by any type or combination of volatile or non-volatile memory devices such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disks.

The power supply component 1006 provides power to the various components of the electronic device 1000. The power components 1006 may include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for the electronic device 1000.

The multimedia component 1008 includes a screen that provides an output interface between the electronic device 1000 and a user. In some embodiments, the screen may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 1008 includes a front facing camera and/or a rear facing camera. The front camera and/or the rear camera may receive external multimedia data when the electronic device 1000 is in an operating mode, such as a shooting mode or a video mode. Each front camera and rear camera may be a fixed optical lens system or have a focal length and optical zoom capability.

The audio component 1010 is configured to output and/or input audio signals. For example, the audio component 1010 may include a Microphone (MIC) configured to receive external audio signals when the electronic device 1000 is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode. The received audio signals may further be stored in the memory 804 or transmitted via the communication component 1016. In some embodiments, audio component 1010 also includes a speaker for outputting audio signals.

I/O interface 1012 provides an interface between processing component 1002 and peripheral interface modules, which may be keyboards, click wheels, buttons, etc. These buttons may include, but are not limited to: a home button, a volume button, a start button, and a lock button.

The sensor assembly 1014 includes one or more sensors for providing various aspects of status assessment for the electronic device 1000. For example, the sensor assembly 1014 may detect an open/closed state of the electronic device 1000, the relative positioning of components, such as a display and keypad of the electronic device 1000, the sensor assembly 1014 may also detect a change in position of the electronic device 1000 or a component of the electronic device 1000, the presence or absence of user contact with the electronic device 1000, orientation or acceleration/deceleration of the electronic device 1000, and a change in temperature of the electronic device 1000. The sensor assembly 1014 may include a proximity sensor configured to detect the presence of a nearby object without any physical contact. The sensor assembly 1014 may also include a light sensor, such as a Complementary Metal Oxide Semiconductor (CMOS) or Charge Coupled Device (CCD) image sensor, for use in imaging applications. In some embodiments, the sensor assembly 1014 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.

The communication component 1016 is configured to facilitate wired or wireless communication between the electronic device 1000 and other devices. The electronic device 1000 may access a wireless network based on a communication standard, such as a wireless network (WiFi), a second generation mobile communication technology (2G) or a third generation mobile communication technology (3G), or a combination thereof. In an exemplary embodiment, the communication component 1016 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communications component 1016 further includes a Near Field Communication (NFC) module to facilitate short-range communications. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, infrared data association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.

In an exemplary embodiment, the electronic device 1000 may be implemented by one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), controllers, micro-controllers, microprocessors, or other electronic components for performing the above-described methods.

In an exemplary embodiment, a non-transitory computer-readable storage medium, such as the memory 1004, is also provided that includes computer program instructions executable by the processor 1020 of the electronic device 1000 to perform the above-described methods.

Fig. 11 shows a block diagram of an electronic device 1900 according to an embodiment of the disclosure. For example, the electronic device 1900 may be provided as a server. Referring to fig. 11, electronic device 1900 includes a processing component 1922 further including one or more processors and memory resources, represented by memory 1932, for storing instructions, e.g., applications, executable by processing component 1922. The application programs stored in memory 1932 may include one or more modules that each correspond to a set of instructions. Further, the processing component 1922 is configured to execute instructions to perform the above-described method.

The electronic device 1900 may also include a power component 1926 configured to perform power management of the electronic device 1900, a wired or wireless network interface 1950 configured to connect the electronic device 1900 to a network, and an input/output (I/O) interface 1958. The electronic device 1900 may operate based on an operating system, such as the Microsoft Server operating system (Windows Server), stored in the memory 1932^TM) Apple Inc. of the present application based on the graphic user interface operating System (Mac OS X)^TM) Multi-user, multi-process computer operating system (Unix)^TM) Free and open native code Unix-like operating System (Linux)^TM) Open native code Unix-like operating System (FreeBSD)^TM) Or the like.

In an exemplary embodiment, a non-transitory computer readable storage medium, such as the memory 1932, is also provided that includes computer program instructions executable by the processing component 1922 of the electronic device 1900 to perform the above-described methods.

The present disclosure may be systems, methods, and/or computer program products. The computer program product may include a computer-readable storage medium having computer-readable program instructions embodied thereon for causing a processor to implement various aspects of the present disclosure.

The computer readable storage medium may be a tangible device that can hold and store the instructions for use by the instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic memory device, a magnetic memory device, an optical memory device, an electromagnetic memory device, a semiconductor memory device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a Static Random Access Memory (SRAM), a portable compact disc read-only memory (CD-ROM), a Digital Versatile Disc (DVD), a memory stick, a floppy disk, a mechanical coding device, such as punch cards or in-groove projection structures having instructions stored thereon, and any suitable combination of the foregoing. Computer-readable storage media as used herein is not to be construed as transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission medium (e.g., optical pulses through a fiber optic cable), or electrical signals transmitted through electrical wires.

The computer-readable program instructions described herein may be downloaded from a computer-readable storage medium to a respective computing/processing device, or to an external computer or external storage device via a network, such as the internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, fiber optic transmission, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. The network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network and forwards the computer-readable program instructions for storage in a computer-readable storage medium in the respective computing/processing device.

The computer program instructions for carrying out operations of the present disclosure may be assembler instructions, Instruction Set Architecture (ISA) instructions, machine-related instructions, microcode, firmware instructions, state setting data, or source or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The computer-readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider). In some embodiments, the electronic circuitry that can execute the computer-readable program instructions implements aspects of the present disclosure by utilizing the state information of the computer-readable program instructions to personalize the electronic circuitry, such as a programmable logic circuit, a Field Programmable Gate Array (FPGA), or a Programmable Logic Array (PLA).

Various aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer-readable program instructions.

These computer-readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer-readable program instructions may also be stored in a computer-readable storage medium that can direct a computer, programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer-readable medium storing the instructions comprises an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.

The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer, other programmable apparatus or other devices implement the functions/acts specified in the flowchart and/or block diagram block or blocks.

The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.

The computer program product may be embodied in hardware, software or a combination thereof. In an alternative embodiment, the computer program product is embodied in a computer storage medium, and in another alternative embodiment, the computer program product is embodied in a Software product, such as a Software Development Kit (SDK), or the like.

Having described embodiments of the present disclosure, the foregoing description is intended to be exemplary, not exhaustive, and not limited to the disclosed embodiments. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein is chosen in order to best explain the principles of the embodiments, the practical application, or improvements made to the technology in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.

Claims

1. A data processing method, comprising:

performing target detection based on the video image;

2. The method of claim 1, wherein the response information includes at least one of first response information and second response information;

the first response information comprises product recommendation information for the target object;

the second response message includes response message for the user message.

3. The method according to claim 1 or 2, wherein the user information includes a face image of the target object, and the acquiring response information corresponding to the user information includes:

4. The method according to claim 3, wherein the obtaining the first response information according to the identity information of the target object comprises:

5. The method according to claim 3, wherein the obtaining the first response information according to the identity information of the target object comprises:

6. The method according to any one of claims 1 to 5, wherein the user information includes voice information of the target object, and the acquiring response information corresponding to the user information includes:

determining whether preset keywords exist in the voice information;

7. The method according to claim 6, wherein the preset keywords comprise a primary preset keyword and a secondary preset keyword associated with the primary preset keyword, the primary preset keyword is used for indicating a service to be awakened, and the secondary preset keyword is used for indicating a service to be awakened.

8. The method according to claim 7, wherein the obtaining of the second response information corresponding to the preset keyword in the intranet response library includes:

9. The method according to claim 7 or 8, wherein the obtaining of the second response information corresponding to the preset keyword in the intranet response library further comprises:

10. The method according to any one of claims 7 to 9, wherein the primary preset keyword comprises a queuing service, the secondary preset keyword associated with the primary preset keyword comprises a service related to the queuing service, and the second response information comprises queuing result information.

11. The method according to any one of claims 6 to 10, wherein the obtaining response information corresponding to the user information further comprises:

12. The method according to any one of claims 1 to 11, wherein the user information includes a face image of the target object and voice information of the target object, and the outputting of the response information by the electronic device includes:

outputting second response information through the electronic equipment;

13. The method according to any one of claims 1 to 12, wherein the response message comprises at least one of a voice message, a text message, a picture message, a video message, and an animation message.

14. A data processing apparatus, comprising:

15. An electronic device, comprising:

a processor;

a memory for storing processor-executable instructions;

wherein the processor is configured to invoke the memory-stored instructions to perform the method of any one of claims 1 to 13.

16. A computer readable storage medium having computer program instructions stored thereon, which when executed by a processor implement the method of any one of claims 1 to 13.