CN107612815B

CN107612815B - Information sending method, device and equipment

Info

Publication number: CN107612815B
Application number: CN201710848158.8A
Authority: CN
Inventors: 李聪
Original assignee: Beijing Kingsoft Internet Security Software Co Ltd
Current assignee: Beijing Kingsoft Internet Security Software Co Ltd
Priority date: 2017-09-19
Filing date: 2017-09-19
Publication date: 2020-12-25
Anticipated expiration: 2037-09-19
Also published as: CN107612815A

Abstract

The embodiment of the invention provides an information sending method, device and equipment, wherein the method comprises the following steps: acquiring target text information corresponding to voice information used for generating comments by a user; acquiring a target face image in video information used for generating comments by a user; and sending the target text information and the target face image to a server so that the server combines the target text information and the target face image to generate comment information to be published. When the information is sent by the scheme provided by the embodiment of the invention, the target text information and the target face image can be sent to the server, so that the comment information finally generated by the server not only contains the character information, but also contains the face image of the user, and therefore, the comment information not only can express the emotion of the user, but also can enrich the content of the comment information.

Description

Information sending method, device and equipment

Technical Field

The invention relates to the technical field of information processing, in particular to a comment information sending method, device and equipment.

Background

With the rapid development of internet technology, many multimedia applications provide users with an instant comment function, so that users can comment on related content, for example, spit related content, communicate with other users about related content, and the like.

In the prior art, a text input box and an information sending button are arranged in a comment area of an application program, when a user wants to make a comment, text content to be made the comment can be input in the text input box, when the application program receives a comment sending instruction sent by the user, the text content is sent to a server, and after the server receives the text content to be made the comment, the text content is converted into comment information.

However, the inventor finds that the prior art has at least the following problems in the process of implementing the invention: when an existing application program sends information for generating comments to a server, only the text content of the comments to be made by a user is simply sent to the server, so that the generated comment information only contains characters and is single in content.

Disclosure of Invention

The embodiment of the invention aims to provide an information sending method, device and equipment, so that a server can generate comment information with rich content. The specific technical scheme is as follows:

an information sending method applied to an application program is characterized by comprising the following steps:

acquiring target text information corresponding to voice information used for generating comments by a user;

acquiring a target face image in video information used for generating comments by a user;

and sending the target text information and the target face image to a server so that the server combines the target text information and the target face image to generate comment information to be published.

Optionally, the step of obtaining the target text information corresponding to the voice information used by the user to generate the comment includes:

instructing a voice recognition application to collect voice information used by a user to generate a comment, wherein the voice recognition application is: a third party application for performing speech recognition;

and receiving target text information corresponding to the voice information fed back by the voice recognition application program, wherein the target text information is obtained by performing voice recognition on the voice information by the voice recognition application program.

collecting voice information used by a user for generating comments;

and carrying out voice recognition on the voice information to obtain target text information corresponding to the voice information.

Optionally, the method further includes:

in the process of collecting the voice information, setting the terminal where the voice recognition application program is in a mute mode;

and after the voice information is collected, setting the terminal to be in a non-silent mode.

Optionally, the step of obtaining a target face image in video information used by a user to generate a comment includes:

instructing a face recognition application to collect video information used by a user to generate a comment, wherein the face recognition application is: a third party application for face recognition;

and receiving a target face image in the video information fed back by the face recognition application program, wherein the target face image is obtained by carrying out face recognition on the video information by the face recognition application program.

collecting video information used by a user for generating comments;

and carrying out face recognition on the video information to obtain a target face image in the video information.

Optionally, the step of sending the target text information and the target face image to a server includes:

setting the same time stamp for the target text information and the target face image;

sending the text information and the face image with the timestamp to a server;

the comment information to be issued is: and when detecting that the time stamp set by the received text information is consistent with the time stamp set by the face image, the server combines the received text information and the face information to generate comment information.

Optionally, after the step of sending the text information and the face image to a server, the method further includes:

and receiving the comment information fed back by the server, and displaying the comment information.

In another aspect of the present invention, there is provided an information transmitting apparatus applied to an application program, including:

the first acquisition module is used for acquiring target text information corresponding to the voice information used for generating comments by the user;

the second acquisition module is used for acquiring a target face image in the video information used for generating comments by the user;

and the sending module is used for sending the target text information and the target face image to a server so that the server combines the target text information and the target face image to generate comment information to be published.

Optionally, the first obtaining module includes:

the comment processing device comprises a first indicating unit, a voice recognition application program and a comment processing unit, wherein the first indicating unit is used for indicating the voice recognition application program to collect voice information used by a user for generating comments, and the voice recognition application program is as follows: a third party application for performing speech recognition;

a first receiving unit, configured to receive target text information corresponding to the voice information fed back by the voice recognition application, where the target text information is obtained by performing voice recognition on the voice information by the voice recognition application.

Optionally, the first obtaining module includes:

the first acquisition unit is used for acquiring voice information used by a user for generating comments;

and the first identification unit is used for carrying out voice identification on the voice information to obtain target text information corresponding to the voice information.

Optionally, the apparatus further comprises:

the setting module is used for setting the terminal where the voice recognition application program is in a mute mode in the process of acquiring the voice information; and after the voice information is collected, setting the terminal to be in a non-silent mode.

Optionally, the second obtaining module includes:

the second indicating unit is used for indicating a face recognition application program to acquire video information used by a user for generating comments, wherein the face recognition application program is as follows: a third party application for face recognition;

and the second receiving unit is used for receiving a target face image in the video information fed back by the face recognition application program, wherein the target face image is obtained by carrying out face recognition on the video information by the face recognition application program.

Optionally, the second obtaining module includes:

the second acquisition unit is used for acquiring video information used by a user for generating comments;

and the second identification unit is used for carrying out face identification on the video information to obtain a target face image in the video information.

Optionally, the sending module is specifically configured to,

sending the text information and the face image with the timestamp to a server;

Optionally, the apparatus further comprises:

and the display module is used for receiving the comment information fed back by the server and displaying the comment information.

In another aspect of the present invention, an electronic device is further provided, which includes a processor, a communication interface, a memory, and a communication bus, where the processor, the communication interface, and the memory complete communication with each other through the communication bus;

a memory for storing a computer program;

and a processor for implementing any of the above-described information transmission methods when executing the program stored in the memory.

In yet another aspect of the present invention, there is also provided a computer-readable storage medium having stored therein instructions, which when executed on a computer, cause the computer to execute any of the above-described information transmission methods.

In yet another aspect of the present invention, there is also provided a computer program product containing instructions which, when run on a computer, cause the computer to perform the information transmission method described in any one of the above.

The information sending method, the device and the equipment provided by the embodiment of the invention send the target text information acquired by identifying the voice information used by the user for generating the comment and the target face image acquired by identifying the video information used by the user for generating the comment to a server, so that the server generates the comment information containing the face image of the user. When the information is sent by the scheme provided by the embodiment of the invention, the obtained text information is obtained by carrying out voice recognition on the voice information, namely, the user releases the text information for comment in a voice input mode, the voice input mode is more convenient and quicker compared with the modes of character input and the like in the prior art, and the target text information and the target face image are sent to the server, so that the comment information finally generated by the server not only contains the character information but also contains the face image of the user, and the comment information not only can express the emotion of the user, but also can enrich the content of the comment information. Of course, it is not necessary for any product or method of practicing the invention to achieve all of the above-described advantages at the same time.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.

Fig. 1 is a schematic flow chart of an information sending method according to an embodiment of the present invention;

fig. 2 is a schematic structural diagram of an information sending apparatus according to an embodiment of the present invention;

fig. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Referring to fig. 1, a flow chart diagram of an information sending method provided in an embodiment of the present invention is shown, where the method is applied to an application program, and the method specifically includes the following steps:

the application programs referred in the present invention include, but are not limited to, video playing applications, chat applications, and game applications, but all applications having instant comment and chat functions may adopt the information sending method provided in the embodiments of the present invention.

S100, acquiring target text information corresponding to the voice information used for generating comments by the user.

And S110, acquiring a target face image in the video information used for generating the comment by the user.

And S120, sending the target text information and the target face image to a server so that the server combines the target text information and the target face image to generate comment information to be published.

When the information is sent by the scheme provided by the embodiment of the invention, the obtained text information is obtained by carrying out voice recognition on the voice information, namely, the user releases the text information for comment in a voice input mode, the voice input mode is more convenient and quicker compared with the modes of character input and the like in the prior art, and the target text information and the target face image are sent to the server, so that the comment information finally generated by the server not only contains the character information but also contains the face image of the user, and the comment information not only can express the emotion of the user, but also can enrich the content of the comment information.

The step of obtaining the target text information corresponding to the voice information used by the user to generate the comment may have various implementation manners, and the specific implementation manner of the step is described in detail in the following embodiments one to four:

in the first embodiment, after receiving a trigger instruction for a user to issue comment information, an application program invokes a recording element of a terminal where the application program is located, collects voice information used by the user to generate comments, and sends the collected voice information to a cloud recognition server after the collection is completed, the cloud recognition server recognizes the voice information as text information and then feeds the text information back to the application program, and after receiving the text information fed back by the cloud recognition server, the application program determines the text information as target text information corresponding to the voice information used by the user to generate comments.

For example, the recording element may be a microphone or the like.

In a second embodiment, the application instructs the speech recognition application to collect the speech information used by the user to generate the comment, and then receives target text information corresponding to the speech information fed back by the speech recognition application, where the speech recognition application is: and the target text information is obtained by performing voice recognition on the voice information by the voice recognition application program.

That is to say, after receiving a trigger instruction of a user for issuing comment information, the application program invokes a recording element of the terminal where the application program is located, collects voice information used by the user for generating comments, and sends the collected voice information to a third-party voice recognition application program after the collection is completed, the third-party voice recognition application program recognizes the voice information as text information and then feeds the text information back to the application program, and after receiving the text information fed back by the third-party voice recognition application program, the application program determines the text information as target text information corresponding to the voice information used by the user for generating comments.

Because the voice information collected in the voice lexicon of the third-party application program for voice recognition is more comprehensive, and the recognition result is more accurate, the target text information corresponding to the voice information used by the user for generating comments is acquired through the voice recognition application program based on the embodiment, so that the accuracy of the acquired target text information is improved.

In the third embodiment, after receiving a trigger instruction for a user to issue comment information, the application program instructs a third-party speech recognition application program to collect speech information used by the user to generate comments, the third-party speech recognition application collects the speech information used by the user to generate comments through a recording element of a terminal where the third-party speech recognition application program is located, then performs speech recognition on the collected speech information to generate text information, feeds the generated text information back to the application program, and after receiving the text information fed back by the third-party speech recognition application program, the application program determines the text information as target text information corresponding to the speech information used by the user to generate comments.

Acquiring voice information used by a user for generating comments; and carrying out voice recognition on the voice information to obtain target text information corresponding to the voice information.

That is to say, after receiving a trigger instruction of a user for issuing comment information, the application program invokes a recording element of the terminal where the application program is located, collects voice information used by the user for generating comments, performs voice recognition on the collected voice information after collection is completed, and acquires target text information corresponding to the voice information used by the user for generating comments.

In this embodiment, the acquisition and recognition of the voice information are completed by the application program itself, so as to accelerate the speed of acquiring the target text information.

The step of acquiring the target face image in the video information used for generating the comment by the user in the invention can be realized in various ways, including:

in the first embodiment, after receiving a trigger instruction for a user to issue comment information, the application program invokes a camera element of a terminal where the application program is located, collects video information used by the user to generate comments, and sends the collected video information to the cloud identification server after the collection is completed, the cloud identification server performs face identification on the video information to obtain a face image of the user and feeds the face image back to the application program, and the application program receives the face image fed back by the cloud identification server and determines the face image as a target face image in the video information used by the user to generate comments.

For example, the image pickup device may be a camera or the like.

In the second embodiment, after receiving a trigger instruction of a user for issuing comment information, the application program calls up a camera element of a terminal where the application program is located, collects video information used by the user for generating comments, and sends the collected video information to a third-party face recognition application program after the collection is finished, the third-party face recognition application program performs face recognition on the video information to obtain a face image of the user and feeds the face image back to the application program, and the application program determines the face image as a target face image in the video information used by the user for generating comments after receiving the face image fed back by the third-party face recognition application program.

In the third embodiment, the application program instructs the face recognition application program to collect video information used by the user to generate comments, and then receives a target face image in the video information fed back by the face recognition application program, where the face recognition application program is: and the target face image is obtained by carrying out face recognition on the video information by the face recognition application program.

That is to say, after receiving a trigger instruction of a user for issuing comment information, an application program instructs a third-party face recognition application program to acquire video information of the user, the third-party face recognition application program acquires the video information of the user through a camera element of a terminal where the third-party face recognition application program is located, then performs face recognition on the acquired video information to obtain a plurality of face images of the user and feeds the face images back to the application program, the application program receives the plurality of face images of the user and then performs screening through preset conditions, and one face image of the user meeting the conditions is selected as a target face image.

For example, when the application receives a plurality of user face images and then performs filtering according to a preset condition, the application may filter an image in which eyes are open and a mouth is open among the plurality of user face images.

Acquiring video information used by a user for generating comments; and carrying out face recognition on the video information to obtain a target face image in the video information.

That is to say, after receiving a trigger instruction of a user for issuing comment information, the application program invokes the camera element of the terminal where the application program is located, collects video information used by the user for generating comments, and after collection, performs face recognition on the collected video information to obtain target text information corresponding to voice information used by the user for generating comments.

The third-party speech recognition application program and the third-party face recognition application program may be independent application programs installed in a terminal where the application programs are located, or may be application programs that are plug-ins or functional modules of application programs that are execution subjects.

The above-mentioned embodiments are only some embodiments of the present invention, and do not constitute any limitation to the present invention, and all other embodiments obtained by those skilled in the art without any inventive work based on the above-mentioned embodiments of the present invention belong to the protection scope of the present invention.

In an embodiment of the present invention, the information sending method further includes:

In the process of collecting the voice information, in order to prevent the influence of the audio information output by the terminal where the voice recognition application program is located on the recorded voice information and further prevent the inaccuracy of the finally recognized target text information, the terminal where the voice recognition application program is located can be set to be in a mute mode in the process of collecting the voice information; and correspondingly setting the terminal to be in the non-silent mode after the acquisition is finished.

In one embodiment of the present invention, the step of sending the target text information and the target face image to a server includes:

sending the text information and the face image with the timestamp to a server;

In the information transmission process, the transmission speed of the text information is higher than that of the image, the difference is more obvious under the condition that the network is delayed, and under the condition, if a certain user sends a plurality of pieces of text information and face images for comment at the same time, the server can have a disordered corresponding relation after receiving the plurality of pieces of text information and face images sent by the user, namely the server cannot distinguish which piece of text information corresponds to which face image. Based on the method, the application program can set the same time stamp for the target text information and the target face image before sending the target text information and the target face image to the server, and the server can combine the text information and the face information with the same time stamp to generate comment information according to the time stamp after receiving the text information and the face image.

In an embodiment of the present invention, after the information sending method sends the target text information and the target face image to a server, the information sending method further includes:

And the server feeds the comment information generated by combination back to the user application program for display, and pushes the comment information to the application programs of all users for display.

All the users mentioned above can be understood as: the user currently using the application program can of course also understand: all registered users.

In one implementation, the application may display comment information posted by the user in the form of a bullet screen, which, as the name suggests, refers to a screen formed with many bullets, and a large number of spout comments when flying through the screen appear as a bullet screen in a flying shooting game. The comment information issued by the user is displayed in a dynamic form, so that the interest of the user in issuing comments is further improved.

In another implementation manner, the application program may also statically display the comment information in the comment information display area in the form of an information bar. For example, comments from microblogs, comments from WeChat friends in published status, comments from published articles, and the like.

Referring to fig. 2, a schematic structural diagram of an information sending apparatus according to an embodiment of the present invention is shown, where the apparatus is applied to an application program, and the apparatus specifically includes:

the first obtaining module 200 is configured to obtain target text information corresponding to voice information used by a user to generate a comment;

a second obtaining module 210, configured to obtain a target face image in video information used by a user to generate a comment;

the sending module 220 is configured to send the target text information and the target face image to a server, so that the server combines the target text information and the target face image to generate comment information to be published.

In an embodiment of the present invention, the first obtaining module 200 includes:

In one embodiment of the present invention, the information transmitting apparatus further includes:

the setting module is used for setting the terminal where the voice recognition application program is in a mute mode in the process of acquiring the voice information;

In an embodiment of the present invention, the second obtaining module 210 includes:

In one embodiment of the present invention, the sending module 220 is specifically configured to,

sending the text information and the face image with the timestamp to a server;

In one embodiment of the present invention, an information transmitting apparatus further includes:

An embodiment of the present invention further provides an electronic device, as shown in fig. 3, including a processor 001, a communication interface 002, a memory 003 and a communication bus 004, where the processor 001, the communication interface 002 and the memory 003 complete mutual communication through the communication bus 004,

a memory 003 for storing a computer program;

the processor 001 is configured to implement the information transmission method according to the embodiment of the present invention when executing the program stored in the memory 003.

Specifically, the information sending method includes:

It should be noted that, the processor 001 executes the program stored in the memory 003 to implement other embodiments of the information sending method, which are the same as the embodiments provided in the foregoing method embodiments and are not described again here.

The communication bus mentioned in the electronic device may be a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus. The communication bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown, but this does not mean that there is only one bus or one type of bus.

The communication interface is used for communication between the electronic equipment and other equipment.

The Memory may include a Random Access Memory (RAM) or a non-volatile Memory (non-volatile Memory), such as at least one disk Memory. Optionally, the memory may also be at least one memory device located remotely from the processor.

The Processor may be a general-purpose Processor, and includes a Central Processing Unit (CPU), a network Processor (Ne word Processor, NP), and the like; the Integrated Circuit may also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, a discrete Gate or transistor logic device, or a discrete hardware component.

In another embodiment of the present invention, a computer-readable storage medium is further provided, where instructions are stored in the computer-readable storage medium, and when the instructions are executed on a computer, the instructions cause the computer to implement the information sending method according to the embodiment of the present invention.

Specifically, the information sending method includes:

It should be noted that other embodiments for implementing the information sending method through the computer-readable storage medium are the same as the embodiments provided in the foregoing method embodiments, and are not described herein again.

In another embodiment of the present invention, a computer program product containing instructions is also provided, which when run on a computer, causes the computer to implement the information sending method according to the embodiment of the present invention.

Specifically, the information sending method includes:

It should be noted that other embodiments for implementing the information sending method by using the computer program product are the same as the embodiments provided in the foregoing method embodiment section, and are not described again here.

In the above embodiments, the implementation may be wholly or partially realized by software, hardware, firmware, or any combination thereof. When implemented in software, may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When loaded and executed on a computer, cause the processes or functions described in accordance with the embodiments of the invention to occur, in whole or in part. The computer may be a general purpose computer, a special purpose computer, a network of computers, or other programmable device. The computer instructions may be stored in a computer readable storage medium or transmitted from one computer readable storage medium to another, for example, from one website site, computer, server, or data center to another website site, computer, server, or data center via wired (e.g., coaxial cable, fiber optic, Digital Subscriber Line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.). The computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device, such as a server, a data center, etc., that incorporates one or more of the available media. The usable medium may be a magnetic medium (e.g., floppy Disk, hard Disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.

It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.

All the embodiments in the present specification are described in a related manner, and the same and similar parts among the embodiments may be referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for the apparatus, the electronic device, the computer program product, and the computer-readable storage medium embodiments, since they are substantially similar to the method embodiments, the description is relatively simple, and for the relevant points, reference may be made to the partial description of the method embodiments.

The above description is only for the preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims

1. An information sending method applied to an application program is characterized by comprising the following steps:

sending the text information and the face image with the timestamp set to a server so that the server combines the target text information and the target face image to generate comment information to be issued, wherein the target text information and the target face image are both acquired by an application program after receiving a trigger instruction of a user for issuing the comment information; the comment information to be issued is: and when detecting that the time stamp set by the received text information is consistent with the time stamp set by the face image, the server combines the received text information and the face image to generate comment information.

2. The method of claim 1, wherein the step of obtaining target text information corresponding to the voice information used by the user to generate the comment comprises:

3. The method of claim 1, wherein the step of obtaining target text information corresponding to the voice information used by the user to generate the comment comprises:

collecting voice information used by a user for generating comments;

4. A method according to claim 2 or 3, characterized in that the method further comprises:

5. The method of claim 1, wherein the step of obtaining the target face image in the video information used by the user to generate the comment comprises:

6. The method of claim 1, wherein the step of obtaining the target face image in the video information used by the user to generate the comment comprises:

collecting video information used by a user for generating comments;

7. The method of claim 1, further comprising:

8. An information transmission apparatus applied to an application program, comprising:

the sending module is used for setting the same time stamp for the target text information and the target face image; sending the text information and the face image with the timestamp set to a server so that the server combines the target text information and the target face image to generate comment information to be issued, wherein the target text information and the target face image are both acquired by an application program after receiving a trigger instruction of a user for issuing the comment information; the comment information to be issued is: and when detecting that the time stamp set by the received text information is consistent with the time stamp set by the face image, the server combines the received text information and the face image to generate comment information.

9. The apparatus of claim 8, wherein the first obtaining module comprises:

10. The apparatus of claim 8, wherein the first obtaining module comprises:

11. The apparatus of claim 9 or 10, further comprising:

12. The apparatus of claim 8, wherein the second obtaining module comprises:

13. The apparatus of claim 8, wherein the second obtaining module comprises:

14. The apparatus of claim 8, further comprising:

15. An electronic device is characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor and the communication interface are used for realizing mutual communication by the memory through the communication bus;

a memory for storing a computer program;

a processor for implementing the method steps of any of claims 1 to 7 when executing a program stored in the memory.

16. A computer-readable storage medium having instructions stored therein, which when run on a computer, cause the computer to implement the method of any one of claims 1-7.