CN107612815B - Information sending method, device and equipment - Google Patents

Information sending method, device and equipment

Info

Publication number
CN107612815B
Authority
CN
China
Prior art keywords
information
voice
user
face image
comment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710848158.8A
Other languages
Chinese (zh)
Other versions
CN107612815A (en)
Inventor
李聪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kingsoft Internet Security Software Co Ltd
Original Assignee
Beijing Kingsoft Internet Security Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kingsoft Internet Security Software Co Ltd filed Critical Beijing Kingsoft Internet Security Software Co Ltd
Priority to CN201710848158.8A priority Critical patent/CN107612815B/en
Publication of CN107612815A publication Critical patent/CN107612815A/en
Application granted granted Critical
Publication of CN107612815B publication Critical patent/CN107612815B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Information Transfer Between Computers (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The embodiment of the invention provides an information sending method, device and equipment, wherein the method comprises the following steps: acquiring target text information corresponding to voice information used for generating comments by a user; acquiring a target face image in video information used for generating comments by a user; and sending the target text information and the target face image to a server so that the server combines the target text information and the target face image to generate comment information to be published. When the information is sent by the scheme provided by the embodiment of the invention, the target text information and the target face image can be sent to the server, so that the comment information finally generated by the server not only contains the character information, but also contains the face image of the user, and therefore, the comment information not only can express the emotion of the user, but also can enrich the content of the comment information.

Description

Information sending method, device and equipment
Technical Field
The invention relates to the technical field of information processing, in particular to a comment information sending method, device and equipment.
Background
With the rapid development of internet technology, many multimedia applications provide users with an instant comment function, so that users can comment on related content, for example, to gripe about or poke fun at the content, to discuss the content with other users, and the like.
In the prior art, a text input box and a send button are provided in the comment area of an application program. When a user wants to post a comment, the user enters the text of the comment in the text input box. When the application program receives the user's instruction to send the comment, it sends the text content to a server, and the server, upon receiving the text content, converts it into comment information.
However, in the process of implementing the invention, the inventor found that the prior art has at least the following problem: when an existing application program sends information for generating a comment to a server, it sends only the text content of the user's comment, so the generated comment information contains only text and its content is monotonous.
Disclosure of Invention
The embodiment of the invention aims to provide an information sending method, device and equipment, so that a server can generate comment information with rich content. The specific technical scheme is as follows:
an information sending method applied to an application program is characterized by comprising the following steps:
acquiring target text information corresponding to voice information used for generating comments by a user;
acquiring a target face image in video information used for generating comments by a user;
and sending the target text information and the target face image to a server so that the server combines the target text information and the target face image to generate comment information to be published.
Optionally, the step of obtaining the target text information corresponding to the voice information used by the user to generate the comment includes:
instructing a voice recognition application to collect voice information used by a user to generate a comment, wherein the voice recognition application is: a third party application for performing speech recognition;
and receiving target text information corresponding to the voice information fed back by the voice recognition application program, wherein the target text information is obtained by performing voice recognition on the voice information by the voice recognition application program.
Optionally, the step of obtaining the target text information corresponding to the voice information used by the user to generate the comment includes:
collecting voice information used by a user for generating comments;
and carrying out voice recognition on the voice information to obtain target text information corresponding to the voice information.
Optionally, the method further includes:
in the process of collecting the voice information, setting the terminal on which the voice recognition application program is located to a mute mode;
and after the voice information is collected, setting the terminal to a non-mute mode.
Optionally, the step of obtaining a target face image in video information used by a user to generate a comment includes:
instructing a face recognition application to collect video information used by a user to generate a comment, wherein the face recognition application is: a third party application for face recognition;
and receiving a target face image in the video information fed back by the face recognition application program, wherein the target face image is obtained by carrying out face recognition on the video information by the face recognition application program.
Optionally, the step of obtaining a target face image in video information used by a user to generate a comment includes:
collecting video information used by a user for generating comments;
and carrying out face recognition on the video information to obtain a target face image in the video information.
Optionally, the step of sending the target text information and the target face image to a server includes:
setting the same time stamp for the target text information and the target face image;
sending the text information and the face image with the timestamp to a server;
the comment information to be published is: comment information generated by the server by combining the received text information and the received face image when the server detects that the timestamp set for the received text information is consistent with the timestamp set for the face image.
Optionally, after the step of sending the text information and the face image to a server, the method further includes:
and receiving the comment information fed back by the server, and displaying the comment information.
In another aspect of the present invention, there is provided an information transmitting apparatus applied to an application program, including:
the first acquisition module is used for acquiring target text information corresponding to the voice information used for generating comments by the user;
the second acquisition module is used for acquiring a target face image in the video information used for generating comments by the user;
and the sending module is used for sending the target text information and the target face image to a server so that the server combines the target text information and the target face image to generate comment information to be published.
Optionally, the first obtaining module includes:
a first indicating unit, configured to instruct a voice recognition application program to collect voice information used by a user to generate a comment, wherein the voice recognition application program is: a third-party application program for performing voice recognition;
a first receiving unit, configured to receive target text information corresponding to the voice information fed back by the voice recognition application, where the target text information is obtained by performing voice recognition on the voice information by the voice recognition application.
Optionally, the first obtaining module includes:
the first acquisition unit is used for acquiring voice information used by a user for generating comments;
and the first identification unit is used for carrying out voice identification on the voice information to obtain target text information corresponding to the voice information.
Optionally, the apparatus further comprises:
the setting module is used for setting the terminal on which the voice recognition application program is located to a mute mode in the process of collecting the voice information, and for setting the terminal to a non-mute mode after the voice information is collected.
Optionally, the second obtaining module includes:
the second indicating unit is used for indicating a face recognition application program to acquire video information used by a user for generating comments, wherein the face recognition application program is as follows: a third party application for face recognition;
and the second receiving unit is used for receiving a target face image in the video information fed back by the face recognition application program, wherein the target face image is obtained by carrying out face recognition on the video information by the face recognition application program.
Optionally, the second obtaining module includes:
the second acquisition unit is used for acquiring video information used by a user for generating comments;
and the second identification unit is used for carrying out face identification on the video information to obtain a target face image in the video information.
Optionally, the sending module is specifically configured to,
setting the same time stamp for the target text information and the target face image;
sending the text information and the face image with the timestamp to a server;
the comment information to be published is: comment information generated by the server by combining the received text information and the received face image when the server detects that the timestamp set for the received text information is consistent with the timestamp set for the face image.
Optionally, the apparatus further comprises:
and the display module is used for receiving the comment information fed back by the server and displaying the comment information.
In another aspect of the present invention, an electronic device is further provided, which includes a processor, a communication interface, a memory, and a communication bus, where the processor, the communication interface, and the memory complete communication with each other through the communication bus;
a memory for storing a computer program;
and a processor for implementing any of the above-described information transmission methods when executing the program stored in the memory.
In yet another aspect of the present invention, there is also provided a computer-readable storage medium having stored therein instructions, which when executed on a computer, cause the computer to execute any of the above-described information transmission methods.
In yet another aspect of the present invention, there is also provided a computer program product containing instructions which, when run on a computer, cause the computer to perform the information transmission method described in any one of the above.
The information sending method, device and equipment provided by the embodiment of the invention send, to a server, the target text information obtained by recognizing the voice information used by the user to generate a comment and the target face image obtained by recognizing the video information used by the user to generate the comment, so that the server generates comment information containing the face image of the user. When information is sent using the scheme provided by the embodiment of the invention, the text information is obtained by performing voice recognition on voice information; that is, the user publishes the text of a comment by voice input, which is more convenient and faster than character input and similar prior-art methods. In addition, because both the target text information and the target face image are sent to the server, the comment information finally generated by the server contains not only text but also the face image of the user, so the comment information can express the emotion of the user and its content is enriched. Of course, it is not necessary for any product or method practicing the invention to achieve all of the above advantages at the same time.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic flow chart of an information sending method according to an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of an information sending apparatus according to an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, a flow chart diagram of an information sending method provided in an embodiment of the present invention is shown, where the method is applied to an application program, and the method specifically includes the following steps:
The application programs referred to in the present invention include, but are not limited to, video playing applications, chat applications, and game applications; any application that has instant comment or chat functions may adopt the information sending method provided in the embodiments of the present invention.
S100, acquiring target text information corresponding to the voice information used for generating comments by the user.
S110, acquiring a target face image in the video information used for generating the comment by the user.
S120, sending the target text information and the target face image to a server so that the server combines the target text information and the target face image to generate comment information to be published.
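The three steps above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the names `CommentPayload`, `build_comment_payload` and `send_to_server` are hypothetical, and the speech recognition, face recognition and network transport are passed in as plain callables because the source does not fix how they are realized.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class CommentPayload:
    """Hypothetical container for what the client sends in step S120."""
    text: str          # target text information (S100)
    face_image: bytes  # target face image (S110)


def build_comment_payload(
    get_target_text: Callable[[], str],
    get_target_face: Callable[[], bytes],
) -> CommentPayload:
    """Run S100 and S110 and bundle their results."""
    text = get_target_text()   # S100: text recognized from the user's voice
    face = get_target_face()   # S110: face image extracted from the user's video
    return CommentPayload(text=text, face_image=face)


def send_to_server(
    payload: CommentPayload,
    transport: Callable[[CommentPayload], None],
) -> None:
    """S120: hand the payload to whatever transport the application uses."""
    transport(payload)


# Usage with stand-in callables; a list plays the role of the server.
outbox = []
payload = build_comment_payload(lambda: "great scene", lambda: b"face-bytes")
send_to_server(payload, outbox.append)
```

Passing the acquisition steps in as callables keeps the sketch independent of which of the acquisition embodiments described later is used.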
When information is sent using the scheme provided by the embodiment of the invention, the text information is obtained by performing voice recognition on voice information; that is, the user publishes the text of a comment by voice input, which is more convenient and faster than character input and similar prior-art methods. In addition, because both the target text information and the target face image are sent to the server, the comment information finally generated by the server contains not only text but also the face image of the user, so the comment information can express the emotion of the user and its content is enriched.
The step of obtaining the target text information corresponding to the voice information used by the user to generate the comment may have various implementation manners, and the specific implementation manner of the step is described in detail in the following embodiments one to four:
in the first embodiment, after receiving a trigger instruction for a user to issue comment information, an application program invokes a recording element of a terminal where the application program is located, collects voice information used by the user to generate comments, and sends the collected voice information to a cloud recognition server after the collection is completed, the cloud recognition server recognizes the voice information as text information and then feeds the text information back to the application program, and after receiving the text information fed back by the cloud recognition server, the application program determines the text information as target text information corresponding to the voice information used by the user to generate comments.
For example, the recording element may be a microphone or the like.
In a second embodiment, the application instructs the speech recognition application to collect the speech information used by the user to generate the comment, and then receives target text information corresponding to the speech information fed back by the speech recognition application, where the speech recognition application is a third-party application for performing speech recognition, and the target text information is obtained by the speech recognition application performing speech recognition on the speech information.
That is to say, after receiving a trigger instruction of a user for issuing comment information, the application program invokes a recording element of the terminal where the application program is located, collects voice information used by the user for generating comments, and sends the collected voice information to a third-party voice recognition application program after the collection is completed, the third-party voice recognition application program recognizes the voice information as text information and then feeds the text information back to the application program, and after receiving the text information fed back by the third-party voice recognition application program, the application program determines the text information as target text information corresponding to the voice information used by the user for generating comments.
Because the voice lexicon of a third-party application program dedicated to voice recognition is more comprehensive and its recognition results are more accurate, acquiring the target text information corresponding to the voice information used by the user for generating comments through the voice recognition application program, as in this embodiment, improves the accuracy of the acquired target text information.
In the third embodiment, after receiving a trigger instruction for a user to issue comment information, the application program instructs a third-party speech recognition application program to collect speech information used by the user to generate comments, the third-party speech recognition application collects the speech information used by the user to generate comments through a recording element of a terminal where the third-party speech recognition application program is located, then performs speech recognition on the collected speech information to generate text information, feeds the generated text information back to the application program, and after receiving the text information fed back by the third-party speech recognition application program, the application program determines the text information as target text information corresponding to the speech information used by the user to generate comments.
In the fourth embodiment, the application program collects the voice information used by the user for generating comments, and performs voice recognition on the voice information to obtain the target text information corresponding to the voice information.
That is to say, after receiving a trigger instruction of a user for issuing comment information, the application program invokes a recording element of the terminal where the application program is located, collects voice information used by the user for generating comments, performs voice recognition on the collected voice information after collection is completed, and acquires target text information corresponding to the voice information used by the user for generating comments.
In this embodiment, both the collection and the recognition of the voice information are completed by the application program itself, which speeds up acquisition of the target text information.
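The embodiments above differ mainly in who records the audio and who recognizes it. A minimal sketch of the two shapes this takes is given below; the function names are hypothetical, and the recorder and recognizer are stand-ins for a microphone, a cloud recognition server, a third-party application, or the application's own recognizer.

```python
from typing import Callable

Recorder = Callable[[], bytes]       # returns recorded audio
Recognizer = Callable[[bytes], str]  # turns audio into text


def text_via_own_recording(record: Recorder, recognize: Recognizer) -> str:
    """The application records the audio itself, then hands it to a recognizer.

    The recognizer may be a cloud recognition server, a third-party
    application, or the application's own recognition logic.
    """
    audio = record()
    return recognize(audio)


def text_via_delegation(collect_and_recognize: Callable[[], str]) -> str:
    """The application only instructs a third-party application, which both
    records and recognizes, and receives the text that is fed back."""
    return collect_and_recognize()


# Stand-ins used only for illustration.
def fake_record() -> bytes:
    return b"audio:nice goal"


def fake_recognize(audio: bytes) -> str:
    return audio.decode("utf-8").split(":", 1)[1]


local_text = text_via_own_recording(fake_record, fake_recognize)
delegated_text = text_via_delegation(lambda: fake_recognize(fake_record()))
```

Either path yields the same kind of result, the target text information, so the rest of the sending flow does not need to know which path was taken.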
The step of acquiring the target face image in the video information used for generating the comment by the user in the invention can be realized in various ways, including:
in the first embodiment, after receiving a trigger instruction for a user to issue comment information, the application program invokes a camera element of a terminal where the application program is located, collects video information used by the user to generate comments, and sends the collected video information to the cloud identification server after the collection is completed, the cloud identification server performs face identification on the video information to obtain a face image of the user and feeds the face image back to the application program, and the application program receives the face image fed back by the cloud identification server and determines the face image as a target face image in the video information used by the user to generate comments.
For example, the camera element may be a camera or the like.
In the second embodiment, after receiving a trigger instruction of a user for issuing comment information, the application program calls up a camera element of a terminal where the application program is located, collects video information used by the user for generating comments, and sends the collected video information to a third-party face recognition application program after the collection is finished, the third-party face recognition application program performs face recognition on the video information to obtain a face image of the user and feeds the face image back to the application program, and the application program determines the face image as a target face image in the video information used by the user for generating comments after receiving the face image fed back by the third-party face recognition application program.
In the third embodiment, the application program instructs the face recognition application program to collect video information used by the user to generate comments, and then receives a target face image in the video information fed back by the face recognition application program, where the face recognition application program is a third-party application for face recognition, and the target face image is obtained by the face recognition application program performing face recognition on the video information.
That is to say, after receiving a trigger instruction of a user for issuing comment information, an application program instructs a third-party face recognition application program to acquire video information of the user, the third-party face recognition application program acquires the video information of the user through a camera element of a terminal where the third-party face recognition application program is located, then performs face recognition on the acquired video information to obtain a plurality of face images of the user and feeds the face images back to the application program, the application program receives the plurality of face images of the user and then performs screening through preset conditions, and one face image of the user meeting the conditions is selected as a target face image.
For example, when the application program screens the received plurality of user face images according to a preset condition, it may select, from among them, an image in which the eyes are open and the mouth is open.
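The screening step can be sketched as follows, assuming the face recognizer reports, for each candidate image, whether the eyes and mouth are open. The `FaceCandidate` type and its fields are hypothetical, and taking the first matching image is only one possible choice among the images that meet the condition.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class FaceCandidate:
    """Hypothetical record for one face image returned by the recognizer."""
    image: bytes
    eyes_open: bool
    mouth_open: bool


def select_target_face(candidates: Iterable[FaceCandidate]) -> Optional[FaceCandidate]:
    """Return the first candidate that meets the preset condition
    (eyes open and mouth open), or None if no candidate qualifies."""
    for candidate in candidates:
        if candidate.eyes_open and candidate.mouth_open:
            return candidate
    return None


frames = [
    FaceCandidate(image=b"frame-1", eyes_open=False, mouth_open=True),
    FaceCandidate(image=b"frame-2", eyes_open=True, mouth_open=True),
    FaceCandidate(image=b"frame-3", eyes_open=True, mouth_open=True),
]
target = select_target_face(frames)
```

The source does not say what happens when no image meets the condition; returning `None` here simply leaves that decision to the caller.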
In the fourth embodiment, the application program collects the video information used by the user for generating comments, and performs face recognition on the video information to obtain the target face image in the video information.
That is to say, after receiving a trigger instruction of a user for issuing comment information, the application program invokes the camera element of the terminal where the application program is located, collects video information used by the user for generating comments, and after collection is completed, performs face recognition on the collected video information to obtain the target face image in the video information used by the user for generating comments.
The third-party speech recognition application program and the third-party face recognition application program may be independent application programs installed in the terminal where the application program is located, or may be plug-ins or functional modules of the application program that serves as the execution subject.
The above-mentioned embodiments are only some embodiments of the present invention, and do not constitute any limitation to the present invention, and all other embodiments obtained by those skilled in the art without any inventive work based on the above-mentioned embodiments of the present invention belong to the protection scope of the present invention.
In an embodiment of the present invention, the information sending method further includes:
in the process of collecting the voice information, setting the terminal on which the voice recognition application program is located to a mute mode;
and after the voice information is collected, setting the terminal to a non-mute mode.
During collection of the voice information, audio output by the terminal on which the voice recognition application program is located could interfere with the recorded voice information and make the finally recognized target text information inaccurate. To prevent this, the terminal can be set to the mute mode while the voice information is being collected and set back to the non-mute mode once collection is complete.
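One way to make sure the terminal always leaves the mute mode after collection, even if recording fails, is a context manager. The sketch below is illustrative only: `Terminal` is a stand-in for the device's audio control, which real platforms expose through their own APIs, and `record_voice` is a hypothetical placeholder for the actual recording call.

```python
from contextlib import contextmanager


class Terminal:
    """Stand-in for the terminal's audio control (hypothetical)."""

    def __init__(self) -> None:
        self.muted = False

    def set_muted(self, muted: bool) -> None:
        self.muted = muted


@contextmanager
def muted_during_capture(terminal: Terminal):
    """Mute the terminal for the duration of the block, then unmute it."""
    terminal.set_muted(True)
    try:
        yield terminal
    finally:
        # Runs even if recording raises, so the terminal is never left muted.
        terminal.set_muted(False)


def record_voice(terminal: Terminal) -> bytes:
    """Placeholder for the real recording call; returns dummy audio."""
    return b"pcm-audio-placeholder"


terminal = Terminal()
with muted_during_capture(terminal):
    muted_while_recording = terminal.muted
    audio = record_voice(terminal)
```

The sketch unconditionally returns to the non-mute mode, as the text describes; an implementation might instead restore whatever state the user had set before recording began.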
In one embodiment of the present invention, the step of sending the target text information and the target face image to a server includes:
setting the same time stamp for the target text information and the target face image;
sending the text information and the face image with the timestamp to a server;
the comment information to be published is: comment information generated by the server by combining the received text information and the received face image when the server detects that the timestamp set for the received text information is consistent with the timestamp set for the face image.
During transmission, text information travels faster than an image, and the difference is more pronounced when the network is delayed. In this case, if a user sends several pieces of text information and face images for comments at the same time, the correspondence may become confused at the server; that is, the server cannot tell which piece of text information corresponds to which face image. To address this, the application program can set the same timestamp for the target text information and the target face image before sending them to the server. After receiving the text information and the face image, the server can combine the text information and the face image that carry the same timestamp to generate comment information.
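The server-side matching described above can be sketched as a small buffer that holds whichever part arrives first and combines the two once both parts with the same timestamp are present. The class name `CommentAssembler`, the integer timestamps, and the inclusion of a user identifier in the key are assumptions made for illustration; the source only requires that the timestamps be consistent.

```python
from typing import Dict, Optional, Tuple

Key = Tuple[str, int]  # (user id, timestamp); keying by user is an assumption


class CommentAssembler:
    """Pairs text and face images that carry the same timestamp."""

    def __init__(self) -> None:
        self._texts: Dict[Key, str] = {}
        self._faces: Dict[Key, bytes] = {}

    def on_text(self, user: str, timestamp: int, text: str) -> Optional[dict]:
        """Store a received text; return a comment if its face image is here."""
        self._texts[(user, timestamp)] = text
        return self._combine((user, timestamp))

    def on_face(self, user: str, timestamp: int, image: bytes) -> Optional[dict]:
        """Store a received face image; return a comment if its text is here."""
        self._faces[(user, timestamp)] = image
        return self._combine((user, timestamp))

    def _combine(self, key: Key) -> Optional[dict]:
        # Only combine when both halves with this timestamp have arrived.
        if key not in self._texts or key not in self._faces:
            return None
        return {
            "user": key[0],
            "timestamp": key[1],
            "text": self._texts.pop(key),
            "face_image": self._faces.pop(key),
        }
```

For example, if two texts arrive before either image, each `on_text` call returns `None`, and each later `on_face` call returns the comment for its own timestamp regardless of arrival order.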
In an embodiment of the present invention, after the information sending method sends the target text information and the target face image to a server, the information sending method further includes:
and receiving the comment information fed back by the server, and displaying the comment information.
The server feeds the combined comment information back to the user's application program for display, and also pushes the comment information to the application programs of all users for display.
"All users" above can be understood as all users currently using the application program, or alternatively as all registered users.
In one implementation, the application may display comment information posted by the user in the form of a bullet screen. As the name suggests, a bullet screen is a screen filled with many bullets: when a large number of comments fly across the screen, they resemble the barrage of bullets in a shooting game. Displaying the comment information issued by the user in this dynamic form further increases the user's interest in posting comments.
In another implementation, the application program may also statically display the comment information in the comment information display area in the form of an information bar, for example, in the manner of comments on microblog posts, comments on statuses published by WeChat friends, comments on published articles, and the like.
When information is sent using the scheme provided by the embodiment of the invention, the text information is obtained by performing voice recognition on voice information; that is, the user publishes the text of a comment by voice input, which is more convenient and faster than character input and similar prior-art methods. In addition, because both the target text information and the target face image are sent to the server, the comment information finally generated by the server contains not only text but also the face image of the user, so the comment information can express the emotion of the user and its content is enriched.
Referring to fig. 2, a schematic structural diagram of an information sending apparatus according to an embodiment of the present invention is shown, where the apparatus is applied to an application program, and the apparatus specifically includes:
the first obtaining module 200 is configured to obtain target text information corresponding to voice information used by a user to generate a comment;
a second obtaining module 210, configured to obtain a target face image in video information used by a user to generate a comment;
the sending module 220 is configured to send the target text information and the target face image to a server, so that the server combines the target text information and the target face image to generate comment information to be published.
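The relationship between the three modules can be illustrated with a minimal sketch. Everything named here (`CommentPayload`, `InformationSender`, the three callables) is hypothetical and introduced only for illustration; the speech recognizer, face extractor and network transport are stubs, since the embodiment does not prescribe a concrete API.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class CommentPayload:
    """What the sending module hands to the server (field names are assumed)."""
    text: str
    face_image: bytes


class InformationSender:
    """Hypothetical wiring of modules 200, 210 and 220."""

    def __init__(
        self,
        recognize_speech: Callable[[bytes], str],
        extract_face: Callable[[bytes], bytes],
        send: Callable[[CommentPayload], None],
    ) -> None:
        self._recognize_speech = recognize_speech  # role of first obtaining module 200
        self._extract_face = extract_face          # role of second obtaining module 210
        self._send = send                          # role of sending module 220

    def publish(self, voice: bytes, video: bytes) -> CommentPayload:
        # Obtain the target text from the voice information.
        text = self._recognize_speech(voice)
        # Obtain the target face image from the video information.
        face = self._extract_face(video)
        # Send both to the server, which combines them into a comment.
        payload = CommentPayload(text=text, face_image=face)
        self._send(payload)
        return payload


# Stub usage: the lambdas stand in for real recognition and transport.
sent: List[CommentPayload] = []
sender = InformationSender(
    recognize_speech=lambda voice: voice.decode("utf-8"),
    extract_face=lambda video: video[:4],
    send=sent.append,
)
result = sender.publish(b"great goal", b"FACEframes")
```

Passing the recognizer and extractor in as callables lets the same sender work whether recognition is delegated to a third-party application or performed locally, which are the two alternatives described for each obtaining module.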
In an embodiment of the present invention, the first obtaining module 200 includes:
a first indicating unit, configured to instruct a voice recognition application program to collect voice information used by a user to generate a comment, where the voice recognition application program is a third-party application for performing voice recognition;
a first receiving unit, configured to receive target text information corresponding to the voice information fed back by the voice recognition application, where the target text information is obtained by performing voice recognition on the voice information by the voice recognition application.
In an embodiment of the present invention, the first obtaining module 200 includes:
the first acquisition unit is used for acquiring voice information used by a user for generating comments;
and the first identification unit is used for carrying out voice identification on the voice information to obtain target text information corresponding to the voice information.
In one embodiment of the present invention, the information transmitting apparatus further includes:
a setting module, configured to set the terminal on which the voice recognition application program runs to a mute mode while the voice information is being collected;
and to set the terminal to a non-mute mode after the voice information has been collected.
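The mute-then-restore behaviour of the setting module can be sketched as a context manager. The `Terminal` class below is a hypothetical stand-in for a device audio API, not a real platform interface; the sketch only shows that the non-mute mode is re-applied even if collection fails.

```python
from contextlib import contextmanager
from typing import Callable, Iterator, List


class Terminal:
    """Hypothetical stand-in for the device's audio-mode API."""

    def __init__(self) -> None:
        self.muted = False
        self.log: List[str] = []

    def set_muted(self, muted: bool) -> None:
        self.muted = muted
        self.log.append("mute" if muted else "unmute")


@contextmanager
def muted_during_capture(terminal: Terminal) -> Iterator[None]:
    # Mute before collection starts.
    terminal.set_muted(True)
    try:
        yield
    finally:
        # Return to non-mute mode once collection ends, even on error.
        terminal.set_muted(False)


def collect_voice(terminal: Terminal, record: Callable[[], bytes]) -> bytes:
    with muted_during_capture(terminal):
        return record()


terminal = Terminal()
audio = collect_voice(terminal, lambda: b"pcm-samples")
```

A real implementation might prefer to restore whatever mode the terminal was in before collection rather than always switching to non-mute; the text above only specifies the latter.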
In an embodiment of the present invention, the second obtaining module 210 includes:
the second indicating unit is used for indicating a face recognition application program to acquire video information used by a user for generating comments, wherein the face recognition application program is as follows: a third party application for face recognition;
and the second receiving unit is used for receiving a target face image in the video information fed back by the face recognition application program, wherein the target face image is obtained by carrying out face recognition on the video information by the face recognition application program.
In an embodiment of the present invention, the second obtaining module 210 includes:
the second acquisition unit is used for acquiring video information used by a user for generating comments;
and the second identification unit is used for carrying out face identification on the video information to obtain a target face image in the video information.
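As a rough illustration of obtaining a target face image from video information, the sketch below scans the frames with a caller-supplied face detector and crops one face. The selection rule (keep the largest detected face) is an assumption made for the example and is not stated in the embodiment; `detect_faces` is a hypothetical callable, and frames are modelled as plain nested lists of grayscale pixels.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (left, top, width, height)
Frame = List[List[int]]          # rows of grayscale pixel values


def crop(frame: Frame, box: Box) -> Frame:
    left, top, width, height = box
    return [row[left:left + width] for row in frame[top:top + height]]


def target_face_image(
    frames: Sequence[Frame],
    detect_faces: Callable[[Frame], List[Box]],
) -> Optional[Frame]:
    """Return the crop of the largest face found in any frame, or None."""
    best: Optional[Tuple[int, Frame, Box]] = None
    for frame in frames:
        for box in detect_faces(frame):
            area = box[2] * box[3]
            if best is None or area > best[0]:
                best = (area, frame, box)
    if best is None:
        return None
    return crop(best[1], best[2])


# Stub usage: a fake detector that "finds" a small face in the first
# frame and a larger one in the second.
frame_a: Frame = [[0] * 4 for _ in range(4)]
frame_b: Frame = [[10 + r * 4 + c for c in range(4)] for r in range(4)]


def fake_detector(frame: Frame) -> List[Box]:
    return [(0, 0, 1, 1)] if frame[0][0] == 0 else [(1, 1, 2, 2)]


face = target_face_image([frame_a, frame_b], fake_detector)
no_face = target_face_image([frame_a], lambda frame: [])
```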
In one embodiment of the present invention, the sending module 220 is specifically configured to,
setting the same time stamp for the target text information and the target face image;
sending the text information and the face image with the timestamp to a server;
the comment information to be published is comment information that the server generates by combining the received text information and the received face image when it detects that the timestamp set for the received text information is consistent with the timestamp set for the received face image.
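The server-side matching described above can be sketched as follows. This is a minimal illustration under stated assumptions: the class and method names are hypothetical, the text and the face image are assumed to arrive as separate messages in either order, and the timestamp alone is used as the matching key (a real server would presumably also key on the user or session).

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Comment:
    timestamp: int
    text: str
    face_image: bytes


class CommentAssembler:
    """Hypothetical server-side buffer that pairs text and face by timestamp."""

    def __init__(self) -> None:
        self._texts: Dict[int, str] = {}
        self._faces: Dict[int, bytes] = {}

    def receive_text(self, timestamp: int, text: str) -> Optional[Comment]:
        self._texts[timestamp] = text
        return self._try_combine(timestamp)

    def receive_face(self, timestamp: int, face_image: bytes) -> Optional[Comment]:
        self._faces[timestamp] = face_image
        return self._try_combine(timestamp)

    def _try_combine(self, timestamp: int) -> Optional[Comment]:
        # Combine only when both parts carrying the same timestamp are present.
        if timestamp in self._texts and timestamp in self._faces:
            return Comment(
                timestamp,
                self._texts.pop(timestamp),
                self._faces.pop(timestamp),
            )
        return None


assembler = CommentAssembler()
first = assembler.receive_text(1505779200000, "nice shot")
mismatch = assembler.receive_face(1505779200001, b"other-face")
comment = assembler.receive_face(1505779200000, b"face-bytes")
```

Here `first` and `mismatch` are `None` because no partner with the same timestamp has arrived yet, while `comment` holds the combined text and face image.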
In one embodiment of the present invention, an information transmitting apparatus further includes:
and the display module is used for receiving the comment information fed back by the server and displaying the comment information.
An embodiment of the present invention further provides an electronic device, as shown in fig. 3, including a processor 001, a communication interface 002, a memory 003 and a communication bus 004, where the processor 001, the communication interface 002 and the memory 003 communicate with one another through the communication bus 004;
a memory 003 for storing a computer program;
the processor 001 is configured to implement the information transmission method according to the embodiment of the present invention when executing the program stored in the memory 003.
Specifically, the information sending method includes:
acquiring target text information corresponding to voice information used for generating comments by a user;
acquiring a target face image in video information used for generating comments by a user;
and sending the target text information and the target face image to a server so that the server combines the target text information and the target face image to generate comment information to be published.
It should be noted that, the processor 001 executes the program stored in the memory 003 to implement other embodiments of the information sending method, which are the same as the embodiments provided in the foregoing method embodiments and are not described again here.
The communication bus mentioned in the electronic device may be a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus. The communication bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown, but this does not mean that there is only one bus or one type of bus.
The communication interface is used for communication between the electronic equipment and other equipment.
The memory may include a Random Access Memory (RAM) or a non-volatile memory, such as at least one disk memory. Optionally, the memory may also be at least one storage device located remotely from the processor.
The processor may be a general-purpose processor, including a Central Processing Unit (CPU), a Network Processor (NP), and the like; it may also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component.
In another embodiment of the present invention, a computer-readable storage medium is further provided, where instructions are stored in the computer-readable storage medium, and when the instructions are executed on a computer, the instructions cause the computer to implement the information sending method according to the embodiment of the present invention.
Specifically, the information sending method includes:
acquiring target text information corresponding to voice information used for generating comments by a user;
acquiring a target face image in video information used for generating comments by a user;
and sending the target text information and the target face image to a server so that the server combines the target text information and the target face image to generate comment information to be published.
It should be noted that other embodiments for implementing the information sending method through the computer-readable storage medium are the same as the embodiments provided in the foregoing method embodiments, and are not described herein again.
In another embodiment of the present invention, a computer program product containing instructions is also provided, which when run on a computer, causes the computer to implement the information sending method according to the embodiment of the present invention.
Specifically, the information sending method includes:
acquiring target text information corresponding to voice information used for generating comments by a user;
acquiring a target face image in video information used for generating comments by a user;
and sending the target text information and the target face image to a server so that the server combines the target text information and the target face image to generate comment information to be published.
It should be noted that other embodiments for implementing the information sending method by using the computer program product are the same as the embodiments provided in the foregoing method embodiment section, and are not described again here.
In the above embodiments, the implementation may be realized wholly or partially by software, hardware, firmware, or any combination thereof. When implemented in software, it may be realized wholly or partially in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer instructions are loaded and executed on a computer, the processes or functions described in the embodiments of the invention are produced in whole or in part. The computer may be a general-purpose computer, a special-purpose computer, a computer network, or another programmable device. The computer instructions may be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another; for example, they may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center by wired means (e.g., coaxial cable, optical fiber, Digital Subscriber Line (DSL)) or wireless means (e.g., infrared, radio, microwave). The computer-readable storage medium may be any available medium that a computer can access, or a data storage device, such as a server or data center, that integrates one or more available media. The available medium may be a magnetic medium (e.g., floppy disk, hard disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.
It is noted that, herein, relational terms such as first and second are used solely to distinguish one entity or action from another entity or action, without necessarily requiring or implying any actual relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
All the embodiments in the present specification are described in a related manner, and the same and similar parts among the embodiments may be referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for the apparatus, the electronic device, the computer program product, and the computer-readable storage medium embodiments, since they are substantially similar to the method embodiments, the description is relatively simple, and for the relevant points, reference may be made to the partial description of the method embodiments.
The above description is only for the preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (16)

1. An information sending method applied to an application program is characterized by comprising the following steps:
acquiring target text information corresponding to voice information used for generating comments by a user;
acquiring a target face image in video information used for generating comments by a user;
setting the same time stamp for the target text information and the target face image;
sending the text information and the face image with the timestamp set to a server, so that the server combines the target text information and the target face image to generate comment information to be published, wherein the target text information and the target face image are both acquired by the application program after it receives a trigger instruction from the user for publishing comment information; and the comment information to be published is comment information that the server generates by combining the received text information and the received face image when it detects that the timestamp set for the received text information is consistent with the timestamp set for the received face image.
2. The method of claim 1, wherein the step of obtaining target text information corresponding to the voice information used by the user to generate the comment comprises:
instructing a voice recognition application to collect voice information used by a user to generate a comment, wherein the voice recognition application is: a third party application for performing speech recognition;
and receiving target text information corresponding to the voice information fed back by the voice recognition application program, wherein the target text information is obtained by performing voice recognition on the voice information by the voice recognition application program.
3. The method of claim 1, wherein the step of obtaining target text information corresponding to the voice information used by the user to generate the comment comprises:
collecting voice information used by a user for generating comments;
and carrying out voice recognition on the voice information to obtain target text information corresponding to the voice information.
4. A method according to claim 2 or 3, characterized in that the method further comprises:
while the voice information is being collected, setting the terminal on which the voice recognition application program runs to a mute mode;
and after the voice information has been collected, setting the terminal to a non-mute mode.
5. The method of claim 1, wherein the step of obtaining the target face image in the video information used by the user to generate the comment comprises:
instructing a face recognition application to collect video information used by a user to generate a comment, wherein the face recognition application is: a third party application for face recognition;
and receiving a target face image in the video information fed back by the face recognition application program, wherein the target face image is obtained by carrying out face recognition on the video information by the face recognition application program.
6. The method of claim 1, wherein the step of obtaining the target face image in the video information used by the user to generate the comment comprises:
collecting video information used by a user for generating comments;
and carrying out face recognition on the video information to obtain a target face image in the video information.
7. The method of claim 1, further comprising:
and receiving the comment information fed back by the server, and displaying the comment information.
8. An information transmission apparatus applied to an application program, comprising:
the first acquisition module is used for acquiring target text information corresponding to the voice information used for generating comments by the user;
the second acquisition module is used for acquiring a target face image in the video information used for generating comments by the user;
the sending module is used for setting the same timestamp for the target text information and the target face image, and for sending the text information and the face image with the timestamp set to a server, so that the server combines the target text information and the target face image to generate comment information to be published, wherein the target text information and the target face image are both acquired by the application program after it receives a trigger instruction from the user for publishing comment information; and the comment information to be published is comment information that the server generates by combining the received text information and the received face image when it detects that the timestamp set for the received text information is consistent with the timestamp set for the received face image.
9. The apparatus of claim 8, wherein the first obtaining module comprises:
a first indicating unit, configured to instruct a voice recognition application program to collect voice information used by a user to generate a comment, wherein the voice recognition application program is a third-party application for performing voice recognition;
a first receiving unit, configured to receive target text information corresponding to the voice information fed back by the voice recognition application, where the target text information is obtained by performing voice recognition on the voice information by the voice recognition application.
10. The apparatus of claim 8, wherein the first obtaining module comprises:
the first acquisition unit is used for acquiring voice information used by a user for generating comments;
and the first identification unit is used for carrying out voice identification on the voice information to obtain target text information corresponding to the voice information.
11. The apparatus of claim 9 or 10, further comprising:
the setting module is used for setting the terminal on which the voice recognition application program runs to a mute mode while the voice information is being collected, and for setting the terminal to a non-mute mode after the voice information has been collected.
12. The apparatus of claim 8, wherein the second obtaining module comprises:
the second indicating unit is used for indicating a face recognition application program to acquire video information used by a user for generating comments, wherein the face recognition application program is as follows: a third party application for face recognition;
and the second receiving unit is used for receiving a target face image in the video information fed back by the face recognition application program, wherein the target face image is obtained by carrying out face recognition on the video information by the face recognition application program.
13. The apparatus of claim 8, wherein the second obtaining module comprises:
the second acquisition unit is used for acquiring video information used by a user for generating comments;
and the second identification unit is used for carrying out face identification on the video information to obtain a target face image in the video information.
14. The apparatus of claim 8, further comprising:
and the display module is used for receiving the comment information fed back by the server and displaying the comment information.
15. An electronic device, characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory communicate with one another through the communication bus;
a memory for storing a computer program;
a processor for implementing the method steps of any of claims 1 to 7 when executing a program stored in the memory.
16. A computer-readable storage medium having instructions stored therein, which when run on a computer, cause the computer to implement the method of any one of claims 1-7.
CN201710848158.8A 2017-09-19 2017-09-19 Information sending method, device and equipment Active CN107612815B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710848158.8A CN107612815B (en) 2017-09-19 2017-09-19 Information sending method, device and equipment

Publications (2)

Publication Number Publication Date
CN107612815A CN107612815A (en) 2018-01-19
CN107612815B true CN107612815B (en) 2020-12-25

Family

ID=61060966

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710848158.8A Active CN107612815B (en) 2017-09-19 2017-09-19 Information sending method, device and equipment

Country Status (1)

Country Link
CN (1) CN107612815B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108322832B (en) * 2018-01-22 2022-05-17 阿里巴巴(中国)有限公司 Comment method and device and electronic equipment
CN109325964B (en) * 2018-08-17 2020-08-28 深圳市中电数通智慧安全科技股份有限公司 Face tracking method and device and terminal
CN109787977B (en) * 2019-01-17 2022-09-30 深圳壹账通智能科技有限公司 Product information processing method, device and equipment based on short video and storage medium
CN110413834B (en) * 2019-06-14 2022-07-05 北京字节跳动网络技术有限公司 Voice comment modification method, system, medium and electronic device
CN110366002B (en) * 2019-06-14 2022-03-11 北京字节跳动网络技术有限公司 Video file synthesis method, system, medium and electronic device
CN110599359B (en) * 2019-09-05 2022-09-16 深圳追一科技有限公司 Social contact method, device, system, terminal equipment and storage medium
CN111507774A (en) * 2020-04-28 2020-08-07 上海依图网络科技有限公司 Data processing method and device
CN111681353B (en) * 2020-05-28 2022-03-25 浙江大华技术股份有限公司 Thermal imaging temperature measurement method, integrated equipment and storage medium
CN114760257A (en) * 2021-01-08 2022-07-15 上海博泰悦臻网络技术服务有限公司 Commenting method, electronic device and computer readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102595212A (en) * 2010-12-16 2012-07-18 微软公司 Simulated group interaction with multimedia content
CN104714937A (en) * 2015-03-30 2015-06-17 北京奇艺世纪科技有限公司 Method and device for releasing comment information
CN104750387A (en) * 2015-03-24 2015-07-01 联想(北京)有限公司 Information processing method and electronic equipment
CN105592365A (en) * 2015-12-20 2016-05-18 天脉聚源(北京)科技有限公司 Method and device for displaying guest-predicted scores

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6176991B2 (en) * 2013-04-26 2017-08-09 キヤノン株式会社 Information processing apparatus, control method thereof, and program
CN104298682B (en) * 2013-07-18 2018-02-23 广州华久信息科技有限公司 A kind of evaluation method and mobile phone of the information recommendation effect based on Facial Expression Image
US10699454B2 (en) * 2014-12-30 2020-06-30 Facebook, Inc. Systems and methods for providing textual social remarks overlaid on media content
CN105045899A (en) * 2015-08-03 2015-11-11 北京金山安全软件有限公司 Comment content providing method and device and terminal equipment
CN105224627A (en) * 2015-09-23 2016-01-06 网易传媒科技(北京)有限公司 A kind of information getting method and device
CN105898599A (en) * 2015-12-09 2016-08-24 乐视网信息技术(北京)股份有限公司 Video comment method and device and terminal equipment
CN105608623A (en) * 2015-12-20 2016-05-25 天脉聚源(北京)科技有限公司 Method and device for displaying guess scores
CN106792170A (en) * 2016-12-14 2017-05-31 合网络技术(北京)有限公司 Method for processing video frequency and device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant