CN111046196A

CN111046196A - Voice comment method, system, medium and device based on picture

Info

Publication number: CN111046196A
Application number: CN201911372145.3A
Authority: CN
Inventors: 时红仁
Original assignee: Shanghai Qinggan Intelligent Technology Co Ltd
Current assignee: Shanghai Qinggan Intelligent Technology Co Ltd
Priority date: 2019-12-27
Filing date: 2019-12-27
Publication date: 2020-04-21

Abstract

The invention provides a method, a system, a medium and equipment for voice comment based on pictures, wherein the method for voice comment based on pictures comprises the following steps: displaying a picture currently browsed by a user; when a user enters an image browsing mode, automatically detecting the comment voice; and/or receiving a comment instruction of a user, starting a detection function of the comment voice to associate the comment voice to the picture, and adding a voice icon on the picture to link the stored comment voice through the voice icon. The invention can connect the voice comment function of the picture with the Internet of vehicles, and realizes the recording and playing functions of a certain picture through the microphone button of the steering wheel in the vehicle while displaying the picture through the vehicle-mounted display screen.

Description

Voice comment method, system, medium and device based on picture

Technical Field

The invention belongs to the field of multimedia control, relates to a method for adding comment voice to a picture, and particularly relates to a voice comment method, a voice comment system, a voice comment medium and voice comment equipment based on the picture.

Background

In the era of faster and faster pace of life, the leisure mode and the entertainment mode of users are accelerated, the spiritual culture requirements of users are more and more obvious, and the interaction between users is developed towards a direction of being more convenient and faster. The traditional comment mode of the social message is character input, and with the diversification of interactive forms, a user can add static pictures or dynamic pictures such as emoticons and the like while commenting so as to enrich the emotion to be expressed by the user on the basis of character expression.

However, there is no more vivid image when commenting on a browsed picture, and a comment method capable of sufficiently expressing the emotion of the user cannot intuitively feel the mood and mood of the commenting user when browsing a picture through the voice of the user. Although the prior art can realize a simple voice comment function through a mobile terminal, the requirements of users on convenience and flexibility cannot be met, and a more flexible and more intelligent voice comment method is lacked about voice comments.

Therefore, how to provide a method, a system, a medium and a device for voice comment based on a picture to solve the defects that the voice comment cannot be flexibly and intelligently fused to the picture in the prior art and the like becomes a technical problem to be solved by technical personnel in the field.

Disclosure of Invention

In view of the above drawbacks of the prior art, an object of the present invention is to provide a method, a system, a medium, and a device for voice comment based on picture, which are used to solve the problem that the prior art cannot flexibly and intelligently fuse voice comments into a picture.

To achieve the above and other related objects, an aspect of the present invention provides a method for voice comment based on picture, including: displaying a picture currently browsed by a user; receiving and storing comment voices of users; and associating the comment voice to the picture, and adding a voice icon on the picture so as to link the stored comment voice through the voice icon.

In an embodiment of the present invention, the step of receiving and storing the comment voice of the user includes: when a user enters an image browsing mode, automatically detecting the comment voice; and/or receiving a comment instruction of a user, and starting a detection function of the comment voice.

In an embodiment of the present invention, the step of receiving a comment instruction from a user and starting the comment speech detection function includes: receiving touch operation of a user, and starting a recording function associated with the touch operation in advance according to the touch operation; or receiving a key instruction of a user, and starting a recording function pre-associated with the key instruction according to the key instruction; the key instructions comprise playing key instructions and recording key instructions, the playing key instructions are used for playing comment voices recorded by the user, and the recording key instructions are used for re-recording the comment voices of the user browsing the picture currently.

In an embodiment of the present invention, the voice icon links at least one of the comment voices, and the method for comment by voice based on picture further includes: when the user clicks the voice icon, displaying a list of the comment voices; and receiving a voice viewing instruction of a user, and playing the comment voice according to the voice viewing instruction.

Another aspect of the present invention provides a voice comment system based on pictures, including: the display module is used for displaying the picture currently browsed by the user; the voice receiving module is used for receiving and storing comment voices of the users; and the association module is used for associating the comment voice to the picture and adding a voice icon on the picture so as to link the stored comment voice through the voice icon.

Yet another aspect of the present invention provides a medium having stored thereon a computer program that, when executed by a processor, implements the picture-based voice comment method.

A final aspect of the invention provides an apparatus comprising: a processor and a memory; the memory is configured to store a computer program and the processor is configured to execute the computer program stored by the memory to cause the apparatus to perform the picture-based voice comment method.

In an embodiment of the present invention, the device is a vehicle end or a mobile terminal.

In an embodiment of the present invention, when the device is a mobile terminal, a touch operation between a user and the mobile terminal is received, and a recording function associated with the touch operation in advance is started according to the touch operation.

In an embodiment of the present invention, when the device is a car end, a key instruction of a user on a steering wheel in a car is received, and a recording function of the car end is started; the key instruction on the steering wheel in the vehicle comprises a playing key instruction and a recording key instruction; the playing key instruction is an instruction sent by a user clicking a microphone button of a steering wheel and is used for playing comment voice recorded by the user; the recording key instruction is an instruction for keeping the action of pressing a microphone button of the steering wheel by a user for more than a preset time period, and is used for re-recording comment voice of a current picture browsed by the user.

As described above, the method, system, medium, and apparatus for voice comment based on picture according to the present invention have the following advantages:

can be in the same place the pronunciation comment function and the car networking of picture contact, realize the recording and the play function to a certain picture through the microphone button of steering wheel in the car when showing the picture through vehicle-mounted display screen, the text input step of commenting the picture when having simplified the user and browsed the picture, directly link the pronunciation to the picture in, simple quick one step targets in place, on the other hand also provides the mode of a browsing picture comment for the user, replace the pronunciation comment with the word comment, make the user obtain better sense organ experience when listening to.

Drawings

Fig. 1 is a diagram illustrating an application scene architecture of the method for voice comment based on picture according to an embodiment of the present invention.

FIG. 2 is a schematic flow chart diagram illustrating a method for voice comment based on picture according to an embodiment of the present invention.

Fig. 3 is a receiving flow chart of the method for voice comment based on picture according to an embodiment of the present invention.

Fig. 4 is a flow chart showing comment instruction receiving in an embodiment of the method for voice comment based on picture according to the present invention.

Fig. 5 is a schematic diagram of a desert picture interface in an embodiment of the picture-based voice comment method of the present invention.

Fig. 6 is a schematic diagram illustrating a comment speech list in an embodiment of the method for speech comment based on picture according to the present invention.

Fig. 7 is a flow chart illustrating a desert picture comment method according to an embodiment of the present invention.

Fig. 8 is a schematic structural diagram of the voice comment system based on pictures according to an embodiment of the present invention.

Description of the element reference numerals

8 voice comment system based on picture

81 display module

82 voice receiving module

83 correlation module

S21-S23

S221 to S222

S222A-S222B steps

S71-S76

Detailed Description

The embodiments of the present invention are described below with reference to specific embodiments, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention. It is to be noted that the features in the following embodiments and examples may be combined with each other without conflict.

It should be noted that the drawings provided in the following embodiments are only for illustrating the basic idea of the present invention, and the components related to the present invention are only shown in the drawings rather than drawn according to the number, shape and size of the components in actual implementation, and the type, quantity and proportion of the components in actual implementation may be changed freely, and the layout of the components may be more complicated.

The picture-based voice comment method, the picture-based voice comment system, the picture-based voice comment medium and the picture-based voice comment equipment perform voice comment on browsed pictures through a mobile terminal or a vehicle terminal, particularly can connect the voice comment function of the pictures with the internet of vehicles through the vehicle terminal, display the pictures through a vehicle-mounted display screen, and simultaneously realize the functions of recording and playing a certain picture through a microphone button of a steering wheel in a vehicle.

The method, system, medium, and apparatus for voice comment based on picture provided in this embodiment will be described in detail with reference to the drawings.

As shown in fig. 1, in an embodiment, the voice comment method based on pictures is applied to a mobile terminal or a vehicle terminal. When a user browses a picture through an interface of the mobile terminal, voice comment is conducted on the picture by using the voice comment method based on the picture, and an icon for commenting voice is displayed on the picture; when a user browses the picture through the vehicle end, the picture-based voice comment method can be applied to carry out voice comment on the picture, and the browsed picture and the voice comment icon are displayed through the vehicle-mounted display screen.

In a specific application scene, a user clicks a favorite photo and selects to share the favorite photo to a map application or a vehicle application, and a system automatically generates a photo set based on a period of time, such as the weekend of the week or a certain travel place; after a user gets on the vehicle, the user can see favorite photos and videos on the vehicle, and other people can quickly collect the photos to the mobile terminal or the vehicle end of the user, so that the user can conveniently use the photos for the next trip and life; when a user browses, voice comments are made on the photo, and a voice icon, such as a trumpet, appears on the photo or the picture, and the voice with comments can be recorded and played.

As shown in fig. 2, in an embodiment, when the voice comment method based on pictures is applied to the car end, the photo application supports a common sharing interface, and sends the photo to the application of my car; synchronizing the photos to the cloud end, and automatically downloading the photos to a vehicle end after a user gets on a vehicle; the cloud server classifies the photos according to the address information carried by the photos, and automatically classifies the photos according to the identification of the photos, such as landscapes, foods, people, articles, buildings and the like; the system can be used as an album for browsing or address navigation, sharing recommendation and other applications; when a user browses, voice comments can be made on a certain picture. The method specifically comprises the following steps:

and S21, displaying the picture currently browsed by the user.

Specifically, when the picture browsed by the user is displayed through the vehicle-mounted display screen at the vehicle end, the user can switch the displayed picture by sliding the screen horizontally and vertically. The pictures are pictures taken by the user, pictures published by others in public or pictures collected by the user and having special meanings or subjects.

And S22, receiving and storing the comment voice of the user.

As shown in fig. 5, in one embodiment, S22 includes:

and S221, automatically detecting the comment voice after the user enters a picture browsing mode.

Specifically, the picture browsing mode is used as a mark for starting a voice detection function, when a user chats with other people and talks about a certain browsed picture, the conversation content of the user is automatically detected, and short comment voice is extracted through semantic analysis. The comment speech can be a short conversation of the user or a phrase or a short sentence composed of core words extracted from a longer conversation.

S222, receiving a comment instruction of a user, and starting a detection function of the comment voice.

Specifically, the comment instruction refers to an operation with a clear indication, and the operation includes a screen touch or screen long-time pressing operation on the mobile terminal by a user, or a voice comment function in a pull-down menu is selected by an editing icon; the operation also includes a user's operation of clicking or long-pressing a microphone button on an in-vehicle steering wheel on the vehicle end.

As shown in fig. 4, in an embodiment, S222 includes:

S222A, receiving the touch operation of the user, and starting the recording function pre-associated with the touch operation according to the touch operation.

Specifically, a point touch operation of a user on a screen of the mobile terminal is received, a voice comment function is started when a browsed picture is clicked, and when the user sends out a voice comment, a volume jumping icon is displayed on the screen of the mobile terminal and floats on the displayed picture.

S222B, receiving a key instruction of a user, and starting a recording function pre-associated with the key instruction according to the key instruction; the key instructions comprise playing key instructions and recording key instructions, the playing key instructions are used for playing comment voices recorded by the user, and the recording key instructions are used for re-recording the comment voices of the user browsing the picture currently.

Specifically, receiving a recording key instruction of a user for keeping a microphone button of a steering wheel on the steering wheel in a car for more than a preset time period, and starting a recording function, wherein the starting of the recording function is recording of a first comment voice or re-recording of the comment voice which is unsatisfied by the user for a previous comment voice; and receiving a play key instruction sent by a user clicking a microphone button of the steering wheel on the steering wheel in the vehicle, and starting a play function of comment voice. It should be noted that the preset time period may be any reasonable time period that can be distinguished from the action time of the key press, and that enables the detection position of the vehicle-end signal to be determined, for example, 2 seconds, 5 seconds, or 10 seconds. The vehicle-end signal detection part can judge the reasonability of the time period by setting a threshold, for example, when the action of pressing a microphone button of a steering wheel is kept for more than 5 minutes, the judgment is unreasonable, and the vehicle-end signal detection part may be a vehicle-end system fault or other abnormal reasons.

And S23, associating the comment voice to the picture, and adding a voice icon on the picture to link the stored comment voice through the voice icon.

In one embodiment, the voice icon links at least one comment voice, and when a user clicks the voice icon or slides the screen upwards, a list of the comment voices is displayed; and receiving a voice viewing instruction of a user, and playing the comment voice according to the voice viewing instruction.

Specifically, in the list of comment voices, each comment voice corresponds to one sub-voice icon; displaying the list of the comment voices through a mobile terminal, receiving a click instruction of a user for the sub-voice icon, and playing the corresponding comment voices; and displaying the list of the comment voices through the vehicle-mounted end, receiving a key instruction of clicking the previous/next part of the steering wheel by a user, and playing the corresponding comment voices. Furthermore, the sub-voice icons can count corresponding browsing listening times through a cloud, a plurality of sub-voice icons are arranged in the comment voice list according to the listening times from most to least, and counting updating and arrangement sequence updating are carried out according to the listening times at different times. Furthermore, one side of the sub-voice icon can correspondingly display the browsing amount of the comment voice so as to be referred by a user who needs to listen to the comment voice at a later stage.

As shown in fig. 5, in one embodiment, the picture-based voice comment method uses a desert picture as a specific embodiment. In the desert picture, there is an endless desert, there is a walking camel in the close view, and there is the shadow of this camel on the ground. And a trumpet icon is arranged at the lower left corner of the desert picture, comment voices of the desert picture can be played correspondingly by clicking the trumpet icon, and play and pause buttons and play duration are displayed at the bottommost part of the picture. The comment voices of the desert picture can comprise a plurality of voices, and when the desert picture is published to a public social platform to be browsed, a comment voice with the largest browsing click amount after cloud data statistics is clicked by clicking the trumpet icon in a default mode.

As shown in fig. 6, in an embodiment, when the desert picture in fig. 5 is displayed on a vehicle-mounted display screen at a vehicle end, the trumpet icon on the picture can be clicked to play the comment voice with the largest default browsing times. For example, the comment speech is a desert related song that a user hums perceptually when seeing the desert picture. Receiving a touch instruction of the user sliding upwards through the vehicle-mounted display screen, and displaying a list of comment voices: speech 1, speech 2, speech 3, speech 4 and speech 5.

As shown in fig. 7, in an embodiment, the step of voice commenting the desert picture in fig. 5 includes:

and S71, displaying the desert picture currently browsed by the user.

Specifically, when a user browses pictures through a vehicle-mounted display screen at a vehicle end, the pictures are sequentially browsed through the classification of landscape pictures, when the user browses the desert pictures, the user feels that the vehicle-mounted display screen does not receive a picture switching instruction of the user any more, so that the desert pictures are displayed on the vehicle-mounted display screen for a long time, and the user can conveniently comment the desert pictures in a voice mode.

S72, when the user keeps pressing the microphone button of the steering wheel for 2 seconds, the recording button command of the user on the in-vehicle steering wheel is received to record the comment voice "where there is oasis".

Specifically, when the action of pressing the microphone button of the steering wheel by the user is maintained for 2 seconds, it is determined that the user is going to make a voice comment by the signal detection part at the vehicle end, and the voice receiving function is activated so as to receive the comment voice "where there is oasis" uttered by the user at any time.

And S73, associating the comment voice 'where there is oasis' to the desert picture, and adding a voice icon on the desert picture.

Specifically, when the user releases the microphone button, the user is judged to finish recording the comment voice through the signal detection part at the vehicle end, and then a trumpet icon is displayed on the desert picture. It should be noted that if no one carries out voice comment before the desert picture, a trumpet icon is newly added in the desert picture; if someone carries out voice comment before the desert picture, the previous trumpet icon is used, but the comment voice which is linked by the trumpet icon and has the largest listening times is replaced by the comment voice which is recorded by the user at the moment, and the arrangement sequence of the listening times is restored again when the user finishes the voice comment operation of the picture. The time for replacing the just recorded comment voice can be judged by setting a threshold value, for example, the sequence of listening times is automatically recovered from more to less after 12 hours or 24 hours; and correspondingly recovering the arrangement sequence of the listening times from more to less through whether the picture displayed by the current vehicle-mounted display screen is changed or not by the vehicle terminal, judging that the user finishes recording the comment voice when the picture currently browsed by the user is switched from the desert picture to the food picture, and recovering the arrangement sequence of the listening times from more to less.

S74, when the user clicks the microphone button of the steering wheel, receiving a play button command from the user on the steering wheel in the car to play the comment voice "where there is oasis".

Specifically, when the user finishes recording the comment voice "where there is oasis" and wants to listen to the comment by playback, a signal detection position at the vehicle end determines that the user needs to listen to the comment voice from a play key instruction of clicking a microphone button on a steering wheel in the vehicle, and plays the comment voice "where there is oasis" that the user has just recorded. Further, when the user carries out voice comment, because the desert picture is in a public browsing state, other people can also carry out voice comment, when the vehicle end carries out playing, the recorded comment voice is matched through the vehicle identification number, and a latest comment voice at the vehicle end is played according to a playing key instruction of the user.

And S75, when the user is not satisfied with the comment voice and clicks the voice icon, popping up a voice deleting dialog box to delete the 'where oasis' of the comment voice according to the deleting operation selected by the user.

Specifically, when the user needs to delete the comment voice, a voice deletion dialog box can be popped up by pressing the voice picture for a long time. It should be noted that if other comment voices exist after the comment voice is deleted, the voice icon in the desert picture does not disappear; if no other comment voices exist after the comment voice is deleted, the voice icon in the desert picture disappears.

S76, when the action of the user pressing the microphone button of the steering wheel is maintained for 2 seconds, a recording key instruction of the user on the in-vehicle steering wheel is received to re-record the comment voice "where there can be one oasis in the wide desert".

Specifically, if the voice icon in the desert picture disappears, a new voice icon is added through the newly recorded comment voice; if the voice icon in the desert picture does not disappear, the corresponding relation of the link between the voice icon and the comment voice is just changed, and the comment voice linked after the voice icon is clicked corresponds to the comment voice recorded by the user, namely, where a oasis can exist in the desert.

It should be noted that the scope of the method for voice comment based on picture according to the present invention is not limited to the order of executing steps listed in this embodiment, and all the solutions implemented by adding, subtracting, and replacing steps in the prior art according to the principle of the present invention are included in the scope of the present invention.

As shown in fig. 8, in an embodiment, the image-based voice comment system 8 is applied to a mobile terminal or a vehicle, and specifically includes a display module 81, a voice receiving module 82, and an association module 83.

The display module 81 is configured to display a currently browsed picture of a user.

The voice receiving module 82 is used for receiving and storing comment voices of users.

In an embodiment, the voice receiving module 82 is specifically configured to automatically detect the comment voice after the user enters the picture browsing mode; and/or receiving a comment instruction of a user, and starting a detection function of the comment voice.

The association module 83 is configured to associate the comment voice with the picture, and add a voice icon on the picture to link to the stored comment voice through the voice icon.

It should be noted that the division of each module of the picture-based voice comment system is only a division of a logical function, and when the picture-based voice comment system is actually implemented, all or part of the division may be integrated into one physical entity, or may be physically separated. And the modules can be realized in a form that all software is called by the processing element, or in a form that all the modules are realized in a form that all the modules are called by the processing element, or in a form that part of the modules are called by the hardware. For example: a module may be a separate processing element, or may be implemented by being integrated into a chip of the system. Further, a certain module may be stored in the memory of the system in the form of program codes, and a certain processing element of the system may call and execute the functions of the following certain module. Other modules are implemented similarly. All or part of the modules can be integrated together or can be independently realized. The processing element described herein may be an integrated circuit having signal processing capabilities. In implementation, the steps of the above method or the following modules may be implemented by hardware integrated logic circuits in a processor element or instructions in software.

The following modules may be one or more integrated circuits configured to implement the above methods, for example: one or more Application Specific Integrated Circuits (ASICs), one or more Digital Signal Processors (DSPs), one or more Field Programmable Gate Arrays (FPGAs), and the like. When some of the following modules are implemented in the form of a program code called by a processing element, the processing element may be a general-purpose processor, such as a Central Processing Unit (CPU) or other processor capable of calling the program code. These modules may be integrated together and implemented in the form of a System-on-a-chip (SOC).

It should be noted that the picture-based voice comment system of the present invention can implement the picture-based voice comment method of the present invention, but the implementation apparatus of the picture-based voice comment method of the present invention includes, but is not limited to, the structure of the picture-based voice comment system described in this embodiment, and all structural modifications and substitutions in the prior art made according to the principles of the present invention are included in the scope of the present invention. It should be noted that the picture-based voice comment method and the picture-based voice comment system are also applicable to browsing applications of other audiovisual multimedia form contents such as videos, songs, friend circle messages and the like, and are included in the protection scope of the present invention.

In an embodiment, the computer storage medium of the present invention stores a computer program, and the computer program is executed by a processor to implement the method for voice comment based on picture.

Those of ordinary skill in the art will understand that: all or part of the steps for implementing the above method embodiments may be performed by hardware associated with a computer program. The aforementioned computer program may be stored in a computer readable storage medium. When executed, the program performs steps comprising the method embodiments described above; and the aforementioned computer-readable storage media comprise: various computer storage media that can store program codes, such as ROM, RAM, magnetic or optical disks.

In one embodiment, the apparatus of the present invention comprises: a processor, a memory, a transceiver, a communication interface, or/and a system bus. The memory and the communication interface are connected with the processor and the transceiver through a system bus and complete mutual communication, the memory is used for storing a computer program, the communication interface is used for communicating with other equipment, and the processor and the transceiver are used for operating the computer program to enable the equipment to execute all steps of the picture-based voice comment method.

The above-mentioned system bus may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The system bus may be divided into an address bus, a data bus, a control bus, and the like. The communication interface is used for realizing communication between the database access device and other equipment (such as a client, a read-write library and a read-only library). The Memory may include a Random Access Memory (RAM), and may further include a non-volatile Memory (non-volatile Memory), such as at least one disk Memory.

The Processor may be a general-purpose Processor, and includes a Central Processing Unit (CPU), a Network Processor (NP), and the like; the device can also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic device, or discrete hardware components.

In an embodiment, the device is a vehicle end or a mobile terminal. The mobile terminal includes, but is not limited to, a smart phone, a tablet computer, and a PDA (Personal Digital Assistant).

In an embodiment, when the device is a mobile terminal, touch operations of a user and the mobile terminal are received, and a recording function pre-associated with the touch operations is started according to the touch operations.

In one embodiment, when the device is a car end, a key instruction of a user on a steering wheel in a car is received, and a recording function of the car end is started; the key instruction on the steering wheel in the vehicle comprises a playing key instruction and a recording key instruction; the playing key instruction is an instruction sent by a user clicking a microphone button of a steering wheel and is used for playing comment voice recorded by the user; the recording key instruction is an instruction for keeping the action of pressing a microphone button of the steering wheel by a user for more than a preset time period, and is used for re-recording comment voice of a current picture browsed by the user.

In an embodiment, in the list of comment voices, each comment voice corresponds to one sub-voice icon; displaying the list of the comment voices through a mobile terminal, receiving a click instruction of a user for the sub-voice icon, and playing the corresponding comment voices; and displaying the list of the comment voices through the vehicle-mounted end, receiving a key instruction of clicking the previous/next part of the steering wheel by a user, and playing the corresponding comment voices.

In summary, the voice comment method, the voice comment system, the voice comment medium and the voice comment equipment based on the pictures can connect the voice comment function of the pictures with the internet of vehicles, realize the recording and playing functions of a certain picture through the microphone button of the steering wheel in the vehicle while displaying the pictures through the vehicle-mounted display screen, simplify the text input step of commenting the pictures when a user browses the pictures, directly link the voices to the pictures, simply and quickly achieve one step, provide a mode for browsing the pictures for the user, replace the text comments with the voice comments, and enable the user to obtain better sensory experience when listening. The invention effectively overcomes various defects in the prior art and has high industrial utilization value.

The foregoing embodiments are merely illustrative of the principles and utilities of the present invention and are not intended to limit the invention. Any person skilled in the art can modify or change the above-mentioned embodiments without departing from the spirit and scope of the present invention. Accordingly, it is intended that all equivalent modifications or changes which can be made by those skilled in the art without departing from the spirit and technical spirit of the present invention be covered by the claims of the present invention.

Claims

1. A voice comment method based on pictures is characterized by comprising the following steps:

displaying a picture currently browsed by a user;

receiving and storing comment voices of users;

and associating the comment voice to the picture, and adding a voice icon on the picture so as to link the stored comment voice through the voice icon.

2. The picture-based voice comment method according to claim 1, wherein the step of receiving and storing a comment voice of a user includes:

when a user enters an image browsing mode, automatically detecting the comment voice; and/or

And receiving a comment instruction of a user, and starting a detection function of the comment voice.

3. The picture-based voice comment method according to claim 2, wherein the step of receiving a comment instruction from a user and starting a detection function of the comment voice includes:

receiving touch operation of a user, and starting a recording function associated with the touch operation in advance according to the touch operation; or

Receiving a key instruction of a user, and starting a recording function pre-associated with the key instruction according to the key instruction; the key instructions comprise playing key instructions and recording key instructions, the playing key instructions are used for playing comment voices recorded by the user, and the recording key instructions are used for re-recording the comment voices of the user browsing the picture currently.

4. The picture-based voice comment method according to claim 1, wherein the voice icon links at least one of the comment voices, the picture-based voice comment method further comprising:

when the user clicks the voice icon, displaying a list of the comment voices;

and receiving a voice viewing instruction of a user, and playing the comment voice according to the voice viewing instruction.

5. A picture-based voice comment system, comprising:

the display module is used for displaying the picture currently browsed by the user;

the voice receiving module is used for receiving and storing comment voices of the users;

and the association module is used for associating the comment voice to the picture and adding a voice icon on the picture so as to link the stored comment voice through the voice icon.

6. A medium on which a computer program is stored, which program, when being executed by a processor, is adapted to carry out the picture-based voice comment method of any one of claims 1 to 4.

7. An apparatus, comprising: a processor and a memory;

the memory is configured to store a computer program, and the processor is configured to execute the computer program stored by the memory to cause the apparatus to perform the picture-based voice comment method according to any one of claims 1 to 4.

8. The device of claim 7, wherein the device is a vehicle end or a mobile terminal.

9. The apparatus of claim 8,

and when the equipment is a mobile terminal, receiving touch operation of a user and the mobile terminal, and starting a recording function associated with the touch operation in advance according to the touch operation.

10. The apparatus of claim 8,

when the equipment is the car end, receiving a key instruction of a user on a steering wheel in a car, and starting a recording function of the car end;

the key instruction on the steering wheel in the vehicle comprises a playing key instruction and a recording key instruction;

the playing key instruction is an instruction sent by a user clicking a microphone button of a steering wheel and is used for playing comment voice recorded by the user; the recording key instruction is an instruction for keeping the action of pressing a microphone button of the steering wheel by a user for more than a preset time period, and is used for re-recording comment voice of a current picture browsed by the user.