CN111782172A - Information display method and device - Google Patents

Information display method and device Download PDF

Info

Publication number
CN111782172A
CN111782172A CN202010591971.3A CN202010591971A CN111782172A CN 111782172 A CN111782172 A CN 111782172A CN 202010591971 A CN202010591971 A CN 202010591971A CN 111782172 A CN111782172 A CN 111782172A
Authority
CN
China
Prior art keywords
information
picture
session
filled
customized
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010591971.3A
Other languages
Chinese (zh)
Other versions
CN111782172B (en
Inventor
王夏鸣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Volkswagen Mobvoi Beijing Information Technology Co Ltd
Original Assignee
Volkswagen Mobvoi Beijing Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Volkswagen Mobvoi Beijing Information Technology Co Ltd filed Critical Volkswagen Mobvoi Beijing Information Technology Co Ltd
Priority to CN202010591971.3A priority Critical patent/CN111782172B/en
Publication of CN111782172A publication Critical patent/CN111782172A/en
Application granted granted Critical
Publication of CN111782172B publication Critical patent/CN111782172B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3343Query execution using phonetics
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Acoustics & Sound (AREA)
  • Artificial Intelligence (AREA)
  • Telephonic Communication Services (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The embodiment of the invention discloses an information display method and a device, wherein the method comprises the following steps: when the picture information to be displayed is acquired, analyzing the picture information to be displayed to acquire picture content; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code; generating a customized session according to the picture content and a preset session; and executing voice interaction operation according to the customized conversation. According to the technical scheme of the embodiment of the invention, the customized conversation is generated according to the picture content acquired by analyzing the picture information to be displayed and the preset conversation, and the voice interaction operation is carried out with the user through the customized conversation, so that the user can acquire the content in the picture information to be displayed in a voice interaction mode, and the conversion effect of the displayed information is improved.

Description

Information display method and device
Technical Field
The embodiment of the invention relates to a voice interaction technology and a vehicle-mounted communication technology, in particular to an information display method and device.
Background
Along with the continuous progress of science and technology, vehicle-mounted terminal equipment is also more functional, intelligent more and more, can realize multiple functions such as navigation, call and music broadcast, simultaneously, along with the continuous expansion of car ownership, also more and more huge through vehicle-mounted terminal equipment acquisition show audience group of information.
In the prior art, except listening to a broadcast program, the information (e.g., advertisement) display mode of the vehicle-mounted terminal device still takes a picture form as a main mode, for example, when the vehicle-mounted terminal device is started, the picture information is displayed; when a user calls an Application program (APP), the picture information is displayed in the starting process of the Application program.
However, due to the particularity of the use scene of the vehicle-mounted terminal device, the user does not always look at the screen in the driving scene, the information conversion effect is not obvious in the mode of displaying the commodity information or the service information in the form of pictures, and meanwhile, the driver pays attention to the screen picture, so that the great potential safety hazard is brought, and the normal driving is influenced.
Disclosure of Invention
The embodiment of the invention provides an information display method and device, which realize voice interaction operation with a user through picture content acquired by analyzing picture information to be displayed.
In a first aspect, an embodiment of the present invention provides an information display method, including:
when the picture information to be displayed is acquired, analyzing the picture information to be displayed to acquire picture content; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
generating a customized session according to the picture content and a preset session;
and executing voice interaction operation according to the customized conversation.
In a second aspect, an embodiment of the present invention provides an information displaying method, applied to a server, including:
when the picture information to be displayed is acquired, analyzing the picture information to be displayed to acquire picture content; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
generating a customized session according to the picture content and a preset session;
and executing voice interaction operation through the target terminal equipment according to the customized conversation.
In a third aspect, an embodiment of the present invention provides an information display apparatus, including:
the first picture content acquisition module is used for analyzing the picture information to be displayed to acquire picture content when the picture information to be displayed is acquired; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
the first customized session acquisition module is used for generating a customized session according to the picture content and a preset session;
and the first voice interaction execution module is used for executing voice interaction operation according to the customized conversation.
In a fourth aspect, an embodiment of the present invention provides an information displaying apparatus, including:
the second picture content acquisition module is used for analyzing the picture information to be displayed to acquire picture content when the picture information to be displayed is acquired; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
the second customized session acquisition module is used for generating a customized session according to the picture content and a preset session;
and the second voice interaction execution module is used for executing voice interaction operation through the target terminal equipment according to the customized conversation.
In a fifth aspect, an embodiment of the present invention provides an apparatus, including:
one or more processors;
storage means for storing one or more programs;
when the one or more programs are executed by the one or more processors, the one or more processors implement the information presentation method according to any embodiment of the present invention.
In a sixth aspect, an embodiment of the present invention further provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the information presentation method according to any embodiment of the present invention.
According to the technical scheme of the embodiment of the invention, the customized conversation is generated according to the picture content acquired by analyzing the picture information to be displayed and the preset conversation, and the voice interaction operation is carried out with the user through the customized conversation, so that the user can acquire the content in the picture information to be displayed in a voice interaction mode, the conversion effect of the display information is improved, the potential safety hazard caused by the fact that a driver watches a screen picture is avoided, normal driving is influenced, meanwhile, special display information does not need to be designed aiming at the driving scene, the existing display information in a large number of picture forms is directly converted into the voice interaction display information suitable for the driving scene, and the information utilization rate of the existing picture is improved.
Drawings
Fig. 1A is a flowchart of an information displaying method according to an embodiment of the present invention;
FIG. 1B is a flow chart of voice interaction provided in a first embodiment of the present invention;
FIG. 1C is a flow chart of voice interaction provided by a second embodiment of the present invention;
FIG. 2 is a flowchart of an information displaying method according to a second embodiment of the present invention;
FIG. 3 is a block diagram of an information display apparatus according to a third embodiment of the present invention;
FIG. 4 is a block diagram of an information display apparatus according to a fourth embodiment of the present invention;
fig. 5 is a block diagram of a device according to a fifth embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the invention and are not limiting of the invention. It should be further noted that, for the convenience of description, only some of the structures related to the present invention are shown in the drawings, not all of the structures.
Example one
Fig. 1A is a flowchart of an information displaying method according to an embodiment of the present invention, where this embodiment is applicable to a terminal device performing voice interaction with a user according to acquired picture information to be displayed, and the method may be executed by an information displaying apparatus according to a third embodiment of the present invention, where the apparatus may be implemented by software and/or hardware and integrated in the terminal device, and typically, may be integrated in a vehicle-mounted terminal device, and the method specifically includes the following steps:
s110, when the picture information to be displayed is obtained, analyzing the picture information to be displayed to obtain picture content; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code.
When the terminal equipment acquires the picture information to be displayed, acquiring specific picture content in the picture information to be displayed through an image recognition technology; the text information is a character part appearing in the picture information to be displayed, and the entity object is an image (such as a person, a building, an article and the like) with practical significance appearing in the picture information to be displayed; specifically, text information and image information may appear at a plurality of positions, and therefore, by identifying the picture information to be presented, a plurality of text information pieces and a plurality of image information areas are acquired, and the positions of the text information pieces and the image information areas in the picture information to be presented are recorded. In particular, the terminal device in the present application includes any intelligent terminal with a voice broadcast function, and in the present application, the type of the terminal device is not particularly limited.
Optionally, in an embodiment of the present invention, the obtaining of the picture content includes: classifying the entity object and marking the class of the entity object; and/or performing website address conversion processing on the two-dimension code, and marking the category of the two-dimension code; the two-dimension code category comprises WeChat public number two-dimension codes or webpage information two-dimension codes. The entity object category may be divided as needed, for example, an automobile is used as an entity object category, and when it is determined that the image content includes an automobile, the corresponding entity object is marked as an automobile; the SUV (sport utility Vehicle), SPV (Special Purpose Vehicle) and truck may also be used as the entity object category, and when it is determined that the picture content includes a Vehicle, the specific category of the Vehicle needs to be further determined to be used as the corresponding entity object category. The two-dimensional Code (QR Code) represents literal numerical value information through a plurality of geometric shapes corresponding to a binary system, each two-dimensional Code substantially corresponds to a specific website, and therefore, when a two-dimensional Code in picture information to be displayed is acquired, the two-dimensional Code is identified and jumped to the corresponding website to determine whether the two-dimensional Code is a corresponding micro-message public signal link or a common website link.
Optionally, in this embodiment of the present invention, the obtaining of the picture content further includes: extracting text attributes according to the text information; wherein the text attribute comprises a brand name, a contact phone number, website information, price information, address information and/or a publicity language. The brand name can be searched in the picture information to be displayed according to each alternative brand name included in a pre-stored brand name list so as to obtain the brand name appearing in the picture information to be displayed; wherein the pre-stored list of brand names may be updated periodically by the server. Particularly, if a plurality of brand names are extracted, the brand name matched with the picture information to be displayed is determined according to the entity object type in the picture information to be displayed. For example, the sales promotion information of a new automobile of company a, which refers to the fuel card of company "buy and send value 1000 yuan B", is obtained by extracting the text information, and two brand names of company "a" and company "B" are obtained, and the picture information to be displayed, which includes the entity object category "automobile", can be determined according to the entity object categories extracted from the advertisement, and the product of company "a" includes the entity object category "automobile", and the product of company "B" does not include the entity object category "automobile", so that it can be determined that the sales promotion information is the sales promotion information of company "a", and thus, the brand name matching the sales promotion information is determined to be company "a".
Optionally, in an embodiment of the present invention, the extracting text attributes according to the text information includes at least one of: extracting the contact phone through a first regular expression; extracting website information through a second regular expression; extracting price information through currency characters or currency names; extracting address information by a natural language processing technology; and extracting the propaganda words according to the area of the character area. The Regular Expression (RE) is a logical formula for string operations, and is a "Regular string" composed of predefined specific characters and executing filtering logic, and in the embodiment of the present invention, is used to retrieve text information that conforms to the phone composition rule and the website composition rule. The first regular expression defines the composition rule of the telephone number, for example, when the text information of continuously appearing 11 digits is acquired and '139' is taken as the initial digit, the part of the text information is extracted as the contact telephone; when the number "010" is acquired and then the character "-" and the 8-bit numbers appearing consecutively appear, "010", "-" and these 8-bit numbers are extracted as the contact phone. The second regular expression defines a composition rule of the web address, for example, the web address obtained by "https: and if the continuous character string at the beginning of// www and ending at com. The partial content is extracted as price information when numbers that are prefixed with currency characters (e.g., "$" and "rah") or currency names (e.g., RMB and USD) and appear consecutively are acquired, and/or extracted as price information when numbers that are suffixed with "yuan" and appear consecutively are acquired. The address information can be obtained by Natural Language Processing (NLP) technology. In each acquired text information segment, except the text information segment related to the related information (namely brand name, contact telephone, website address information, price information and/or address information), the text information segment with the largest occupied area in other text information segments is extracted as a publicity phrase.
And S120, generating a customized session according to the picture content and the preset session.
The preset session is a preset information display mode and is used for displaying the acquired picture content according to a preset format so that a user can comprehensively and clearly know the picture content; the preset session may specify a playing sequence of the picture content, for example, the preset session is "publicity + price + brand name + contact call (broadcast twice) + website information", and after the picture content is obtained, the extracted text information is sequentially played according to the above format; the preset session can also be preset man-machine conversation content, and text information and image information in the picture content are filled into the preset session to form a customized session; specifically, at least one data item to be filled in a preset session is obtained; and filling at least one data item to be filled according to the acquired entity object type, the two-dimension code type, the brand name, the contact telephone, the website information, the price information, the address information and/or the publicity language to generate a customized conversation. For example, the session is preset to "this is an advertisement of a brand name? And (c), wherein the price is price information, the contact is called, the details can access website information, and the brand name, the publicity words, the price information, the contact and the website information are respectively filled in corresponding positions to generate the customized conversation when the brand name, the publicity words, the price information, the contact and the website information are extracted according to the text information.
Optionally, in an embodiment of the present invention, after at least one data item to be filled is filled according to the obtained entity object type, the two-dimensional code type, the brand name, the contact phone, the website information, the price information, the address information, and/or the publicity language, the method further includes: judging whether at least one data item to be filled in the preset session is filled completely; and if at least one target data item to be filled is not filled, deleting the target session content matched with the at least one target data item to be filled, and determining that the preset session is filled completely. If the data items to be filled in the preset session are filled completely, directly generating a customized session; if one or more data items to be filled exist in the preset session and are not filled, deleting the corresponding session content, and using the filled preset session as a customized session, for example, in the above technical solution, the preset session is "is an advertisement of a brand name? And [ publicity ], the price is [ price information ], please contact [ contact phone ], the details can be accessed [ website information ], only the brand name, publicity and website information are extracted according to the text information, the price information and the contact phone are not extracted, and then the corresponding preset session is modified to be the 'advertisement of the brand name', do you want to know the details? "and" [ publicity ], details can be accessed [ website information ].
And S130, executing voice interactive operation according to the customized conversation.
If the preset session only specifies the playing sequence of the picture content, for example, the preset session in the above technical solution is "publicity + price + brand name + contact phone (broadcast twice) + website information", then when the voice interaction operation is performed, if the relevant instruction of "stop" or "pause" of the user is not obtained before or during the playing, the picture content will be completely and continuously broadcast, and if the relevant instruction of "stop" or "replay" of the user is obtained, the corresponding operation is performed.
If the preset session is the preset content of the man-machine conversation, for example, the preset session in the above technical solution is "this is an advertisement of a brand name", do you want to know the details? And (a) and (a publicity term), the price is (price information), please contact (telephone contact), the details can be accessed (website information), then according to the playing sequence, only the first piece of information is broadcasted, namely, "is this a piece of (brand name) advertisement, do you want to know the details? If the ' confirmation ' instruction of the user is obtained, the next piece of information is continuously played, namely, ' publicity ', price, ' price information ', contact ' and detail can access ' website information ', and if the ' stop ' instruction is obtained, the broadcasting is finished.
According to the technical scheme of the embodiment of the invention, the customized conversation is generated according to the picture content acquired by analyzing the picture information to be displayed and the preset conversation, and the voice interaction operation is carried out with the user through the customized conversation, so that the user can acquire the content in the picture information to be displayed in a voice interaction mode, the conversion effect of the display information is improved, the potential safety hazard caused by the fact that a driver watches a screen picture is avoided, normal driving is influenced, meanwhile, special display information does not need to be designed aiming at the driving scene, the existing display information in a large number of picture forms is directly converted into the voice interaction display information suitable for the driving scene, and the information utilization rate of the existing picture is improved.
Specific application scenario one
Fig. 1B is a flowchart of voice interaction provided in a specific application scenario of the present invention, which is embodied based on the foregoing embodiment, in this embodiment, after all data items to be filled in a preset session are filled, a vehicle-mounted terminal device performs voice interaction with a user according to an acquired first customized session. Correspondingly, the method of the embodiment specifically includes the following steps:
s210: play the first session content in the first customized session, i.e. "this is an advertisement of [ brand name ], do you want to know how detailed? "; s220 is performed.
S220: acquiring first reply information of a user; if the first reply message is determined to be the confirmation instruction, executing S230; if the first reply message is determined to be a reject instruction, S270 is executed.
S230: playing the second session content in the first customized session, namely, ' publicity ', price, ' contact telephone, ' detail can scan the two-dimensional code '; s240 is performed.
S240: acquiring second reply information of the user; if the second reply message is determined to be a call-making command, executing S250; if the second reply message is determined to be the view details instruction, execution proceeds to S260.
And S250, executing the call dialing instruction, and playing the first preset reply message 'good, which is called for you' matched with the call dialing instruction.
S260, the two-dimension code is sent to the mobile phone of the user, and a second preset reply message matched with the sent two-dimension code is played, namely 'the two-dimension code is sent to your mobile phone and please check after parking'.
And S270, closing the first customization session.
According to the technical scheme of the embodiment of the invention, the customized conversation is generated according to the picture content acquired by analyzing the picture information to be displayed and the preset conversation, and the voice interaction operation is carried out with the user through the customized conversation, so that the user can acquire the picture content in the picture information to be displayed in a voice interaction mode, the conversion effect of the displayed information is improved, meanwhile, the user can acquire the detailed information more conveniently in the voice interaction mode, and the user experience is improved.
Specific application scenario two
Fig. 1C is a flowchart of voice interaction provided in a specific application scenario two of the present invention, which is embodied based on the above embodiment, in the application scenario, a target to-be-filled data item that is not filled exists in a preset session, and after a vehicle-mounted terminal device deletes target session content that matches the target to-be-filled data item, the vehicle-mounted terminal device performs voice interaction with a user according to an obtained second customized session. Correspondingly, the voice interaction process of the embodiment specifically includes the following steps:
s310: play the first session content in the second customized session, i.e. "this is an advertisement for a piece of brand name, do you want to know how detailed? "; s320 is performed.
S320: acquiring first reply information of a user; if the first reply message is determined to be the confirmation instruction, executing S330; if the first reply message is determined to be a reject instruction, S370 is executed.
S330: playing second session content in a second customized session, namely "[ publicity ], the details can be scanned with the two-dimensional code"; s340 is performed.
The second session content in the preset session is 'publicity', the price is 'price information', the user can contact 'contact telephone', the details can access 'website information', only the brand name, the publicity and the website information are extracted according to the text information, the price information and the contact telephone are not extracted, the corresponding second session content in the preset session is modified into 'publicity', and the details can access 'website information'.
S340: acquiring second reply information of the user; if the second reply message is determined to be a call-making command, executing S350; if the second reply message is determined to be the view details instruction, S360 is executed.
And S350, playing a third preset reply message of sorry and no phone call is found.
S360, the two-dimension code is sent to the mobile phone of the user, and a second preset reply message matched with the sent two-dimension code is played, namely that the two-dimension code is sent to the mobile phone of the user and the user asks to check the two-dimension code after parking.
And S370, closing the second customization session.
According to the technical scheme of the embodiment of the invention, the customized conversation is generated according to the picture content acquired by analyzing the picture information to be displayed and the preset conversation, and the voice interaction operation is carried out with the user through the customized conversation, so that the user can acquire the content in the picture information to be displayed in a voice interaction mode, the conversion effect of the displayed information is improved, meanwhile, the customized conversation generated aiming at the self content of the picture information to be displayed not only deletes useless content in the original preset conversation, but also ensures the integrity of the content, so that the user can acquire detailed information more conveniently and more conveniently, and the user experience is improved.
Example two
Fig. 2 is a flowchart of an information displaying method provided in the second embodiment of the present invention, where this embodiment is applicable to a server performing voice interaction with a user through a target terminal device according to acquired picture information to be displayed, and the method may be executed by an information displaying apparatus in the fourth embodiment of the present invention, where the apparatus may be implemented by software and/or hardware and is integrated in the server, and typically may be integrated in a background server corresponding to a vehicle-mounted terminal device, and the method specifically includes the following steps:
s410, when the picture information to be displayed is obtained, analyzing the picture information to be displayed to obtain picture content; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code.
And S420, generating a customized session according to the picture content and the preset session.
And S430, executing voice interaction operation through the target terminal equipment according to the customized conversation.
According to the technical scheme of the embodiment of the invention, the customized conversation is generated according to the picture content acquired by analyzing the picture information to be displayed and the preset conversation, and the voice interaction operation is carried out with the user through the customized conversation, so that the user can acquire the content in the picture information to be displayed in a voice interaction mode, the conversion effect of the display information is improved, the potential safety hazard caused by the fact that a driver watches a screen picture is avoided, normal driving is influenced, meanwhile, special display information does not need to be designed aiming at the driving scene, the existing display information in a large number of picture forms is directly converted into the voice interaction display information suitable for the driving scene, and the information utilization rate of the existing picture is improved.
EXAMPLE III
Fig. 3 is a block diagram of an information display apparatus provided in the third embodiment of the present invention, which specifically includes: a first picture content obtaining module 310, a first customized session obtaining module 320 and a first voice interaction executing module 330.
The first picture content acquiring module 310 is configured to, when picture information to be displayed is acquired, analyze the picture information to be displayed to acquire picture content; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
a first customized session obtaining module 320, configured to generate a customized session according to the picture content and a preset session;
the first voice interaction executing module 330 is configured to execute a voice interaction operation according to the customized session.
According to the technical scheme of the embodiment of the invention, the customized conversation is generated according to the picture content acquired by analyzing the picture information to be displayed and the preset conversation, and the voice interaction operation is carried out with the user through the customized conversation, so that the user can acquire the content in the picture information to be displayed in a voice interaction mode, the conversion effect of the display information is improved, the potential safety hazard caused by the fact that a driver watches a screen picture is avoided, normal driving is influenced, meanwhile, special display information does not need to be designed aiming at the driving scene, the existing display information in a large number of picture forms is directly converted into the voice interaction display information suitable for the driving scene, and the information utilization rate of the existing picture is improved.
Optionally, on the basis of the foregoing technical solution, the first picture content obtaining module 310 includes:
the first entity object processing unit is used for classifying the entity objects and marking the entity object types;
and/or the first two-dimensional code processing unit is used for performing website address conversion processing on the two-dimensional code and marking the category of the two-dimensional code; the two-dimension code category comprises WeChat public number two-dimension codes or webpage information two-dimension codes.
Optionally, on the basis of the foregoing technical solution, the first picture content obtaining module 310 further includes:
the first text attribute extraction unit is used for extracting text attributes according to the text information; wherein the text attribute comprises a brand name, a contact phone number, website information, price information, address information and/or a publicity language.
Optionally, on the basis of the above technical solution, the information display apparatus includes:
and the first brand name determining module is used for determining the brand name matched with the picture information to be displayed according to the entity object type in the picture information to be displayed if a plurality of brand names are extracted.
Optionally, on the basis of the above technical solution, the first text attribute extraction unit includes at least one of:
the first contact phone extracting subunit is used for extracting the contact phone through a first regular expression;
the first website information extraction subunit is used for extracting website information through a second regular expression;
a first price information extracting subunit operable to extract price information by a currency character or a currency name;
a first address information extraction subunit, configured to extract address information by a natural language processing technique;
and the first propaganda language extracting subunit is used for extracting the propaganda languages according to the area of the character area.
Optionally, on the basis of the foregoing technical solution, the first customized session obtaining module 320 includes:
the device comprises a first data item to be filled determining unit, a second data item to be filled determining unit and a processing unit, wherein the first data item to be filled determining unit is used for acquiring at least one data item to be filled in a preset session;
the first filling execution unit is used for filling at least one data item to be filled according to the obtained entity object type, the two-dimension code type, the brand name, the contact telephone, the website information, the price information, the address information and/or the publicity language;
and the first customized session generation unit is used for generating a customized session according to the filled preset session.
Optionally, on the basis of the foregoing technical solution, the first customized session obtaining module 320 includes:
a first data filling judgment unit, configured to judge whether filling of at least one to-be-filled data item in the preset session is completed;
and the first target session content deleting unit is used for deleting the target session content matched with at least one target data item to be filled if at least one target data item to be filled is not filled, and determining that the preset session is filled completely.
The device can execute the information display method provided by the first embodiment of the invention, and has the corresponding functional modules and beneficial effects of the execution method. For technical details that are not described in detail in this embodiment, reference may be made to the method provided in the first embodiment of the present invention.
Example four
Fig. 4 is a block diagram of an information display apparatus according to a fourth embodiment of the present invention, which specifically includes: a second picture content obtaining module 410, a second customized session obtaining module 420 and a second voice interaction executing module 430.
The second picture content obtaining module 410 is configured to, when the picture information to be displayed is obtained, analyze the picture information to be displayed to obtain picture content; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
a second customized session obtaining module 420, configured to generate a customized session according to the picture content and a preset session;
and a second voice interaction executing module 430, configured to execute a voice interaction operation through the target terminal device according to the customized session.
Optionally, on the basis of the foregoing technical solution, the second picture content obtaining module 410 includes:
the second entity object processing unit is used for classifying the entity objects and marking the entity object types;
and/or the second two-dimension code processing unit is used for performing website address conversion processing on the two-dimension codes and marking the category of the two-dimension codes; the two-dimension code category comprises WeChat public number two-dimension codes or webpage information two-dimension codes.
Optionally, on the basis of the foregoing technical solution, the second picture content obtaining module 410 further includes:
the second text attribute extraction unit is used for extracting text attributes according to the text information; wherein the text attribute comprises a brand name, a contact phone number, website information, price information, address information and/or a publicity language.
Optionally, on the basis of the above technical solution, the information display apparatus includes:
and the second brand name determining module is used for determining the brand name matched with the picture information to be displayed according to the entity object type in the picture information to be displayed if the plurality of brand names are extracted.
Optionally, on the basis of the above technical solution, the second text attribute extraction unit includes at least one of:
the second contact phone extracting subunit is used for extracting the contact phone through the first regular expression;
the second website information extraction subunit is used for extracting website information through a second regular expression;
a second price information extraction subunit operable to extract price information by a currency character or a currency name;
a second address information extraction subunit, configured to extract address information by a natural language processing technique;
and the second propaganda language extracting subunit is used for extracting the propaganda languages according to the area of the character area.
Optionally, on the basis of the foregoing technical solution, the second customized session obtaining module 420 includes:
the second data item to be filled determining unit is used for acquiring at least one data item to be filled in the preset session;
the second filling execution unit is used for filling at least one data item to be filled according to the obtained entity object type, the two-dimension code type, the brand name, the contact telephone, the website information, the price information, the address information and/or the publicity language;
and the second customized session generation unit is used for generating a customized session according to the filled preset session.
Optionally, on the basis of the foregoing technical solution, the second customized session obtaining module 420 includes:
a second data filling judgment unit, configured to judge whether filling of at least one to-be-filled data item in the preset session is completed;
and the second target session content deleting unit is used for deleting the target session content matched with at least one target data item to be filled if at least one target data item to be filled is not filled, and determining that the preset session is filled completely.
According to the technical scheme of the embodiment of the invention, the customized conversation is generated according to the picture content acquired by analyzing the picture information to be displayed and the preset conversation, and the voice interaction operation is carried out with the user through the customized conversation, so that the user can acquire the content in the picture information to be displayed in a voice interaction mode, the conversion effect of the display information is improved, the potential safety hazard caused by the fact that a driver watches a screen picture is avoided, normal driving is influenced, meanwhile, special display information does not need to be designed aiming at the driving scene, the existing display information in a large number of picture forms is directly converted into the voice interaction display information suitable for the driving scene, and the information utilization rate of the existing picture is improved.
The device can execute the information display method provided by the second embodiment of the invention, and has the corresponding functional modules and beneficial effects of the execution method. For details of the technique not described in detail in this embodiment, reference may be made to the method provided in the second embodiment of the present invention.
EXAMPLE five
Fig. 5 is a schematic structural diagram of an apparatus according to a fifth embodiment of the present invention, as shown in fig. 5, the apparatus includes a processor 50, a memory 51, an input device 52, and an output device 53; the number of processors 50 in the device may be one or more, and one processor 50 is taken as an example in fig. 5; the device processor 50, the memory 51, the input device 52 and the output device 53 may be connected by a bus or other means, as exemplified by the bus connection in fig. 5.
The memory 51 serves as a computer-readable storage medium, and can be used for storing software programs, computer-executable programs, and modules, such as modules corresponding to the information presentation apparatus (the first picture content acquiring module 310, the first customized session acquiring module 320, and the first voice interaction performing module 330) executed by the client or modules corresponding to the information presentation apparatus (the second picture content acquiring module 410, the second customized session acquiring module 420, and the second voice interaction performing module 430) executed by the server in the embodiment of the present invention. The processor 50 executes various functional applications and data processing of the device by executing software programs, instructions and modules stored in the memory 51, so as to realize the information presentation method.
The memory 51 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created according to the use of the terminal, and the like. Further, the memory 51 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some examples, the memory 51 may further include memory located remotely from the processor 50, which may be connected to the device over a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The input device 52 is operable to receive input numeric or character information and to generate key signal inputs relating to user settings and function controls of the apparatus. The output device 53 may include a display device such as a display screen.
EXAMPLE six
An embodiment of the present invention further provides a computer-readable storage medium, which when executed by a computer processor is configured to perform an information presentation method, where the method includes:
when the picture information to be displayed is acquired, analyzing the picture information to be displayed to acquire picture content; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
generating a customized session according to the picture content and a preset session;
and executing voice interaction operation according to the customized conversation.
Or when the picture information to be displayed is acquired, analyzing the picture information to be displayed to acquire picture content; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
generating a customized session according to the picture content and a preset session;
and executing voice interaction operation through the target terminal equipment according to the customized conversation.
Of course, the storage medium provided by the embodiment of the present invention contains computer-executable instructions, and the computer-executable instructions are not limited to the operations of the method described above, and may also perform related operations in the information presentation method provided by any embodiment of the present invention.
From the above description of the embodiments, it is obvious for those skilled in the art that the present invention can be implemented by software and necessary general hardware, and certainly, can also be implemented by hardware, but the former is a better embodiment in many cases. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which may be stored in a computer-readable storage medium, such as a floppy disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a FLASH Memory (FLASH), a hard disk or an optical disk of a computer, and includes several instructions for enabling a computer device (which may be a personal computer, a server, or a network device) to execute the methods according to the embodiments of the present invention.
It should be noted that, in the embodiment of the information displaying apparatus, the modules included in the embodiment are only divided according to functional logic, but are not limited to the above division as long as the corresponding functions can be implemented; in addition, the specific names of the functional modules are only for convenience of distinguishing from each other and are not used for limiting the protection scope of the present invention.
It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the present invention has been described in greater detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.

Claims (16)

1. An information display method, comprising:
when the picture information to be displayed is acquired, analyzing the picture information to be displayed to acquire picture content; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
generating a customized session according to the picture content and a preset session;
and executing voice interaction operation according to the customized conversation.
2. The method of claim 1, wherein the obtaining the picture content comprises:
classifying the entity object and marking the class of the entity object;
and/or performing website address conversion processing on the two-dimension code, and marking the category of the two-dimension code; the two-dimension code category comprises WeChat public number two-dimension codes or webpage information two-dimension codes.
3. The method of claim 2, wherein the obtaining picture content further comprises:
extracting text attributes according to the text information; wherein the text attribute comprises a brand name, a contact phone number, website information, price information, address information and/or a publicity language.
4. The method of claim 3, after extracting text attributes from the text information, further comprising:
and if the plurality of brand names are extracted, determining the brand name matched with the picture information to be displayed according to the entity object type in the picture information to be displayed.
5. The method of claim 3, wherein extracting text attributes from the text information comprises at least one of:
extracting the contact phone through a first regular expression;
extracting website information through a second regular expression;
extracting price information through currency characters or currency names;
extracting address information by a natural language processing technology;
and extracting the propaganda words according to the area of the character area.
6. The method according to claim 3, wherein the generating a customized session according to the picture content and a preset session comprises:
acquiring at least one data item to be filled in a preset session;
filling at least one data item to be filled according to the obtained entity object type, the two-dimension code type, the brand name, the contact telephone, the website information, the price information, the address information and/or the publicity language;
and generating a customized session according to the filled preset session.
7. The method according to claim 6, wherein after at least one of the data items to be filled is filled according to the obtained entity object type, the two-dimensional code type, the brand name, the contact phone, the website information, the price information, the address information, and/or the publicity language, the method further comprises:
judging whether at least one data item to be filled in the preset session is filled completely;
and if at least one target data item to be filled is not filled, deleting the target session content matched with the at least one target data item to be filled, and determining that the preset session is filled completely.
8. An information display method is applied to a server and comprises the following steps:
when the picture information to be displayed is acquired, analyzing the picture information to be displayed to acquire picture content; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
generating a customized session according to the picture content and a preset session;
and executing voice interaction operation through the target terminal equipment according to the customized conversation.
9. An information presentation device, comprising:
the first picture content acquisition module is used for analyzing the picture information to be displayed to acquire picture content when the picture information to be displayed is acquired; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
the first customized session acquisition module is used for generating a customized session according to the picture content and a preset session;
and the first voice interaction execution module is used for executing voice interaction operation according to the customized conversation.
10. The apparatus of claim 9, wherein the first picture content obtaining module comprises:
the first entity object processing unit is used for classifying the entity objects and marking the entity object types;
and/or the first two-dimensional code processing unit is used for performing website address conversion processing on the two-dimensional code and marking the category of the two-dimensional code; the two-dimension code category comprises WeChat public number two-dimension codes or webpage information two-dimension codes.
11. The apparatus of claim 10, wherein the first picture content obtaining module further comprises:
the first text attribute extraction unit is used for extracting text attributes according to the text information; wherein the text attribute comprises a brand name, a contact phone number, website information, price information, address information and/or a publicity language.
12. The apparatus of claim 11, wherein the first customized session acquisition module comprises:
the device comprises a first data item to be filled determining unit, a second data item to be filled determining unit and a processing unit, wherein the first data item to be filled determining unit is used for acquiring at least one data item to be filled in a preset session;
the first filling execution unit is used for filling at least one data item to be filled according to the obtained entity object type, the two-dimension code type, the brand name, the contact telephone, the website information, the price information, the address information and/or the publicity language;
and the first customized session generation unit is used for generating a customized session according to the filled preset session.
13. An information presentation device, comprising:
the second picture content acquisition module is used for analyzing the picture information to be displayed to acquire picture content when the picture information to be displayed is acquired; the picture content comprises image information and/or text information, and the image information comprises an entity object and/or a two-dimensional code;
the second customized session acquisition module is used for generating a customized session according to the picture content and a preset session;
and the second voice interaction execution module is used for executing voice interaction operation through the target terminal equipment according to the customized conversation.
14. The apparatus of claim 13, wherein the second picture content obtaining module comprises:
the second entity object processing unit is used for classifying the entity objects and marking the entity object types;
and/or the second two-dimension code processing unit is used for performing website address conversion processing on the two-dimension codes and marking the category of the two-dimension codes; the two-dimension code category comprises WeChat public number two-dimension codes or webpage information two-dimension codes.
15. The apparatus of claim 14, wherein the second picture content obtaining module further comprises:
the second text attribute extraction unit is used for extracting text attributes according to the text information; wherein the text attribute comprises a brand name, a contact phone number, website information, price information, address information and/or a publicity language.
16. The apparatus of claim 15, wherein the second customized session acquisition module comprises:
the second data item to be filled determining unit is used for acquiring at least one data item to be filled in the preset session;
the second filling execution unit is used for filling at least one data item to be filled according to the obtained entity object type, the two-dimension code type, the brand name, the contact telephone, the website information, the price information, the address information and/or the publicity language;
and the second customized session generation unit is used for generating a customized session according to the filled preset session.
CN202010591971.3A 2020-06-24 2020-06-24 Information display method and device Active CN111782172B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010591971.3A CN111782172B (en) 2020-06-24 2020-06-24 Information display method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010591971.3A CN111782172B (en) 2020-06-24 2020-06-24 Information display method and device

Publications (2)

Publication Number Publication Date
CN111782172A true CN111782172A (en) 2020-10-16
CN111782172B CN111782172B (en) 2024-03-12

Family

ID=72761549

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010591971.3A Active CN111782172B (en) 2020-06-24 2020-06-24 Information display method and device

Country Status (1)

Country Link
CN (1) CN111782172B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113204615A (en) * 2021-04-29 2021-08-03 北京百度网讯科技有限公司 Entity extraction method, device, equipment and storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6961712B1 (en) * 1996-10-25 2005-11-01 Ipf, Inc. Consumer product information request (CPIR) enabling servlets and web-based consumer product information catalogs employing the same
US20080097854A1 (en) * 2006-10-24 2008-04-24 Hello-Hello, Inc. Method for Creating and Analyzing Advertisements
US20170154450A1 (en) * 2015-11-30 2017-06-01 Le Shi Zhi Xin Electronic Technology (Tianjin) Limited Multimedia Picture Generating Method, Device and Electronic Device
CN108960200A (en) * 2018-07-31 2018-12-07 北京微播视界科技有限公司 A kind of data processing method and electronic equipment based on intelligent interaction
CN109151225A (en) * 2018-09-04 2019-01-04 北京小鱼在家科技有限公司 Call handling method, device and verbal system
CN109753558A (en) * 2018-12-26 2019-05-14 出门问问信息科技有限公司 Method, apparatus and system based on user's manual building question answering system
CN110032355A (en) * 2018-12-24 2019-07-19 阿里巴巴集团控股有限公司 Speech playing method, device, terminal device and computer storage medium
WO2019196238A1 (en) * 2018-04-09 2019-10-17 平安科技(深圳)有限公司 Speech recognition method, terminal device, and computer readable storage medium

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6961712B1 (en) * 1996-10-25 2005-11-01 Ipf, Inc. Consumer product information request (CPIR) enabling servlets and web-based consumer product information catalogs employing the same
US20080097854A1 (en) * 2006-10-24 2008-04-24 Hello-Hello, Inc. Method for Creating and Analyzing Advertisements
US20170154450A1 (en) * 2015-11-30 2017-06-01 Le Shi Zhi Xin Electronic Technology (Tianjin) Limited Multimedia Picture Generating Method, Device and Electronic Device
WO2019196238A1 (en) * 2018-04-09 2019-10-17 平安科技(深圳)有限公司 Speech recognition method, terminal device, and computer readable storage medium
CN108960200A (en) * 2018-07-31 2018-12-07 北京微播视界科技有限公司 A kind of data processing method and electronic equipment based on intelligent interaction
CN109151225A (en) * 2018-09-04 2019-01-04 北京小鱼在家科技有限公司 Call handling method, device and verbal system
CN110032355A (en) * 2018-12-24 2019-07-19 阿里巴巴集团控股有限公司 Speech playing method, device, terminal device and computer storage medium
CN109753558A (en) * 2018-12-26 2019-05-14 出门问问信息科技有限公司 Method, apparatus and system based on user's manual building question answering system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113204615A (en) * 2021-04-29 2021-08-03 北京百度网讯科技有限公司 Entity extraction method, device, equipment and storage medium
CN113204615B (en) * 2021-04-29 2023-11-24 北京百度网讯科技有限公司 Entity extraction method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN111782172B (en) 2024-03-12

Similar Documents

Publication Publication Date Title
CN110035314A (en) Methods of exhibiting and device, storage medium, the electronic device of information
CN107590267B (en) Information-pushing method and device, terminal and readable storage medium storing program for executing based on picture
CN104579909B (en) Method and equipment for classifying user information and acquiring user grouping information
CN110930186A (en) System, method, device, equipment and storage medium for task display
CN108304368B (en) Text information type identification method and device, storage medium and processor
CN111488186B (en) Data processing method, device, electronic equipment and computer storage medium
CN103237136B (en) The search method of mobile terminal and descriptor thereof
CN106776909B (en) Page creating method and device
CN114902702B (en) Short message pushing method, device, server and storage medium
CN104657668A (en) Terminal
CN102760157B (en) A kind of for generating the method that release news, device and the equipment corresponding with mobile terminal
CN111782172B (en) Information display method and device
CN117319699B (en) Live video generation method and device based on intelligent digital human model
CN108829882B (en) Information collection method, device, terminal and medium
CN111933133A (en) Intelligent customer service response method and device, electronic equipment and storage medium
CN104657991A (en) Picture processing method
CN109120509B (en) Information collection method and device
CN114741144B (en) Web-side complex form display method, device and system
CN115860829A (en) Intelligent advertisement image generation method and device
CN115061785A (en) Information issuing method and device, storage medium and server
CN113240447A (en) Advertisement pushing method and device, storage medium and server
CN113949887A (en) Method and device for processing network live broadcast data
CN112131895B (en) Information transmission method, device, electronic equipment and storage medium
CN107194004B (en) Data processing method and electronic equipment
CN110931014A (en) Speech recognition method and device based on regular matching rule

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant