CN113779300A - Voice input guiding method and device and vehicle machine - Google Patents

Voice input guiding method and device and vehicle machine Download PDF

Info

Publication number
CN113779300A
CN113779300A CN202010519922.9A CN202010519922A CN113779300A CN 113779300 A CN113779300 A CN 113779300A CN 202010519922 A CN202010519922 A CN 202010519922A CN 113779300 A CN113779300 A CN 113779300A
Authority
CN
China
Prior art keywords
voice
user
information
instruction
vehicle
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010519922.9A
Other languages
Chinese (zh)
Other versions
CN113779300B (en
Inventor
赵伟
肖金富
刘柯
杨冬生
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BYD Co Ltd
Original Assignee
BYD Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BYD Co Ltd filed Critical BYD Co Ltd
Priority to CN202010519922.9A priority Critical patent/CN113779300B/en
Publication of CN113779300A publication Critical patent/CN113779300A/en
Application granted granted Critical
Publication of CN113779300B publication Critical patent/CN113779300B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63Querying
    • G06F16/635Filtering based on additional data, e.g. user or group profiles
    • G06F16/637Administration of user profiles, e.g. generation, initialization, adaptation or distribution
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W50/00Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
    • B60W50/08Interaction between the driver and the control system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/687Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using geographical or spatial information, e.g. location

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Automation & Control Theory (AREA)
  • Library & Information Science (AREA)
  • Human Computer Interaction (AREA)
  • Transportation (AREA)
  • Mechanical Engineering (AREA)
  • Navigation (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The embodiment of the invention provides a voice input guiding method, a voice input guiding device and a vehicle machine, wherein the voice input guiding method comprises the following steps: acquiring a voice guidance instruction set matched with a driving scene of a vehicle and a user portrait of a user; and under the condition that the car machine is not detected to have a foreground running application program, executing a guiding operation on at least one voice guiding instruction in the voice guiding instruction set. By the technical scheme, the guidance requirement of voice input on the vehicle machine when a user is in a driving state can be met, the input operation steps of the user can be simplified without triggered user operation, and therefore the use experience of the user can be improved.

Description

Voice input guiding method and device and vehicle machine
Technical Field
The invention relates to the technical field of a vehicle machine, in particular to a voice input guiding method, a voice input guiding device and a vehicle machine.
Background
With the development of the voice recognition technology, a voice input guiding device is integrated on a mobile phone or a computer and other terminals, after receiving operation information of a user, a voice input interface is displayed according to the operation information, and guiding words are displayed on the voice input interface, so that the voice input guiding is realized, and the purpose of guiding the user to use voice searching is achieved.
As can be seen from the above, the guiding device for voice input in the prior art is not suitable for use in a car machine, and a user is usually in a driving state while the car machine is running, and the way of triggering the guiding device to start based on user operation information also affects the concentration of the user in driving.
Disclosure of Invention
The technical problem to be solved by the embodiment of the invention is to provide a voice input guiding method to meet the guiding requirement of voice input on a vehicle machine when a user is in a driving state.
Correspondingly, the embodiment of the invention also provides a voice input guiding device and a car machine, which are used for ensuring the realization and the application of the method.
An embodiment of a first aspect of the present invention provides a method for guiding voice input of a car machine, including:
acquiring a voice guidance instruction set matched with a driving scene of a vehicle and a user portrait of a user;
and under the condition that the car machine is not detected to have a foreground running application program, executing a guiding operation on at least one voice guiding instruction in the voice guiding instruction set.
Optionally, before the voice guidance instruction set matching with the driving scene of the vehicle and the user portrait of the user is obtained, the method includes: configuring a driving scene of the vehicle, and/or generating a user representation of the user.
Optionally, the configuring a driving scene of the vehicle includes:
acquiring at least one item of vehicle speed information, position information, driving map data and gear information of the vehicle as scene configuration information;
and configuring the driving scene according to the scene configuration information.
Optionally, the generating a user representation of the user comprises:
acquiring working condition information and historical voice records of the vehicle machine;
generating the user portrait according to the working condition information and the historical voice record,
the working condition information comprises touch information of the vehicle machine and/or an application program operated by the vehicle machine.
Optionally, the performing, when it is not detected that the car machine has a foreground-running application, a guidance operation on at least one voice guidance instruction in the voice guidance instruction set includes:
determining the updating frequency and/or the updating time of the voice guidance instruction set according to the driving scene;
updating the at least one voice guidance instruction for performing a guidance operation according to the update frequency and/or update time.
Optionally, the acquiring a voice guidance instruction set matched with the driving scene and the user portrait specifically includes:
detecting whether the user portrait is matched with a pre-stored user portrait;
if the user portrait is matched with the pre-stored user portrait, extracting the voice guide instruction set matched with the user portrait and the driving scene from a pre-stored voice guide instruction library;
and if the user portrait is not matched with the pre-stored user portrait, configuring the voice guidance instruction set according to a cold start strategy.
Optionally, before the voice guidance instruction set matching the driving scene and the user portrait is obtained, the method includes:
collecting guide information, wherein the guide information comprises at least one of function instruction information, voice self-learning instruction information, promotion instruction information and user instruction information;
deleting information which is not matched with the guiding rule in the guiding information so as to reserve the guiding information to be processed;
dividing the information to be processed into a training set and a testing set;
performing machine learning based on the training set, the test set, pre-stored driving scenarios and pre-stored user profiles,
generating a plurality of groups of pre-stored voice instruction sets and index information corresponding to each group of pre-stored voice instruction sets;
and generating the voice guide instruction library according to the plurality of groups of pre-stored voice instruction sets and the index information.
Optionally, the configuring the voice guidance instruction set according to the cold start policy includes:
configuring at least one group of pre-stored voice instruction sets with the highest use frequency in the voice guidance instruction library as the voice guidance instruction set.
Optionally, extracting the voice guidance instruction set matching with the user portrait and the driving scene from a pre-stored voice guidance instruction library, including:
and determining the index information with the highest matching degree with the user image and the driving scene, and determining the pre-stored voice instruction set corresponding to the index information as the voice guide instruction set.
Optionally, the method further comprises:
and stopping executing the guide operation under the condition that the car machine is detected to have the application program running in the foreground.
An embodiment of a second aspect of the present invention provides a voice input guiding device of a car machine, including:
the system comprises an acquisition unit, a display unit and a control unit, wherein the acquisition unit is used for acquiring a voice guide instruction set matched with a driving scene of a vehicle and a user portrait of a user;
and the execution unit is used for executing the guiding operation on at least one voice guiding instruction in the voice guiding instruction set under the condition that the car machine is not detected to have the application program running in the foreground.
An embodiment of a third aspect of the present invention provides a vehicle machine, including:
a voice input guidance apparatus according to an embodiment of the second aspect of the present invention.
According to the embodiment of the invention, the voice guide instruction set is determined according to the driving scene and the user image, so that the obtained voice guide instruction set can simultaneously take the driving state of the vehicle and the preference of the user into consideration, and further, the voice input guide device on the vehicle machine guides the voice input of the user in the driving process of the vehicle, the requirement of the user for sending a related voice instruction in the driving process is met, and further, the travel related information can be pushed to the user based on the voice feedback of the user, so that the purpose of assisting the user in traveling is achieved. Further, whether the car machine has the application program running in the foreground or not is detected, so that when the application program running in the foreground is not detected, the guiding operation of voice input is triggered and executed, on one hand, the voice input guiding device and other application programs can be guaranteed to run relatively independently, on the other hand, the user operation which is triggered is not needed, the input operation steps of a user can be simplified, and the use experience of the user can be improved.
Drawings
FIG. 1 is a flow chart of the steps of one embodiment of a method for voice input guidance of the present invention;
FIG. 2 is a flow chart of steps of another voice input guidance method embodiment of the present invention;
FIG. 3 is a schematic flow chart diagram of an embodiment of the present invention for starting the voice input guidance module of the car machine;
FIG. 4 is a schematic flow chart diagram of a generated embodiment of a voice guidance instruction library of the present invention;
FIG. 5 is a schematic flow chart diagram of an embodiment of the present invention for obtaining a voice guidance instruction set matching a driving scene and a user profile;
FIG. 6 is a flow chart of steps of yet another embodiment of a method for voice input guidance in accordance with the present invention;
FIG. 7 is a block diagram of an embodiment of a voice input guidance apparatus according to the present invention;
fig. 8 is a block diagram of an embodiment of a vehicle machine according to the present invention.
Detailed Description
So that the manner in which the above recited objects, features and advantages of the present invention can be understood in detail, a more particular description of the invention, briefly summarized above, may be had by reference to the embodiments thereof which are illustrated in the appended drawings. It should be noted that the embodiments of the present invention and features of the embodiments may be combined with each other without conflict.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, however, the present invention may be practiced in other ways than those specifically described herein, and therefore the scope of the present invention is not limited by the specific embodiments disclosed below.
The embodiment of the invention is applied to a car machine, which is a short name of a vehicle-mounted information entertainment product arranged in a car, and can be used for information communication with the car and the outside (car-to-car) functionally.
Referring to fig. 1, a flowchart illustrating steps of an embodiment of a voice input guidance method according to the present invention is shown, which may specifically include the following steps:
step S102, a voice guidance instruction set matched with a driving scene of a vehicle and a user portrait of a user is obtained.
The driving scene of the vehicle and the user image of the user are respectively generated, and the voice guide instruction set is determined according to the driving scene and the user image, so that the obtained voice guide instruction set can simultaneously give consideration to the driving state of the vehicle and the preference of the user, and further, the voice input guide device on the vehicle machine guides the voice input of the user in the driving process of the vehicle, and the requirement that the user sends a related voice instruction in the driving process is met.
Specifically, the operation of obtaining the voice guidance instruction set may be performed locally in the car machine, or may be performed through communication interaction with an adapted server, such as: and sending the driving scene and the user portrait to an adaptive server so as to receive a voice guidance instruction set issued by the server according to the driving scene and the user portrait.
And step S104, under the condition that the car machine is not detected to have the application program running in the foreground, executing the guiding operation on at least one voice guiding instruction in the voice guiding instruction set.
The voice guidance instruction performs guidance operation, specifically, guides the user to speak the displayed voice guidance information.
The method comprises the steps of detecting whether a car machine has a foreground running application program or not, triggering and executing voice input guiding operation when the foreground running application program is not detected, under the ordinary condition, if the foreground running application program does not exist, displaying a main interface on a display screen of the car machine, and executing the guiding operation under the working condition without influencing the running of other application programs.
Specifically, the guidance operation is performed on at least one voice guidance instruction in the voice guidance instruction set, and may be displaying a guidance sentence on a display screen, or playing guidance audio.
In addition, the guidance operation is performed on at least one voice guidance instruction in the voice guidance instruction set, and only one voice guidance instruction may be displayed at a time, or a plurality of voice guidance instructions may be displayed at the same time, and the first mode is preferable.
According to the voice input guiding method provided by the embodiment, the voice guiding instruction set is determined according to the driving scene and the user image, so that the voice input guiding device on the vehicle can guide the voice input of the user in the driving process of the vehicle, the requirement of the user for sending the related voice instruction in the driving process is met, the travel related information can be pushed to the user based on the voice feedback of the user, and the purpose of assisting the user in traveling is achieved.
Further, whether the car machine has the application program running in the foreground or not is detected, so that when the application program running in the foreground is not detected, the guiding operation of voice input is triggered and executed, on one hand, the voice input guiding device and other application programs can be guaranteed to run relatively independently, on the other hand, the user operation which is triggered is not needed, the input operation steps of a user can be simplified, and the use experience of the user can be improved.
In some embodiments, one possible implementation manner of step S104 is: determining the updating frequency and/or the updating time of the voice guidance instruction set according to the driving scene; and updating at least one voice guidance instruction for executing the guidance operation according to the updating frequency and/or the updating time.
As a preferred implementation manner, the performing of the guidance operation on at least one voice guidance instruction in the voice guidance instruction set specifically includes displaying only voice guidance information corresponding to one voice guidance instruction on a display screen of the car machine, and updating at least one voice guidance instruction for performing the guidance operation according to the update frequency, and specifically includes: by combining the current driving scene, the updating frequency and the updating opportunity of the single data in the voice guidance instruction set at the vehicle end are determined, so that when the driving scene is changed, the displayed voice guidance information is correspondingly adjusted or when the user has no feedback on the current voice guidance information, the next voice guidance information is timely switched and displayed.
Updating at least one voice guidance instruction for executing guidance operation according to the update time, which specifically comprises: in different driving scenes, the corresponding time for displaying the voice guidance information is different, for example, in the driving scene of traffic jam driving, the corresponding voice guidance information is to switch the driving route, at this time, the voice guidance information can be displayed on the display screen of the vehicle machine in real time, and in the driving scenes of normal driving, overspeed driving, low-speed driving, traffic light parking and the like, the corresponding driving speed is different, and if the voice guidance instruction is to play a multimedia file, the voice guidance instruction needs to correspond to different display time.
Referring to fig. 2, a flowchart illustrating steps of another embodiment of a voice input guidance method of the present invention is shown, which may specifically include the following steps:
step S202, configuring a driving scene of the vehicle and/or generating a user portrait of a user.
The driving scene is used for determining the current driving state, the user represents the preference of the user, and the matched voice guide instruction set can be determined based on the information by determining the information so as to assist the driving of the user.
Specifically, the driving scenes may include normal driving, speeding, low-speed driving, temporary parking, traffic light parking, high-speed driving, traffic jam driving, and the like.
Step S204, a voice guidance instruction set matched with the driving scene of the vehicle and the user portrait of the user is obtained.
And step S206, under the condition that the car machine is not detected to have the application program running in the foreground, executing the guiding operation on at least one voice guiding instruction in the voice guiding instruction set.
In this embodiment, specific implementation processes of step S204 and step S206 may refer to the description related to step 102 and step 104 in the embodiment shown in fig. 1, and are not described herein again.
In some embodiments, one possible implementation manner of configuring the driving scenario of the vehicle described in step S202 is as follows: collecting at least one item of vehicle speed information, position information, driving map data and gear information of a vehicle as scene configuration information; and configuring the driving scene according to the scene configuration information.
The information is collected, so that the current vehicle can be helped to be judged in which driving scene of normal driving, overspeed driving, low-speed driving, temporary parking, traffic light parking, high-speed driving, traffic jam driving and the like.
In some embodiments, one possible implementation of generating a user representation of a user described in step S202 is: acquiring working condition information and historical voice records of the vehicle machine; and generating a user image according to the working condition information and the historical voice record, wherein the working condition information comprises touch information of the vehicle machine and/or an application program operated by the vehicle machine.
The user can know the use habit and the use preference of the user to the application program in the vehicle machine through the analysis of the working condition information of the vehicle machine, the working condition information comprises touch information of the vehicle machine, the application program with high running frequency in the vehicle machine and the like, the user preference can be known through the analysis of the historical voice record, and then the user portrait can be constructed through the information such as the use habit, the use preference, the user preference and the like, so that the user portrait can be more accurately applied to voice guidance.
Specifically, as shown in fig. 3, the starting module of the voice input guidance module of the car machine includes: a data collector 302, a content decider 304, a presentation decider 306, and a presentation module 308.
The data collector 302 is configured to collect vehicle speed information, position information, driving map data, gear information, touch information of a vehicle, an application program run by the vehicle, and a historical voice record.
The content decision device 304 is configured to configure a driving scene according to the vehicle speed information, the position information, the driving map data and the gear information of the vehicle.
And configuring a user image according to the touch information of the vehicle machine, the application program operated by the vehicle machine and the historical voice record.
The presentation decision maker 306 is used for determining a voice guidance instruction set and a voice guidance instruction updating frequency and/or updating time according to the driving scene and the user image.
The presentation module 308 is used for presenting the decision result, i.e. the voice guidance instruction.
Alternatively, as shown in fig. 4, one possible implementation manner of step S102, that is, an implementation manner of acquiring a voice guidance instruction set matching with a driving scene and a user portrait, includes:
step S402, detecting whether the user portrait is matched with a pre-stored user portrait.
Step S404, if the user image is matched with the pre-stored user image, extracting a voice guide instruction set matched with the user image and the driving scene from a pre-stored voice guide instruction library.
Step S406, if the user portrait does not match the pre-stored user portrait, configuring a voice guidance instruction set according to the cold start strategy.
The implementation process may be completed at a vehicle end or at a server end adapted to a vehicle machine, as can be understood by those skilled in the art.
In some embodiments, one possible implementation manner of step S404 is: and determining index information with the highest matching degree with the user portrait and the driving scene, and determining a pre-stored voice instruction set corresponding to the index information as a voice guide instruction set.
In some embodiments, one possible implementation manner of step S406 is: and configuring at least one group of pre-stored voice instruction sets with the highest use frequency in the voice guidance instruction library as a voice guidance instruction set.
Specifically, if it is detected that the user profile does not match the pre-stored user profile, it indicates that the voice guidance instruction library does not have a voice guidance instruction set matching the user profile, in which case, the cold start policy may be to provide the car machine with a set of voice guidance instruction sets with the highest frequency of use.
Optionally, before step S202, a generation process of a voice guidance instruction library is further included to query the voice guidance instruction library for the adapted voice guidance instruction set.
Specifically, as shown in fig. 5, the generation process of the voice guidance instruction library includes:
step S502, collecting guide information, wherein the guide information comprises function instruction information, voice self-learning instruction information, promotion instruction information and user instruction information.
The functional instruction information can be an operation instruction of a specified application program, the voice self-learning instruction information can be a voice instruction identified through a training model, the promotion instruction information can be voice information based on promotion requirements, and the user instruction information is instruction information commonly used by a user.
Step S504, detecting whether the guiding information matches with the guiding rule, so as to perform security review and content review on the guiding information.
Specifically, the main function of detecting whether the guide information is matched with the guide rule is to perform security inspection and content inspection on data provided by a plurality of guide word input sources, wherein the security inspection is used for detecting whether sensitive words such as yellow and black are included in the guide information, the content inspection comprises two steps, as the voice guide instruction only supports Chinese and English, the first step is to detect whether special symbols, numbers and the like are included, and the second step is an intelligent voice semantic analysis process to see whether intelligent voice can correctly analyze the instruction so as to ensure the reliability of the voice guide instruction.
Step S506, the matched guidance information to be processed is retained.
Step S508, the information to be processed is divided into a training set and a test set.
Step S510, machine learning is executed according to the training set, the test set, the pre-stored driving scene and the pre-stored user portrait, so that a plurality of groups of pre-stored voice instruction sets and index information corresponding to each group of pre-stored voice instruction sets are generated.
The method comprises the steps that machine learning operation is executed, guide information in a training set and a test set is classified and processed based on a pre-stored driving scene and a pre-stored user portrait, a plurality of groups of pre-stored voice instruction sets are obtained, and corresponding index information is set in each group of pre-stored voice instruction sets to establish indexes.
The pre-stored voice instruction set can be understood as another name of the voice guidance instruction set in the voice guidance instruction library.
Step S512, a voice guidance instruction library is generated according to the plurality of groups of pre-stored voice instruction sets and the index information.
The voice guidance instruction library is specifically a database of the voice guidance instruction library, and preferably, the database is stored in the server side.
The generation process of the voice guidance instruction library can be completed at a vehicle end or at a server end adapted to a vehicle machine, as can be understood by those skilled in the art.
After the two steps, an index is established for the voice guide instruction according to the pre-stored driving scene and the pre-stored user image so as to form a voice guide instruction library, wherein the voice guide instruction library comprises a plurality of groups of voice guide instruction sets.
Referring to fig. 6, a flowchart illustrating steps of another embodiment of a voice input guidance method according to the present invention is shown, where the method for guiding input is based on interaction between a car machine and an adapted server, and specifically includes the following steps:
step S602, at least one item of vehicle speed information, position information, driving map data and gear information of the vehicle is collected as scene configuration information.
And step S604, configuring the driving scene according to the scene configuration information.
And step S606, obtaining the working condition information and the historical voice record of the vehicle machine.
And step S608, generating a user image according to the working condition information and the historical voice record.
And step S610, sending the driving scene and the user portrait to an adaptive server.
And step S612, receiving a voice guide instruction set issued by the server according to the driving scene and the user portrait.
And step S614, determining the updating frequency and the updating time of the voice guidance instruction set according to the driving scene.
Step S616, detecting whether the car machine has a foreground running application.
Step S618, displaying the voice guidance instruction according to the update frequency and the update time when the car machine is not detected to have the foreground running application.
And step S620, stopping executing the guiding operation under the condition that the car machine is detected to have the application program running in the foreground.
According to the voice input guiding method provided by the embodiment, the voice input guiding module is displayed on the desktop of the screen of the vehicle, the voice instruction is actively displayed to the user according to the use habit and the current vehicle state of the user, and the user only needs to speak the instruction according to the instruction displayed by the voice skill guiding system.
Specifically, as long as the user does not use the car screen, the voice instruction can be directly recommended to the user, the recommendation is performed based on the voice use history of the user and the user portrait, the recommendation is more biased to recommend new voice skills, and vehicle-mounted traffic information is intelligently pushed according to the vehicle condition and road condition aiming at the vehicle-mounted scene, so that the car-using knowledge of the car owner is widened, and the driving alertness of the user is improved.
It should be noted that, for simplicity of description, the method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the present invention is not limited by the illustrated order of acts, as some steps may occur in other orders or concurrently in accordance with the embodiments of the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are presently preferred and that no particular act is required to implement the invention.
Referring to fig. 7, there is shown a schematic block diagram of an embodiment of a voice input guidance apparatus 700 of the present invention, comprising:
an obtaining unit 702, configured to obtain a voice guidance instruction set that matches a driving scene of a vehicle and a user portrait of a user;
and the execution unit 704 is configured to execute a guidance operation on at least one voice guidance instruction in the voice guidance instruction set when the car machine is not detected to have the application program running in the foreground.
The obtaining unit 702 may be integrated in a host of a vehicle or a network module of the vehicle, and the executing unit 704 may be a display module and/or a speaker module.
The voice input guidance device provided by this embodiment determines the voice guidance instruction set through the obtaining unit 702, and can implement guidance of voice input of the user by the voice input guidance device on the vehicle in the execution unit 704 during the vehicle driving process, so as to meet the requirement of the user for sending a related voice instruction during the driving process, and further push travel-related information to the user based on voice feedback of the user, thereby achieving the purpose of assisting the user in traveling.
Optionally, the voice input guidance apparatus 700 further includes: a configuration unit 706, configured to configure a driving scene of the vehicle, and/or generate a user representation of the user.
The configuration unit 706 may be integrated in a host of the vehicle machine.
Optionally, the configuration unit 706 comprises: the collecting subunit 7062 is configured to collect at least one of vehicle speed information, position information, driving map data, and gear information of the vehicle as scene configuration information; the configuration unit 706 is further configured to: and configuring the driving scene according to the scene configuration information.
Wherein, the acquisition subunit 7062 may be a sensor and/or a data acquisition interface.
Optionally, the configuration unit 706 comprises: an acquiring subunit 7064, configured to acquire operating condition information and a historical voice record of the vehicle device; and a generating subunit 7066, configured to generate a user icon according to the operating condition information and the historical voice record, where the operating condition information includes touch information about the vehicle and/or an application program run by the vehicle.
The obtaining sub-unit 7064 and the generating sub-unit 7066 may be integrated in a host of a vehicle machine.
Optionally, the execution unit 704 includes: a first determining subunit 7062, configured to determine, according to the driving scene, an update frequency and/or an update time of the voice guidance instruction set; update subunit 7064, the user updates at least one voice guidance instruction for performing the guidance operation according to the update frequency and/or the update time.
The first determining subunit 7062 and the updating subunit 7064 may be integrated in a host of the vehicle machine. Optionally, the obtaining unit 702 includes: a detecting subunit 7022, configured to detect whether the user portrait matches a pre-stored user portrait; an extracting subunit 7024, configured to, if the user image matches a pre-stored user image, extract a voice guidance instruction set matching the user image and the driving scene from a pre-stored voice guidance instruction library; a configuration subunit 7026, configured to configure the voice guidance instruction set according to the cold start policy if the user profile does not match the pre-stored user profile.
The detecting sub-unit 7022, the extracting sub-unit 7024 and the configuring sub-unit 7026 may be integrated into a server adapted to a vehicle.
Optionally, the apparatus 700 further comprises: a collecting unit 708, configured to collect guidance information, where the guidance information includes at least one of function instruction information, voice self-learning instruction information, promotion instruction information, and user instruction information; a deleting unit 610, configured to delete information that does not match the guidance rule in the guidance information, so as to retain the guidance information to be processed; a dividing unit 612, configured to divide information to be processed into a training set and a test set; the generating unit 614 is used for executing machine learning according to the training set, the test set, the pre-stored driving scene and the pre-stored user portrait so as to generate a plurality of groups of pre-stored voice instruction sets and index information corresponding to each group of pre-stored voice instruction sets; the generating unit 614 is further configured to: and generating a voice guide instruction library according to the plurality of groups of pre-stored voice instruction sets and the index information.
The collecting unit 708, the deleting unit 710, the dividing unit 712 and the generating unit 714 may be integrated on a server adapted to the car machine.
Optionally, the configuration subunit 7026 is further configured to: and configuring at least one group of pre-stored voice instruction sets with the highest use frequency in the voice guidance instruction library as a voice guidance instruction set.
Optionally, the extracting subunit 7024 includes: the second determining subunit 7024A is configured to determine index information with the highest matching degree with the user portrait and the driving scene, so as to determine a corresponding voice guidance instruction set according to the index information.
Optionally, the execution unit 704 is further configured to: and stopping executing the guide operation under the condition that the car machine is detected to have the application program running in the foreground.
For the device embodiment, since it is basically similar to the method embodiment, the description is simple, and for the relevant points, refer to the partial description of the method embodiment.
Referring to fig. 8, a schematic block diagram of an embodiment of a car machine 80 of the present invention is shown, including: the voice input guiding apparatus 700 according to the above embodiment.
The embodiments in the present specification are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, apparatus, or computer program product. Accordingly, embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
Embodiments of the present invention are described with reference to flowchart illustrations and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing terminal to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing terminal, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing terminal to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing terminal to cause a series of operational steps to be performed on the computer or other programmable terminal to produce a computer implemented process such that the instructions which execute on the computer or other programmable terminal provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While preferred embodiments of the present invention have been described, additional variations and modifications of these embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the embodiments of the invention.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or terminal that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or terminal. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or terminal that comprises the element.
The voice input guiding method and the voice input guiding device provided by the invention are described in detail, specific examples are applied in the text to explain the principle and the implementation mode of the invention, and the description of the above embodiments is only used to help understanding the method and the core idea of the invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (12)

1. A voice input guiding method of a car machine is characterized by comprising the following steps:
acquiring a voice guidance instruction set matched with a driving scene of a vehicle and a user portrait of a user;
and under the condition that the car machine is not detected to have a foreground running application program, executing a guiding operation on at least one voice guiding instruction in the voice guiding instruction set.
2. The method of claim 1, wherein prior to obtaining the set of voice guidance instructions that match the driving scene of the vehicle and the user representation of the user, the method comprises:
configuring a driving scene of the vehicle, and/or generating a user representation of the user.
3. The method of claim 2, wherein configuring the driving scenario of the vehicle comprises:
acquiring at least one item of vehicle speed information, position information, driving map data and gear information of the vehicle as scene configuration information;
and configuring the driving scene according to the scene configuration information.
4. The method of claim 2, wherein generating a user representation of a user comprises:
acquiring working condition information and historical voice records of the vehicle machine;
generating the user portrait according to the working condition information and the historical voice record,
the working condition information comprises touch information of the vehicle machine and/or an application program operated by the vehicle machine.
5. The method according to claim 1, wherein said performing a guidance operation on at least one voice guidance instruction in the voice guidance instruction set without detecting that the car machine has an application running in the foreground comprises:
determining the updating frequency and/or the updating time of the voice guidance instruction set according to the driving scene;
updating the at least one voice guidance instruction for performing a guidance operation according to the update frequency and/or update time.
6. The method according to claim 1, wherein the obtaining of the set of voice guidance instructions matching the driving scene and the user representation comprises:
detecting whether the user portrait is matched with a pre-stored user portrait;
if the user portrait is matched with the pre-stored user portrait, extracting the voice guide instruction set matched with the user portrait and the driving scene from a pre-stored voice guide instruction library;
and if the user portrait is not matched with the pre-stored user portrait, configuring the voice guidance instruction set according to a cold start strategy.
7. The method of claim 6, wherein before the obtaining of the set of voice guidance instructions matching the driving scenario and the user representation, the method comprises:
collecting guide information, wherein the guide information comprises at least one of function instruction information, voice self-learning instruction information, promotion instruction information and user instruction information;
deleting information which is not matched with the guiding rule in the guiding information so as to reserve the guiding information to be processed;
dividing the information to be processed into a training set and a testing set;
executing machine learning according to the training set, the test set, a prestored driving scene and a prestored user portrait to generate a plurality of groups of prestored voice instruction sets and index information corresponding to each group of prestored voice instruction sets;
and generating the voice guide instruction library according to the plurality of groups of pre-stored voice instruction sets and the index information.
8. The method of claim 6, wherein the configuring the voice guidance instruction set according to a cold start policy comprises:
configuring at least one group of pre-stored voice instruction sets with the highest use frequency in the voice guidance instruction library as the voice guidance instruction set.
9. The method of claim 7, wherein the extracting the set of voice guidance instructions matching the user representation and the driving scenario from a pre-stored library of voice guidance instructions comprises:
and determining the index information with the highest matching degree with the user image and the driving scene, and determining the pre-stored voice instruction set corresponding to the index information as the voice guide instruction set.
10. The method according to any one of claims 1 to 9, further comprising:
and stopping executing the guide operation under the condition that the car machine is detected to have the application program running in the foreground.
11. The utility model provides a speech input guiding device of car machine which characterized in that includes:
the system comprises an acquisition unit, a display unit and a control unit, wherein the acquisition unit is used for acquiring a voice guide instruction set matched with a driving scene of a vehicle and a user portrait of a user;
and the execution unit is used for executing the guiding operation on at least one voice guiding instruction in the voice guiding instruction set under the condition that the car machine is not detected to have the application program running in the foreground.
12. The utility model provides a car machine, its characterized in that includes:
the voice input guide apparatus of claim 11.
CN202010519922.9A 2020-06-09 2020-06-09 Voice input guiding method, device and car machine Active CN113779300B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010519922.9A CN113779300B (en) 2020-06-09 2020-06-09 Voice input guiding method, device and car machine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010519922.9A CN113779300B (en) 2020-06-09 2020-06-09 Voice input guiding method, device and car machine

Publications (2)

Publication Number Publication Date
CN113779300A true CN113779300A (en) 2021-12-10
CN113779300B CN113779300B (en) 2024-05-07

Family

ID=78834526

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010519922.9A Active CN113779300B (en) 2020-06-09 2020-06-09 Voice input guiding method, device and car machine

Country Status (1)

Country Link
CN (1) CN113779300B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116016578A (en) * 2022-11-22 2023-04-25 中国第一汽车股份有限公司 Intelligent voice guiding method based on equipment state and user behavior

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130322665A1 (en) * 2012-06-05 2013-12-05 Apple Inc. Context-aware voice guidance
CN104335152A (en) * 2012-06-05 2015-02-04 苹果公司 Providing navigation instructions while device is in locked mode
WO2015079331A1 (en) * 2013-11-28 2015-06-04 Sony Corporation Application activation method and apparatus and electronic equipment
CN108766423A (en) * 2018-05-25 2018-11-06 三星电子(中国)研发中心 A kind of active awakening method and device based on scene
CN109377115A (en) * 2018-12-19 2019-02-22 Oppo广东移动通信有限公司 Vehicular applications recommended method, device, terminal device and storage medium
CN110096249A (en) * 2018-01-31 2019-08-06 阿里巴巴集团控股有限公司 Methods, devices and systems for prompting fast to wake up word
CN110275692A (en) * 2019-05-20 2019-09-24 北京百度网讯科技有限公司 A kind of recommended method of phonetic order, device, equipment and computer storage medium
CN110472095A (en) * 2019-08-16 2019-11-19 百度在线网络技术(北京)有限公司 Voice guide method, apparatus, equipment and medium
CN110784833A (en) * 2019-08-29 2020-02-11 腾讯科技(深圳)有限公司 Message reminding method and device, vehicle-mounted equipment and storage medium

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130322665A1 (en) * 2012-06-05 2013-12-05 Apple Inc. Context-aware voice guidance
CN104335152A (en) * 2012-06-05 2015-02-04 苹果公司 Providing navigation instructions while device is in locked mode
WO2015079331A1 (en) * 2013-11-28 2015-06-04 Sony Corporation Application activation method and apparatus and electronic equipment
CN110096249A (en) * 2018-01-31 2019-08-06 阿里巴巴集团控股有限公司 Methods, devices and systems for prompting fast to wake up word
CN108766423A (en) * 2018-05-25 2018-11-06 三星电子(中国)研发中心 A kind of active awakening method and device based on scene
CN109377115A (en) * 2018-12-19 2019-02-22 Oppo广东移动通信有限公司 Vehicular applications recommended method, device, terminal device and storage medium
CN110275692A (en) * 2019-05-20 2019-09-24 北京百度网讯科技有限公司 A kind of recommended method of phonetic order, device, equipment and computer storage medium
CN110472095A (en) * 2019-08-16 2019-11-19 百度在线网络技术(北京)有限公司 Voice guide method, apparatus, equipment and medium
CN110784833A (en) * 2019-08-29 2020-02-11 腾讯科技(深圳)有限公司 Message reminding method and device, vehicle-mounted equipment and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116016578A (en) * 2022-11-22 2023-04-25 中国第一汽车股份有限公司 Intelligent voice guiding method based on equipment state and user behavior
CN116016578B (en) * 2022-11-22 2024-04-16 中国第一汽车股份有限公司 Intelligent voice guiding method based on equipment state and user behavior

Also Published As

Publication number Publication date
CN113779300B (en) 2024-05-07

Similar Documents

Publication Publication Date Title
CN111767021A (en) Voice interaction method, vehicle, server, system and storage medium
EP3319081A1 (en) On-board voice command identification method and apparatus, and storage medium
EP3166023A1 (en) In-vehicle interactive system and in-vehicle information appliance
EP2518447A1 (en) System and method for fixing user input mistakes in an in-vehicle electronic device
CN109631920B (en) Map application with improved navigation tool
CN105719648B (en) personalized unmanned vehicle interaction method and unmanned vehicle
JP2008058039A (en) On-vehicle device for collecting dissatisfaction information, information collection center, and system for collecting dissatisfaction information
EP2669631A1 (en) Apparatus for operating in-vehicle information apparatus
US10741178B2 (en) Method for providing vehicle AI service and device using the same
CN116368353A (en) Content aware navigation instructions
US20200286479A1 (en) Agent device, method for controlling agent device, and storage medium
CN102867005A (en) Retrieving device, retrieving method and vehicle-mounted navigation apparatus
CN109976515B (en) Information processing method, device, vehicle and computer readable storage medium
CN113779300B (en) Voice input guiding method, device and car machine
CN110767219A (en) Semantic updating method, device, server and storage medium
JP2013097758A (en) Information processing system, server device, terminal device, program and information processing method
US8886668B2 (en) Navigation system with search-term boundary detection mechanism and method of operation thereof
CN111261149B (en) Voice information recognition method and device
JP2015007595A (en) Device for vehicle, communication system, communication method, and program
JP2018081102A (en) Communication device, communication method, and program
US11620994B2 (en) Method for operating and/or controlling a dialog system
CN116168704B (en) Voice interaction guiding method, device, equipment, medium and vehicle
KR20200044777A (en) Method and system for recommending inforamtion contents based on question and answer between user and voice agent while moving
CN112562668A (en) Semantic information deviation rectifying method and device
CN113888846B (en) Method and device for reminding driving in advance

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant