WO2023173657A1 - Intelligent interaction method and apparatus, device, and storage medium - Google Patents

Intelligent interaction method and apparatus, device, and storage medium

Info

Publication number
WO2023173657A1
WO2023173657A1 PCT/CN2022/110117 CN2022110117W WO2023173657A1 WO 2023173657 A1 WO2023173657 A1 WO 2023173657A1 CN 2022110117 W CN2022110117 W CN 2022110117W WO 2023173657 A1 WO2023173657 A1 WO 2023173657A1
Authority
WO
WIPO (PCT)
Prior art keywords
interaction
requirement
interest
interactive content
vehicle
Prior art date
Application number
PCT/CN2022/110117
Other languages
French (fr)
Chinese (zh)
Inventor
黄际洲
张昊
Original Assignee
北京百度网讯科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京百度网讯科技有限公司 filed Critical 北京百度网讯科技有限公司
Publication of WO2023173657A1 publication Critical patent/WO2023173657A1/en

Classifications

    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01C MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/26 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
    • G01C21/34 Route searching; Route guidance
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01C MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/26 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
    • G01C21/34 Route searching; Route guidance
    • G01C21/3453 Special cost functions, i.e. other than distance or default speed limit of road segments
    • G01C21/3492 Special cost functions, i.e. other than distance or default speed limit of road segments employing speed data or traffic data, e.g. real-time or historical
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01C MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/26 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
    • G01C21/34 Route searching; Route guidance
    • G01C21/36 Input/output arrangements for on-board computers
    • G PHYSICS
    • G08 SIGNALLING
    • G08G TRAFFIC CONTROL SYSTEMS
    • G08G1/00 Traffic control systems for road vehicles
    • G08G1/09 Arrangements for giving variable traffic instructions
    • G08G1/0962 Arrangements for giving variable traffic instructions having an indicator mounted inside the vehicle, e.g. giving voice messages
    • G08G1/0968 Systems involving transmission of navigation instructions to the vehicle
    • G PHYSICS
    • G08 SIGNALLING
    • G08G TRAFFIC CONTROL SYSTEMS
    • G08G1/00 Traffic control systems for road vehicles
    • G08G1/09 Arrangements for giving variable traffic instructions
    • G08G1/0962 Arrangements for giving variable traffic instructions having an indicator mounted inside the vehicle, e.g. giving voice messages
    • G08G1/0968 Systems involving transmission of navigation instructions to the vehicle
    • G08G1/096855 Systems involving transmission of navigation instructions to the vehicle where the output is provided in a suitable form to the driver

Definitions

  • the present disclosure relates to the field of computer technology, specifically to artificial intelligence, intelligent transportation, voice technology, etc., and in particular to an intelligent interaction method, device, equipment and storage medium.
  • the present disclosure provides an intelligent interaction method, device, equipment and storage medium.
  • a method of intelligent interaction is provided, which may include the following steps:
  • driving parameters are generated, and the driving parameters include parameters used to drive the digital human in the interactive interface to broadcast the interactive content.
  • an intelligent interactive device which may include:
  • the traffic information acquisition module is used to obtain corresponding traffic information based on the detected interaction requirements
  • the interactive content generation module is used to generate interactive content based on interactive needs and screening results of traffic information
  • the digital human driving module is used to generate driving parameters based on interactive content.
  • the driving parameters include parameters used to drive the digital human in the interactive interface to broadcast interactive content.
  • an electronic device including:
  • a memory communicatively connected to the at least one processor; wherein,
  • the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor, so that the at least one processor can perform the method in any embodiment of the present disclosure.
  • a non-transitory computer-readable storage medium storing computer instructions, the computer instructions being used to cause a computer to execute the method in any embodiment of the present disclosure.
  • a computer program product including a computer program/instructions that, when executed by a processor, implements the method in any embodiment of the present disclosure.
  • terminals with intelligent interaction functions, such as in-vehicle head units, can proactively identify interaction needs and distinguish between different interaction needs.
  • Terminals with intelligent interaction functions can determine traffic information based on interaction needs, thereby providing assistance to drivers from different dimensions.
  • Figure 1 is a flow chart of a method of intelligent interaction according to some embodiments of the present disclosure
  • Figure 2 is a schematic diagram of an interactive interface according to some embodiments of the present disclosure.
  • Figure 3 is a flow chart for obtaining traffic information according to some embodiments of the present disclosure.
  • Figure 4 is a flowchart of generating interactive content according to some embodiments of the present disclosure.
  • Figure 5 is a schematic diagram of an interactive interface according to some embodiments of the present disclosure.
  • Figure 6 is a flowchart of a manner of confirming navigation requirements according to some embodiments of the present disclosure
  • Figure 7 is a flowchart of generating interactive content according to some embodiments of the present disclosure.
  • Figure 8 is a schematic diagram of an interactive interface according to some embodiments of the present disclosure.
  • Figure 9 is a flowchart of generating interactive content according to some embodiments of the present disclosure.
  • Figure 10 is a schematic diagram of an interactive interface according to some embodiments of the present disclosure.
  • Figure 11 is a flowchart of determining interest point recommendation requirements according to some embodiments of the present disclosure.
  • Figure 12 is a flowchart of determining interest point recommendation requirements according to some embodiments of the present disclosure.
  • Figure 13 is a scene diagram of a method that can implement intelligent interaction according to an embodiment of the present disclosure
  • Figure 14 is a schematic diagram of an intelligent interactive device according to some embodiments of the present disclosure.
  • FIG. 15 is a block diagram of an electronic device used to implement an intelligent interaction method according to an embodiment of the present disclosure.
  • the present disclosure relates to a method of intelligent interaction, which may include the following steps:
  • S102 Generate interactive content based on interaction requirements and the screening results of traffic information
  • the driving parameters include parameters used to drive the digital human in the interactive interface to broadcast the interactive content.
  • the execution subject of this disclosure may be the in-vehicle head unit of a vehicle with automatic driving or assisted driving functions, or a cloud server that communicates with the head unit, etc.
  • the interaction needs may be determined by analyzing the data detected by the vehicle's sensors.
  • sensors may include image capture devices, sound capture devices, etc. inside the vehicle.
  • sensors may also include image sensors outside the vehicle, sensors on different components of the vehicle, etc.
  • the vehicle only includes the driver.
  • Interaction requirements can be actively triggered by the driver.
  • the sound collection device collects the driver's trigger command
  • interaction requirements can be triggered by detecting driver behavior. For example, by analyzing the content collected by image collection equipment and sound collection equipment, it can be determined whether the driver is tired, hungry, irritated, etc. Based on this, it can be determined that an interaction requirement is detected.
  • the interaction requirement can be triggered based on the status of the vehicle. For example, when the vehicle's driving speed is lower than the corresponding speed threshold for a long time, it can be determined that a traffic jam has occurred, and thus it can be determined that the interaction requirement has been detected. For another example, when the sensor of a certain component of the vehicle detects that the component is not in normal working condition, it can also be determined that an interaction requirement is detected.
  • the interaction objects may also include other passengers.
  • corresponding traffic information can be obtained.
  • the traffic information obtained can be determined based on the interaction content sent by the driver. For example, if the driver's interaction content is "Chat with me", then the corresponding traffic information can be the traffic condition information of the remaining road sections. Based on this, traffic information can be referenced when generating interactive content. For example, chatting with the driver can only be supported if it is determined that the current vehicle is traveling on a road section with relatively simple road conditions.
  • points of interest can be determined as corresponding traffic information.
  • the identified points of interest can be hotels or service areas.
  • the identified points of interest can be restaurants, etc.
  • the method can be the same as the handling described above for the case where the driver's interaction content is "Chat with me".
  • the corresponding traffic information obtained may be other routes found by a query, etc.
  • the corresponding traffic information obtained can be nearby parking spots, or nearby maintenance points, etc.
  • Filtering of traffic information can include the following situations.
  • the traffic information is multiple navigation paths including traffic condition information
  • the navigation path with relatively simple traffic conditions can be used as the filtering result.
  • the target interest point can be selected as the filtering result based on the driver's preferences or the evaluation of each interest point.
  • the interactive content may be content formed by nesting the filtered results of traffic information using predetermined utterances that match the interaction requirements. In addition, it can also be summary information generated based on the filtering results, etc.
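The idea of nesting a filtered result into a predetermined utterance can be illustrated with a minimal sketch. The template wording, the requirement keys, and the function name `build_interactive_content` are hypothetical and are not taken from the disclosure.

```python
# Hypothetical predetermined utterances, keyed by interaction requirement.
TEMPLATES = {
    "navigation": (
        "I found a new route, {route}. It is about {minutes_saved} minutes "
        "faster. Do you want to switch?"
    ),
    "poi_recommendation": (
        "{name} is about {distance_km} km ahead. Shall I navigate there?"
    ),
}


def build_interactive_content(requirement: str, filtered_result: dict) -> str:
    """Nest the filtered traffic information into the matching utterance."""
    template = TEMPLATES[requirement]
    return template.format(**filtered_result)


if __name__ == "__main__":
    print(build_interactive_content(
        "navigation", {"route": "Route A", "minutes_saved": 2}))
```

A summary-generation approach, which the text also mentions, would replace the fixed templates with generated text.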
  • the digital human can be regarded as a product of digital character technology and artificial intelligence technology.
  • a digital human can be a virtual character with a digital appearance that exists by means of a display device.
  • a digital human can have a human appearance (including a cartoon image) and specific characteristics such as looks, gender, and personality.
  • the digital human can output voice, movements, etc.
  • the interactive interface including the digital human can be overlaid on the original picture of the in-vehicle screen as a translucent floating layer.
  • the interactive interface may include multiple display areas. For example, as shown in FIG. 2 , the interactive interface may include a digital human area, an interactive content display area, and an additional content display area.
  • the digital human area includes at least one digital human, and the digital human can perform expressions and different mouth shapes through driving parameters to correspondingly broadcast interactive content.
  • the additional content display area can be used to display additional content such as candidate roads and candidate interest point introductions, and the interactive content display area can be used to display the interaction history within a certain period of time in the past.
  • terminals with intelligent interaction functions, such as in-vehicle head units, can actively detect interaction needs. This allows such terminals to determine traffic information based on interaction needs and provide assistance to drivers from different dimensions.
  • the acquisition method of interaction requirements involved in step S101 may include the following process:
  • S301 Analyze the obtained reference information to obtain an analysis result;
  • the reference information includes at least one of user information and current vehicle driving information, and the driving information includes vehicle parameters detected by the vehicle sensor;
  • user information includes at least one of the user's voice information and action information;
  • the reference information may be information collected with the driver's authorization.
  • the reference information may be driver information. Specifically, it may be video information, image information, etc. including the driver collected by an image collection device.
  • the driver's action information can be determined based on the video information and/or image information.
  • it can also be the driver's voice information collected through the sound collection device.
  • the reference information may be the driving information of the current vehicle.
  • the driving information may include vehicle parameters detected by vehicle sensors, such as vehicle speed, remaining fuel level, tire pressure, engine parameters, gearbox parameters, motor parameters, battery parameters, etc.
  • if the voice information contains content with a clear instruction, such as "turn on interactive mode" or "chat with me", this indicates that the driver has an interaction need.
  • detecting that the driver makes a specific action through video information and/or image information can also indicate that predetermined conditions are met, that is, the driver has interaction needs. It is easy to understand that the above instructions and specific actions are determined in advance. For this, interaction needs can be determined as companionship needs.
  • the interaction requirement can be determined as a companionship requirement or a point-of-interest recommendation requirement.
  • the corresponding point of interest can be selected according to the driver's specific situation. For example, a movie theater, a restaurant, or going home.
  • when the vehicle parameters show that the vehicle speed is 0, or that the vehicle speed has been lower than the corresponding threshold for a long period of time, traffic congestion can be inferred. From this, it can be determined that the predetermined condition is satisfied. In this case, the interaction requirement can be determined as a navigation requirement, that is, navigating along another path to avoid the congestion.
  • the interaction requirement can be determined as the point of interest recommendation requirement.
  • Points of interest can be 4S dealerships (automobile sales and service shops), service areas, vehicle repair points, etc.
  • interaction requirements can be determined from driver information, vehicle information, etc., so that more diversified services can be provided to drivers.
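The mapping from parsed reference information to an interaction requirement, as described above, might be sketched as follows. This is only an illustration: the field names, the order in which conditions are checked, the 10-second limit, and the choice of a point-of-interest recommendation (rather than companionship) for a tired or hungry driver are all assumptions.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Requirement(Enum):
    NAVIGATION = "navigation"
    COMPANIONSHIP = "companionship"
    POI_RECOMMENDATION = "poi_recommendation"


# Trigger phrases quoted in the text; matching is case-insensitive.
TRIGGER_PHRASES = ("turn on interactive mode", "chat with me")


@dataclass
class ReferenceInfo:
    """Hypothetical parsed result of user information and vehicle parameters."""
    voice_text: str = ""
    driver_state: Optional[str] = None   # e.g. "fatigue" or "hunger"
    seconds_below_speed_threshold: float = 0.0
    component_fault: bool = False


def detect_requirement(info: ReferenceInfo,
                       slow_limit_s: float = 10.0) -> Optional[Requirement]:
    """Return the detected interaction requirement, or None if none applies."""
    text = info.voice_text.lower()
    if any(phrase in text for phrase in TRIGGER_PHRASES):
        return Requirement.COMPANIONSHIP
    if info.driver_state in {"fatigue", "hunger"}:
        return Requirement.POI_RECOMMENDATION
    if info.component_fault:
        return Requirement.POI_RECOMMENDATION
    if info.seconds_below_speed_threshold > slow_limit_s:
        return Requirement.NAVIGATION
    return None
```

For example, `detect_requirement(ReferenceInfo(voice_text="Chat with me"))` yields the companionship requirement, while an empty `ReferenceInfo()` yields no requirement.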
  • step S102 may include the following process:
  • S401 Based on the navigation requirements, select the target navigation path that meets the specified conditions from the traffic information.
  • the specified conditions are determined based on at least one of the driving mileage of the navigation path and the driving time of the navigation path;
  • S402 Generate interactive content according to the target navigation path.
  • the traffic information may be based on multiple navigation paths to the destination.
  • the result of filtering traffic information may be to select a target navigation path from multiple navigation paths.
  • the filtering method may be based on at least one of the driving mileage of the navigation path, the driving time of the navigation path, and the travel cost of the navigation path.
  • Travel costs can include fuel, electricity, tolls, etc.
  • the weight of driving time is greater than the weight of driving mileage
  • the weight of driving mileage is greater than the weight of travel cost.
  • for the determined target route, only its travel time and mileage need to be retained.
  • the corresponding interactive content can be "According to the current traffic status, more and more vehicles are expected to converge on the congested road section ahead. I have found a new route for you, Route XXX. It is about 2 minutes faster and the distance is about the same. Do you want to switch?"
  • when a traditional navigation interface recommends routes to users, it usually also displays navigation guidance information, toll station information, remaining mileage, remaining time, and recommended congestion-avoidance routes, as well as comparisons of route time, tolls, traffic lights, and mileage. For drivers who only want to know whether the congestion ahead is getting worse and whether they need to change routes, such information is too scattered to support quick decisions.
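The weighted filtering of candidate navigation paths described above can be sketched as a simple scoring function. The text only states that the weight of driving time exceeds that of mileage, which in turn exceeds that of travel cost; the concrete weights (0.6, 0.3, 0.1), the normalization by the maximum value, and all names below are assumptions.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Route:
    name: str
    minutes: float   # estimated driving time
    km: float        # driving mileage
    cost: float      # fuel, electricity, tolls, etc.


def select_target_route(routes: Sequence[Route],
                        w_time: float = 0.6,
                        w_km: float = 0.3,
                        w_cost: float = 0.1) -> Route:
    """Pick the route with the lowest weighted, normalized score."""
    if not routes:
        raise ValueError("at least one candidate route is required")
    if not (w_time > w_km > w_cost):
        raise ValueError("expected w_time > w_km > w_cost")
    # Normalize each dimension by its maximum so the weights are comparable;
    # fall back to 1.0 when every candidate has a zero value.
    max_min = max(r.minutes for r in routes) or 1.0
    max_km = max(r.km for r in routes) or 1.0
    max_cost = max(r.cost for r in routes) or 1.0

    def score(r: Route) -> float:
        return (w_time * r.minutes / max_min
                + w_km * r.km / max_km
                + w_cost * r.cost / max_cost)

    return min(routes, key=score)
```

With two candidates that differ only in driving time, the faster one is selected, which matches the "2 minutes faster, about the same distance" example in the text.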
  • the confirmation method of navigation requirements may include the following process:
  • S601 Determine the speed of the current vehicle within a predetermined time period based on vehicle parameters
  • S602 Determine the congestion level based on vehicle speed
  • the navigation requirement can be confirmed from the vehicle speed in the vehicle parameters. For example, during driving, if the vehicle speed is 0 for more than a predetermined time (for example, 10 seconds) for a reason other than a traffic light, or if the vehicle speed stays below the first speed threshold (for example, 5 km/h) for more than a predetermined time, the congestion level can be determined to be "critical". For another example, if the vehicle speed stays between the first speed threshold and the second speed threshold (for example, 15 km/h) for more than a predetermined time, the congestion level can be determined to be "normal".
  • the navigation requirement can be determined from the congestion level. For example, when the congestion level is higher than "normal", the interaction requirement can be determined to be a navigation requirement.
  • the navigation needs can be determined based on the vehicle speed.
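Steps S601 and S602 can be sketched with the example thresholds given above (5 km/h, 15 km/h, 10 seconds). The sampling scheme, the function names, and the omission of the traffic-light check are assumptions made to keep the sketch short.

```python
import math
from typing import Optional, Sequence


def congestion_level(speeds_kmh: Sequence[float],
                     sample_period_s: float = 1.0,
                     min_duration_s: float = 10.0,
                     first_threshold_kmh: float = 5.0,
                     second_threshold_kmh: float = 15.0) -> Optional[str]:
    """Classify congestion from the most recent speed samples.

    Returns "critical", "normal", or None when no congestion is detected.
    Stops caused by traffic lights are not filtered out in this sketch.
    """
    needed = max(1, math.ceil(min_duration_s / sample_period_s))
    window = list(speeds_kmh)[-needed:]
    if len(window) < needed:
        return None  # not enough history to cover the predetermined time
    if all(v < first_threshold_kmh for v in window):
        return "critical"
    if all(first_threshold_kmh <= v < second_threshold_kmh for v in window):
        return "normal"
    return None


def is_navigation_requirement(level: Optional[str]) -> bool:
    """A navigation requirement is raised only above the "normal" level."""
    return level == "critical"
```

Here ten one-second samples of 3 km/h are classified as "critical" and trigger a navigation requirement, while ten samples of 10 km/h are classified as "normal" and do not.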
  • step S102 may include the following process:
  • S701 Determine a safe driving section based on the target navigation path in the traffic information and the corresponding road conditions.
  • the safe driving section is a road section where the complexity of the road conditions is lower than the corresponding complexity threshold;
  • S702 Determine the driving time of the current vehicle on the safe driving section
  • S703 Use artificial intelligence-based content production technology AIGC to generate interactive content that matches the driving time.
  • the target navigation path and the traffic conditions corresponding to the target navigation path can be obtained from the traffic information.
  • the exit can be determined in the remaining road segments.
  • when the distance to the exit exceeds the corresponding distance threshold, the driver does not need to perform complex operations; that is, the vehicle is on a safe driving section where the road condition complexity is lower than the corresponding complexity threshold. In other words, the driver only needs to keep driving on the main road (highway or expressway), so the road condition can be considered less complex.
  • the driving time for safe driving sections can be estimated based on distance and road conditions.
  • when the vehicle is waiting at a red light, the remaining duration of the red light can be obtained from the traffic conditions. If the remaining duration is greater than the corresponding duration threshold, the vehicle can be regarded as being on a safe driving section whose road condition complexity is lower than the corresponding complexity threshold. That is, when the red light lasts a long time, the driver does not need to perform driving operations, so the road condition can be considered less complex. The road section corresponding to the current stopping position can thus be determined as a safe driving section. Correspondingly, the time remaining until the red light ends can be taken as the driving time of the safe driving section.
  • the traffic jam section can also be determined as a safe driving section where the road condition complexity is lower than the corresponding complexity threshold.
  • the time from driving to the end point of the traffic jam can be determined as the driving time of the safe driving section.
  • AIGC artificial intelligence content production technology
  • the subject of the topic can be a currently trending search topic, a topic determined from the driver's user profile, etc.
  • the corresponding topic can also be determined based on the questions raised by the driver. For example, ask whether sports events were won or lost, ask about celebrity anecdotes, etc.
  • the so-called AIGC means that artificial intelligence technology can be used to automatically generate multimedia content. For example, you can write poetry, compose music, generate news, etc.
  • AIGC can generate multimedia content based on hot events on the Internet, and can also generate multimedia content based on received user inquiries.
  • interaction strategies can also be determined based on user portraits.
  • user portraits can include the user's long-term preferences, medium-term preferences, and short-term preferences.
  • long-term preferences may include the user's interests and occupation, etc.
  • the mid-term preference can be the current purpose of travel, such as traveling, attending a conference, etc.
  • Short-term preferences can be the user's current emotional needs, such as current mood and tolerance to interruptions.
  • priorities can be determined in order of short-term, medium-term, and long-term, so as to better improve the user experience.
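The safe-driving-section rules of steps S701 and S702 (distance to the next exit, a long red light, or a traffic jam) can be sketched as a function that returns the time budget available for generated content. The field names, the threshold values, and the order in which the three cases are checked are assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RoadContext:
    """Hypothetical snapshot derived from the target path and road conditions."""
    km_to_next_exit: Optional[float] = None
    speed_kmh: float = 0.0
    red_light_remaining_s: Optional[float] = None
    jam_clear_s: Optional[float] = None  # estimated time to the end of a jam


def safe_window_seconds(ctx: RoadContext,
                        exit_km_threshold: float = 5.0,
                        red_light_threshold_s: float = 30.0) -> float:
    """Estimate the driving time on the current safe driving section.

    Returns 0.0 when the current section is not considered safe.
    """
    # Long red light: the time until the light changes is the safe window.
    if (ctx.red_light_remaining_s is not None
            and ctx.red_light_remaining_s > red_light_threshold_s):
        return ctx.red_light_remaining_s
    # Traffic jam: the time to reach the end of the jam is the safe window.
    if ctx.jam_clear_s is not None:
        return ctx.jam_clear_s
    # Main road with a distant exit: estimate the time to reach the exit.
    if (ctx.km_to_next_exit is not None
            and ctx.km_to_next_exit > exit_km_threshold
            and ctx.speed_kmh > 0):
        return ctx.km_to_next_exit * 3600.0 / ctx.speed_kmh
    return 0.0
```

The returned duration could then be passed to the content generation step (S703) as an upper bound on the length of the broadcast.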
  • step S102 may include the following process:
  • S901 Determine the POI type corresponding to the POI recommendation requirement
  • S902 According to the type of interest point, select the target interest point from the candidate interest points included in the traffic information;
  • S903 Generate interactive content based on target points of interest.
  • Points of interest recommendation requirements may include multiple categories. For example, if a driver is detected to yawn or blink frequently, it may indicate that the driver is tired. Based on this, it can be determined that the point of interest recommendation requirement is the need for rest.
  • the point of interest type may be a leisure type, such as a service area, etc.
  • the store information contained in the service area obtained from the network can also be displayed based on the point of interest retrieval function.
  • service area information can be introduced to the user based on the brands of gas stations that the user often goes to.
  • the service area can also be introduced to the user based on the user's preferences, such as the type of food the user likes, preferred restaurants and shops, etc.
  • the type of interest point recommendation requirement can be determined based on the abnormal situation. For example, when a fuel vehicle is low on fuel, a gas station can be used as a point of interest type. When the power of new energy vehicles is insufficient, charging piles (stations) can be used as point-of-interest types. In the event that some parts of the vehicle fail, a 4S shop or repair point can be used as a point of interest type.
  • the corresponding target interest points are screened out from the candidate interest points included in the traffic information.
  • the target gas station can be selected as the target point of interest based on the gas station brand preferred by the driver.
  • the target hotel can be selected as the target point of interest based on the hotel's reputation, reviews, the driver's preferred hotel brand, etc. The selection methods for other target points of interest are similar and will not be described again.
  • the most suitable points of interest can be automatically selected as interaction results based on the detected interaction requirements.
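Steps S901 to S903 can be sketched as a lookup from the detected condition to a point-of-interest type, followed by a preference-aware selection. The condition keys, the `Poi` fields, and the ranking rule (preferred brand first, then rating) are assumptions; the text only says that brand preference, reputation, and reviews may be used.

```python
from dataclasses import dataclass
from typing import AbstractSet, Optional, Sequence

# Condition-to-type mapping following the examples in the text.
POI_TYPE_BY_CONDITION = {
    "fatigue": "service_area",
    "low_fuel": "gas_station",
    "low_battery": "charging_station",
    "component_fault": "repair_shop",
}


@dataclass(frozen=True)
class Poi:
    name: str
    poi_type: str
    brand: str
    rating: float


def select_target_poi(condition: str,
                      candidates: Sequence[Poi],
                      preferred_brands: AbstractSet[str] = frozenset()
                      ) -> Optional[Poi]:
    """Pick the best candidate of the type that matches the condition."""
    poi_type = POI_TYPE_BY_CONDITION.get(condition)
    matching = [p for p in candidates if p.poi_type == poi_type]
    if not matching:
        return None
    # Preferred brands win; ties are broken by the higher rating.
    return max(matching, key=lambda p: (p.brand in preferred_brands, p.rating))
```

With a preferred brand supplied, a lower-rated station of that brand is chosen over a higher-rated station of another brand; without a preference, the rating decides.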
  • the method for determining the recommendation requirements for points of interest may include the following process:
  • S1101 Determine the type of the user's physiological state based on the user information
  • S1102 When the type of physiological state is a predetermined type, determine the interaction requirement as a point of interest recommendation requirement, and the predetermined type includes at least one of fatigue, hunger, an urgent need to use the restroom, and illness.
  • the determination of the interest point recommendation requirements may be based on the driver's information. For example, different physiological states of the driver can be determined based on the detection results of images, videos, voices, etc. of the driver (corresponding to the user).
  • the type of the driver's physiological state may be determined to be fatigue.
  • the interaction requirements include the recommendation requirements for points of interest, and hotels, service areas, etc. are used as points of interest to be recommended.
  • the interaction requirements include the recommendation requirements for points of interest, and restaurants, etc. are used as points of interest to be recommended.
  • the interaction requirements include the recommendation requirements for points of interest, and service areas, shopping malls, etc. are used as points of interest to be recommended.
  • the interaction requirements include the recommendation requirements for points of interest, and service areas, hospitals, etc. are used as points of interest to be recommended.
  • the driver's status can be determined based on the detection of the driver. Then the requirements for recommendation of points of interest are confirmed.
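The relationship between the predetermined physiological states (S1102) and the recommended point-of-interest types can be written as a small lookup table. The pairing below assumes that the four recommendation examples above correspond, in order, to the four predetermined types; the key names and the function name are hypothetical.

```python
from typing import Dict, List

# Assumed pairing of predetermined physiological states with POI types.
POI_TYPES_BY_STATE: Dict[str, List[str]] = {
    "fatigue": ["hotel", "service_area"],
    "hunger": ["restaurant"],
    "restroom_urgency": ["service_area", "shopping_mall"],
    "illness": ["service_area", "hospital"],
}


def poi_types_for_state(state: str) -> List[str]:
    """Return POI types to recommend; an empty list means no requirement."""
    return list(POI_TYPES_BY_STATE.get(state, []))
```

A state outside the predetermined types, such as "relaxed", returns an empty list, meaning that no point-of-interest recommendation requirement is raised.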
  • the method for determining the recommendation requirements for points of interest may include the following process:
  • S1201 Determine the current vehicle condition based on vehicle parameters
  • the interaction requirement can be determined as the POI recommendation requirement.
  • the present disclosure relates to a method of intelligent interaction, which is executed by an input layer, a calculation layer, and an output layer.
  • the main function of the input layer is to receive user (driver) input triggers or actively trigger based on detected information.
  • the input layer can include a speech recognition module, an audio and video fusion emotion recognition module, a scene discrimination module, etc.
  • the speech recognition module is based on automatic speech recognition technology (Automatic Speech Recognition, ASR). It recognizes the user's voice in the scenario where the user actively wakes up and converts the voice into text.
  • the speech recognition module has full-duplex multi-round communication capabilities.
  • the audio and video fusion emotion recognition module is used to understand the user's current status through voice and visual emotion recognition with the user's authorization, such as fatigue recognition, emotion classification (anger, relaxation, joy, boredom, etc.).
  • the scene discrimination module makes active trigger judgments based on preset rules, such as whether the user is in congestion, whether the congestion is serious, whether the user passes through scenic spots or rest areas, and whether the user needs to rest after driving for a long time, etc.
  • demand analysis processing requires the construction of user portrait capabilities, and understanding the user's long-term (interests, occupations, etc.), mid-term (travel purpose, etc.) and short-term (current mood and tolerance to interruption) needs and preferences. Complete the understanding of requirements based on user input or triggering of preset rules.
  • the main function of the computing layer is to serve the needs of users and complete background calculations.
  • the computing layer can include a traffic brain module, a knowledge graph module, a map point of interest module, a retrieval module, etc.
  • the traffic brain module can be used for traffic prediction, dynamic event perception, etc., providing users with rich route traffic information.
  • Dynamic event awareness can include road construction, road control, road accidents, etc.
  • the knowledge graph module is used to provide chat information. For example, when users want to know about certain news, gossip, entertainment, and sports information, the knowledge graph module will integrate, calculate, and output the content based on this information.
  • the map point of interest module is used to provide information related to points of interest.
  • the map point of interest module will provide the user with relevant content, such as business hours, consumption status, comments, etc.
  • the retrieval module can be used to search for points of interest. When the user's needs are related to searching for a geographical location, such as if the user wants to refuel nearby, rest nearby, etc., the retrieval module will provide services for the user.
  • the main function of the output layer is to present content to users based on the results of the analysis processing and the results output by the calculation layer. The output layer includes a navigation product service module, an automatic content generation module, a voice broadcast module, a 3D virtual human synthesis module, etc.
  • the navigation product service module is used to directly adjust and change product services for users when the user's needs are related to the operation of the navigation product page, such as enlarging the base map, adjusting to the front mode, etc.
  • the automatic content generation module combines the user's preference settings to automatically produce content when it is necessary to provide users with oral content.
  • the broadcast module is used to convert the text generated by the automatic content generation module into speech.
  • the broadcasting module can use real-life vocal samples for model training, thereby converting the text generated by the automatic content generation module into speech.
  • the 3D virtual human synthesis module can drive the 3D virtual human according to the content to be played, so that the 3D virtual human can broadcast the content.
  • an intelligent interactive device which may include:
  • the traffic information acquisition module 1401 is used to obtain corresponding traffic information according to the detected interaction requirements
  • the interactive content generation module 1402 is used to generate interactive content based on the interaction requirements and the screening results of traffic information;
  • the digital human driving module 1403 is used to generate driving parameters based on interactive content.
  • the driving parameters include parameters used to drive the digital human in the interactive interface to broadcast interactive content.
  • the traffic information acquisition module 1401 may include:
  • the parsing sub-module is used to parse the obtained reference information to obtain parsing results;
  • the reference information includes at least one of user information and current vehicle driving information, and the driving information includes vehicle parameters detected by vehicle sensors;
  • the user information includes at least one of the user's voice information and action information;
  • the interaction requirement determination execution sub-module is used to determine that an interaction requirement is detected when the parsing result satisfies a predetermined condition, and the interaction requirement includes at least one of a navigation requirement, a companionship requirement, or a point of interest recommendation requirement.
  • the interactive content generation module 1402 may include:
  • the target navigation path determination submodule is used to filter out the target navigation path that meets the specified conditions from the traffic information according to the navigation requirements.
  • the specified conditions are determined based on at least one of the driving mileage of the navigation path and the driving time of the navigation path;
  • the interactive content generation execution sub-module is used to generate the interactive content according to the target navigation path.
  • the interaction requirement determination execution submodule may include:
  • a vehicle speed condition determination unit is used to determine the vehicle speed condition of the current vehicle within a predetermined time period based on vehicle parameters
  • the congestion level determination unit is used to determine the congestion level based on the vehicle speed condition
  • the interaction demand determination unit is used to determine the interaction demand as a navigation demand when the congestion level meets the preset congestion standard.
  • the interactive content generation module 1402 may include:
  • the safe driving section determination submodule is used to determine a safe driving section based on the target navigation path and corresponding road conditions in the traffic information.
  • a safe driving section is a road section where the complexity of the road conditions is lower than the corresponding complexity threshold;
  • the driving time determination submodule is used to determine the driving time of the current vehicle on the safe driving section
  • the interactive content generation execution sub-module is used to generate interactive content that matches the driving time using AIGC, a content production technology based on artificial intelligence.
  • the interactive content generation module 1402 may include:
  • the interest point type determination sub-module is used to determine the interest point type corresponding to the interest point recommendation requirements
  • the target point of interest screening submodule is used to filter out target points of interest from the candidate interest points included in the traffic information according to the type of interest point;
  • the interactive content generation execution submodule is used to generate interactive content based on target points of interest.
  • the interaction requirement determination execution submodule may include:
  • a type determination unit used to determine the type of the user's physiological state based on the user information
  • the interaction demand determination unit is configured to determine the interaction demand as the interest point recommendation demand when the type of physiological state is a predetermined type, and the predetermined type includes at least one of fatigue, hunger, an urgent need for a restroom, and illness.
  • the interaction requirement determination execution submodule may include:
  • the vehicle condition determination unit is used to determine the current vehicle condition based on vehicle parameters
  • the interaction demand determination unit is used to determine the interaction demand as the recommendation demand for points of interest when the vehicle condition reaches the fault standard.
  • the present disclosure also provides an electronic device, a readable storage medium, and a computer program product.
  • Electronic devices are intended to refer to various forms of digital computers, such as laptop computers, desktop computers, workstations, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers. Electronic devices may also represent various forms of mobile devices, such as personal digital assistants, cellular phones, smart phones, wearable devices, and other similar computing devices.
  • the components shown herein, their connections and relationships, and their functions are examples only and are not intended to limit implementations of the disclosure described and/or claimed herein.
  • the device 1500 includes a computing unit 1510 that can perform various appropriate actions and processing according to a computer program stored in a read-only memory (ROM) 1520 or loaded from a storage unit 1580 into a random access memory (RAM) 1530. The RAM 1530 can also store various programs and data required for the operation of the device 1500.
  • Computing unit 1510, ROM 1520 and RAM 1530 are connected to each other via bus 1540.
  • An input/output (I/O) interface 1550 is also connected to bus 1540.
  • Multiple components in device 1500 are connected to I/O interface 1550, including: an input unit 1560, such as a keyboard or mouse; an output unit 1570, such as various types of displays and speakers; a storage unit 1580, such as a magnetic disk or optical disk; and a communication unit 1590, such as a network card, modem, or wireless communication transceiver.
  • the communication unit 1590 allows the device 1500 to exchange information/data with other devices through computer networks such as the Internet and/or various telecommunications networks.
  • Computing unit 1510 may be any of a variety of general-purpose and/or special-purpose processing components having processing and computing capabilities. Some examples of computing unit 1510 include, but are not limited to, central processing units (CPUs), graphics processing units (GPUs), various dedicated artificial intelligence (AI) computing chips, various computing units that run machine learning model algorithms, digital signal processors (DSPs), and any appropriate processor, controller, microcontroller, etc.
  • the computing unit 1510 performs the various methods and processes described above, such as the intelligent interaction method.
  • the intelligent interaction method may be implemented as a computer software program that is tangibly embodied in a machine-readable medium, such as the storage unit 1580.
  • part or all of the computer program may be loaded and/or installed onto device 1500 via ROM 1520 and/or communication unit 1590.
  • When the computer program is loaded into the RAM 1530 and executed by the computing unit 1510, one or more steps of the intelligent interaction method described above may be performed.
  • the computing unit 1510 may be configured to perform the intelligent interaction method in any other suitable manner (e.g., by means of firmware).
  • Various implementations of the systems and techniques described above may be implemented in digital electronic circuit systems, integrated circuit systems, field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on a chip (SOCs), complex programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof.
  • These various implementations may include implementation in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor. The programmable processor, which may be a special-purpose or general-purpose programmable processor, may receive data and instructions from a storage system, at least one input device, and at least one output device, and transmit data and instructions to the storage system, the at least one input device, and the at least one output device.
  • Program code for implementing the methods of the present disclosure may be written in any combination of one or more programming languages. The program code may be provided to a processor or controller of a general-purpose computer, special-purpose computer, or other programmable data processing device, such that the program code, when executed by the processor or controller, causes the functions/operations specified in the flowcharts and/or block diagrams to be implemented.
  • the program code may execute entirely on the machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
  • a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with an instruction execution system, apparatus, or device.
  • the machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
  • Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatuses, or devices, or any suitable combination of the foregoing.
  • More specific examples of machine-readable storage media would include an electrical connection based on one or more wires, a portable computer disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above.
  • To provide interaction with a user, the systems and techniques described herein may be implemented on a computer having a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user, and a keyboard and pointing device (e.g., a mouse or a trackball) through which the user can provide input to the computer.
  • Other kinds of devices may also be used to provide interaction with the user; for example, the feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback), and input from the user may be received in any form (including acoustic input, voice input, or tactile input).
  • the systems and techniques described herein may be implemented in a computing system that includes back-end components (e.g., a data server), or a computing system that includes middleware components (e.g., an application server), or a computing system that includes front-end components (e.g., a user's computer having a graphical user interface or web browser through which the user can interact with implementations of the systems and techniques described herein), or a computing system that includes any combination of such back-end, middleware, or front-end components.
  • the components of the system may be interconnected by any form or medium of digital data communication (e.g., a communications network). Examples of communication networks include: local area network (LAN), wide area network (WAN), and the Internet.
  • Computer systems may include clients and servers.
  • Clients and servers are generally remote from each other and typically interact over a communications network.
  • the relationship of client and server is created by computer programs running on corresponding computers and having a client-server relationship with each other.
  • the server can be a cloud server, a distributed system server, or a server combined with a blockchain.

Landscapes

  • Engineering & Computer Science (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Navigation (AREA)

Abstract

An intelligent interaction method and apparatus, a device, and a storage medium. The specific implementation solution comprises: S101: obtaining corresponding traffic information according to a detected interaction requirement; S102: generating interactive content according to the interaction requirement and a screening result of the traffic information; and S103: generating a driving parameter on the basis of the interactive content, the driving parameter comprising a parameter used for driving a digital human in an interactive interface to broadcast the interactive content.

Description

Intelligent interaction method, apparatus, device, and storage medium
Cross-reference to related applications
This application is based on Chinese patent application No. 202210273184.3, filed on March 18, 2022, and claims priority to that application, the entire content of which is incorporated herein by reference.
Technical field
The present disclosure relates to the field of computer technology, in particular to artificial intelligence, intelligent transportation, and voice technology, and more specifically to an intelligent interaction method, apparatus, device, and storage medium.
Background
With the continuous development of ground transportation, driving has become a common way of travel for many people. Most drivers rely on navigation while driving. The content provided by traditional navigation is relatively limited and cannot meet users' needs.
Summary
The present disclosure provides an intelligent interaction method, apparatus, device, and storage medium.
According to one aspect of the present disclosure, an intelligent interaction method is provided, which may include the following steps:
obtaining corresponding traffic information according to a detected interaction requirement;
generating interactive content according to the interaction requirement and a screening result of the traffic information; and
generating driving parameters based on the interactive content, the driving parameters including parameters used to drive a digital human in an interactive interface to broadcast the interactive content.
According to another aspect of the present disclosure, an intelligent interaction apparatus is provided, which may include:
a traffic information acquisition module, configured to obtain corresponding traffic information according to a detected interaction requirement;
an interactive content generation module, configured to generate interactive content according to the interaction requirement and a screening result of the traffic information; and
a digital human driving module, configured to generate driving parameters based on the interactive content, the driving parameters including parameters used to drive a digital human in an interactive interface to broadcast the interactive content.
According to another aspect of the present disclosure, an electronic device is provided, including:
at least one processor; and
a memory communicatively connected to the at least one processor; wherein
the memory stores instructions executable by the at least one processor, and the instructions, when executed by the at least one processor, enable the at least one processor to perform the method in any embodiment of the present disclosure.
According to another aspect of the present disclosure, a non-transitory computer-readable storage medium storing computer instructions is provided, the computer instructions being used to cause a computer to perform the method in any embodiment of the present disclosure.
According to another aspect of the present disclosure, a computer program product is provided, including a computer program/instructions that, when executed by a processor, implement the method in any embodiment of the present disclosure.
According to the technology of the present disclosure, a terminal with an intelligent interaction function, such as an in-vehicle head unit, can proactively identify interaction requirements and distinguish between different ones. Such a terminal can determine traffic information for the identified interaction requirement and thereby assist the driver in different ways.
It should be understood that what is described in this section is not intended to identify key or important features of the embodiments of the present disclosure, nor is it intended to limit the scope of the present disclosure. Other features of the present disclosure will become readily understood from the following description.
Description of the drawings
The accompanying drawings are provided for a better understanding of the present solution and do not limit the present disclosure. In the drawings:
Figure 1 is a flowchart of an intelligent interaction method according to some embodiments of the present disclosure;
Figure 2 is a schematic diagram of an interactive interface according to some embodiments of the present disclosure;
Figure 3 is a flowchart of obtaining traffic information according to some embodiments of the present disclosure;
Figure 4 is a flowchart of generating interactive content according to some embodiments of the present disclosure;
Figure 5 is a schematic diagram of an interactive interface according to some embodiments of the present disclosure;
Figure 6 is a flowchart of a manner of confirming a navigation requirement according to some embodiments of the present disclosure;
Figure 7 is a flowchart of generating interactive content according to some embodiments of the present disclosure;
Figure 8 is a schematic diagram of an interactive interface according to some embodiments of the present disclosure;
Figure 9 is a flowchart of generating interactive content according to some embodiments of the present disclosure;
Figure 10 is a schematic diagram of an interactive interface according to some embodiments of the present disclosure;
Figure 11 is a flowchart of determining a point of interest recommendation requirement according to some embodiments of the present disclosure;
Figure 12 is a flowchart of determining a point of interest recommendation requirement according to some embodiments of the present disclosure;
Figure 13 is a diagram of a scenario in which the intelligent interaction method of an embodiment of the present disclosure can be implemented;
Figure 14 is a schematic diagram of an intelligent interaction apparatus according to some embodiments of the present disclosure;
Figure 15 is a block diagram of an electronic device used to implement the intelligent interaction method of an embodiment of the present disclosure.
Detailed description
Exemplary embodiments of the present disclosure are described below with reference to the accompanying drawings. Various details of the embodiments are included to facilitate understanding and should be considered merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the present disclosure. Likewise, descriptions of well-known functions and structures are omitted from the following description for clarity and conciseness.
As shown in Figure 1, the present disclosure relates to an intelligent interaction method, which may include the following steps:
S101: obtaining corresponding traffic information according to a detected interaction requirement;
S102: generating interactive content according to the interaction requirement and a screening result of the traffic information;
S103: generating driving parameters based on the interactive content, the driving parameters including parameters used to drive a digital human in an interactive interface to broadcast the interactive content.
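As a rough illustration only, steps S101 to S103 can be read as a three-stage pipeline. The sketch below is in Python and assumes hypothetical names throughout (`run_interaction`, `DrivingParameters`, and the three callables are not from the disclosure); it shows the order of the steps, not how any stage is actually implemented.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class DrivingParameters:
    """Hypothetical container for what drives the digital human."""
    speech_text: str              # text the digital human broadcasts
    expression: str = "neutral"   # assumed expression label


def run_interaction(
    requirement: str,
    fetch_traffic_info: Callable[[str], List[Dict]],
    screen: Callable[[str, List[Dict]], List[Dict]],
    compose: Callable[[str, List[Dict]], str],
) -> DrivingParameters:
    # S101: obtain traffic information for the detected requirement.
    traffic_info = fetch_traffic_info(requirement)
    # S102: screen the traffic information, then generate interactive content.
    screened = screen(requirement, traffic_info)
    content = compose(requirement, screened)
    # S103: generate driving parameters from the interactive content.
    return DrivingParameters(speech_text=content)


if __name__ == "__main__":
    params = run_interaction(
        "navigation",
        fetch_traffic_info=lambda req: [
            {"route": "A", "minutes": 30},
            {"route": "B", "minutes": 22},
        ],
        screen=lambda req, info: sorted(info, key=lambda r: r["minutes"])[:1],
        compose=lambda req, res: f"Route {res[0]['route']} takes about {res[0]['minutes']} minutes.",
    )
    print(params.speech_text)
```

Passing the three stages in as callables keeps the sketch neutral about where each stage runs, for example on the head unit or on a cloud server.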
The execution subject of the present disclosure may be the head unit of a vehicle with autonomous or assisted driving functions, or a cloud server that communicates with the head unit. The interaction requirement may be determined by analyzing data detected by the vehicle's sensors. For example, the sensors may include image capture devices and sound capture devices inside the vehicle, as well as image sensors outside the vehicle and sensors on various vehicle components.
Take the case where the driver is the only occupant of the vehicle as an example. The interaction requirement may be actively triggered by the driver. For example, when the sound capture device picks up a trigger command from the driver, it can be determined that an interaction requirement is detected. Alternatively, the interaction requirement may be triggered by detecting the driver's behavior. For example, by analyzing the content captured by the image capture device and the sound capture device, it can be determined that the driver is fatigued, hungry, impatient, and so on. On this basis, it can be determined that an interaction requirement is detected. As a further alternative, the interaction requirement may be triggered by the state of the vehicle. For example, when the vehicle's speed stays below the corresponding speed threshold for a long time, it can be determined that there is a traffic jam, and thus that an interaction requirement is detected. As another example, when the sensor of a vehicle component detects that the component is not in its normal working state, it can also be determined that an interaction requirement is detected. In addition to the driver in the above examples, the interaction objects may also include other passengers.
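The vehicle-state trigger described above (speed staying below a threshold for a long time) could be sketched as follows. This is a minimal Python sketch under stated assumptions: the function name `is_congested`, the 10 km/h threshold, and the 80% fraction are all illustrative values, not figures from the disclosure.

```python
from typing import Sequence


def is_congested(
    speeds_kmh: Sequence[float],
    threshold_kmh: float = 10.0,   # assumed speed threshold
    min_fraction: float = 0.8,     # assumed share of slow samples
) -> bool:
    """Return True if most speed samples in the window are below the threshold.

    `speeds_kmh` is assumed to hold samples collected over the predetermined
    time window, e.g. one reading per second from the vehicle speed sensor.
    """
    if not speeds_kmh:
        return False  # no data, so no requirement is raised
    slow = sum(1 for v in speeds_kmh if v < threshold_kmh)
    return slow / len(speeds_kmh) >= min_fraction
```

Using a fraction of slow samples rather than requiring every sample to be slow is a design choice of this sketch; it tolerates brief bursts of movement in stop-and-go traffic.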
Corresponding traffic information can be obtained for each kind of interaction requirement. For an interaction requirement actively triggered by the driver, the traffic information to obtain can be determined from what the driver said. For example, if the driver says "Chat with me", the corresponding traffic information may be the road condition information for the remaining route. The traffic information can then be consulted when generating interactive content. For example, chatting with the driver may be supported only when the vehicle is determined to be traveling on a road section with relatively simple road conditions.
As another example, when the driver is fatigued, hungry, impatient, and so on, points of interest can be determined as the corresponding traffic information. For fatigue, the points of interest may be hotels or service areas. For hunger, the points of interest may be restaurants. Impatience can be handled in the same way as the "Chat with me" request described above.
As a further example, for an interaction requirement triggered by the vehicle speed staying below the corresponding threshold, the corresponding traffic information may be other available routes found by a query. For an interaction requirement triggered by an abnormal vehicle component, the corresponding traffic information may be nearby parking spots or nearby repair shops.
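The correspondence between triggers and traffic information described above amounts to a lookup table. The Python sketch below is one way to write it down; the `Trigger` enum and the string labels are hypothetical names chosen for illustration, and the table only restates the examples given in the text.

```python
from enum import Enum


class Trigger(Enum):
    """Hypothetical labels for the triggers named in the description."""
    CHAT_REQUEST = "chat_request"
    FATIGUE = "fatigue"
    HUNGER = "hunger"
    IMPATIENCE = "impatience"
    LOW_SPEED = "low_speed"
    COMPONENT_FAULT = "component_fault"


# Kind of traffic information to fetch for each trigger (illustrative labels).
TRAFFIC_INFO_FOR_TRIGGER = {
    Trigger.CHAT_REQUEST: "road_conditions_of_remaining_route",
    Trigger.IMPATIENCE: "road_conditions_of_remaining_route",  # same handling as a chat request
    Trigger.FATIGUE: "poi:hotel_or_service_area",
    Trigger.HUNGER: "poi:restaurant",
    Trigger.LOW_SPEED: "alternative_routes",
    Trigger.COMPONENT_FAULT: "poi:parking_or_repair",
}


def traffic_info_kind(trigger: Trigger) -> str:
    """Look up which kind of traffic information a trigger calls for."""
    return TRAFFIC_INFO_FOR_TRIGGER[trigger]
```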
Screening of the traffic information can cover the following cases. When the traffic information consists of multiple navigation paths with road condition information, the navigation path with relatively simple road conditions can be taken as the screening result. When the traffic information includes multiple candidate points of interest, a target point of interest can be selected as the screening result based on the driver's preferences or the ratings of each point of interest. The interactive content may be formed by embedding the screening result of the traffic information in a predetermined script that matches the interaction requirement. It may also be summary information generated from the screening result.
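The two screening cases and the script-embedding step above can be sketched in Python as follows. Everything here is an assumption made for illustration: the `complexity` score (lower means simpler road conditions), the `tags` and `rating` fields, and the wording of the templates do not come from the disclosure.

```python
from typing import Dict, List, Set

# Hypothetical predetermined scripts, one per kind of interaction requirement.
TEMPLATES = {
    "navigation": "I found a simpler route: {name}.",
    "poi": "How about {name}? It is rated {rating} out of 5.",
}


def pick_simplest_route(routes: List[Dict]) -> Dict:
    """Screen navigation paths: keep the one with the lowest complexity score."""
    return min(routes, key=lambda r: r["complexity"])


def pick_poi(candidates: List[Dict], preferred_tags: Set[str]) -> Dict:
    """Screen points of interest: prefer matches with the driver's
    preferences, then break ties by rating."""
    return max(
        candidates,
        key=lambda p: (len(preferred_tags & set(p["tags"])), p["rating"]),
    )


def compose(kind: str, result: Dict) -> str:
    """Embed the screening result in the script that matches the requirement."""
    return TEMPLATES[kind].format(**result)
```

For instance, `compose("navigation", pick_simplest_route(routes))` fills the navigation script with the name of the selected path.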
Driving parameters can be generated from the interactive content and used to drive the digital human in the interactive interface to broadcast the interactive content. A digital human can be regarded as a product of digital character technology combined with artificial intelligence technology. It may be a virtual character with a digital appearance that exists on a display device. It may have a human appearance (including a cartoon image) and character traits such as a specific look, gender, and personality. In response to control instructions, the digital human can output speech, movements, and so on. The interactive interface containing the digital human may be overlaid on the original picture of the head unit screen as a translucent floating layer. The interactive interface may include multiple display areas. For example, as shown in Figure 2, it may include a digital human area, an interactive content display area, and an additional content display area. The digital human area contains at least one digital human, which can be driven by the driving parameters to show expressions and different mouth shapes that match the interactive content being broadcast. The additional content display area can be used to show additional content such as candidate roads and introductions to candidate points of interest, and the interactive content display area can be used to show the interaction history over a recent period of time.
Through the above process, a terminal with an intelligent interaction function, such as an in-vehicle head unit, can actively detect interaction requirements. The terminal can then determine traffic information for the interaction requirement and assist the driver in different ways.
As shown in Figure 3, in one implementation, the interaction requirement involved in step S101 may be obtained through the following process:
S301: parsing the obtained reference information to obtain a parsing result; the reference information includes at least one of user information and driving information of the current vehicle, the driving information includes vehicle parameters detected by vehicle sensors, and the user information includes at least one of the user's voice information and action information;
S302: when the parsing result satisfies a predetermined condition, determining that an interaction requirement is detected; the interaction requirement includes at least one of a navigation requirement, a companionship requirement, or a point of interest recommendation requirement.
Taking the case where the user is the driver as an example, the reference information may be information collected with the driver's authorization. For example, the reference information may be driver information, specifically video information, image information, and the like that contain the driver and are captured by an image acquisition device. The driver's action information can then be determined from the video information and/or image information. The reference information may also be the driver's voice information captured by a sound pickup device. As another example, the reference information may be the driving information of the current vehicle, which may include vehicle parameters detected by vehicle sensors, such as vehicle speed, remaining fuel, tire pressure, engine parameters, gearbox parameters, electric motor parameters, and battery parameters.
By parsing the acquired reference information and judging whether the predetermined condition is satisfied, interaction requirements can be detected automatically. For example, voice information containing an explicit instruction such as "turn on interactive mode" or "chat with me" indicates that the driver has an interaction requirement. Alternatively, detecting from the video information and/or image information that the driver makes a specific action may also indicate that the predetermined condition is satisfied, that is, that the driver has an interaction requirement. It will be understood that these instructions and specific actions are determined in advance. In these cases, the interaction requirement can be determined to be a companionship requirement.
As another example, the predetermined condition may also be determined to be satisfied when images, video, audio, or the like show that the driver is anxious, fatigued, or hungry. In this case, the interaction requirement can be determined to be a companionship requirement or a point-of-interest recommendation requirement. For a point-of-interest recommendation requirement, the corresponding point of interest can be selected according to the driver's specific situation, for example a movie theater, a restaurant, or the driver's home.
As yet another example, when the vehicle parameters show that the vehicle speed is 0, or that the vehicle speed has stayed below a corresponding threshold for a relatively long period, this may indicate traffic congestion, and the predetermined condition can be determined to be satisfied. In this case, the interaction requirement can be determined to be a navigation requirement, that is, a requirement to navigate along another path to avoid the congestion.
As a further example, the predetermined condition may also be determined to be satisfied when the vehicle parameters show that the vehicle has a fault. In this case, the interaction requirement can be determined to be a point-of-interest recommendation requirement. The point of interest may be an automobile sales and service 4S store (Automobile Sales Service Shop 4S), a service area, a vehicle repair point, or the like.
Through the above process, the corresponding interaction requirement can be determined based on driver information, vehicle information, and the like, so that more diversified services can be provided to the driver.
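The detection logic of steps S301 and S302 can be sketched as a small rule table. This is only an illustrative sketch: the `ReferenceInfo` fields, the phrase list, the 10-second value, and the mapping of anxiety to companionship versus fatigue and hunger to point-of-interest recommendation are all assumptions made for the example, not details given in the disclosure.

```python
from dataclasses import dataclass, field
from enum import Enum


class Requirement(Enum):
    NAVIGATION = "navigation"
    COMPANIONSHIP = "companionship"
    POI_RECOMMENDATION = "poi_recommendation"


@dataclass
class ReferenceInfo:
    """Already-parsed reference information (all field names are hypothetical)."""
    utterance: str = ""                               # recognized driver speech
    driver_states: set = field(default_factory=set)   # e.g. {"fatigue"}
    low_speed_seconds: float = 0.0                    # time spent below a speed threshold
    fault_detected: bool = False                      # a vehicle parameter is abnormal


# Assumed predetermined conditions.
COMPANION_PHRASES = ("turn on interactive mode", "chat with me")
LOW_SPEED_SECONDS = 10.0


def detect_requirements(info: ReferenceInfo) -> set:
    """Return every interaction requirement whose predetermined condition is met."""
    found = set()
    text = info.utterance.lower()
    if any(phrase in text for phrase in COMPANION_PHRASES):
        found.add(Requirement.COMPANIONSHIP)
    if "anxiety" in info.driver_states:
        found.add(Requirement.COMPANIONSHIP)
    if info.driver_states & {"fatigue", "hunger"}:
        found.add(Requirement.POI_RECOMMENDATION)
    if info.low_speed_seconds > LOW_SPEED_SECONDS:
        found.add(Requirement.NAVIGATION)
    if info.fault_detected:
        found.add(Requirement.POI_RECOMMENDATION)
    return found
```

Returning a set reflects that the interaction requirement "includes at least one of" the three kinds, so several may be detected at once.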
As shown in FIG. 4, in one implementation, when the interaction requirement includes a navigation requirement, step S102 may include the following process:
S401: According to the navigation requirement, screen out from the traffic information a target navigation path that satisfies a specified condition, the specified condition being determined according to at least one of the driving mileage of the navigation path and the driving time of the navigation path;
S402: Generate the interactive content according to the target navigation path.
When the interaction requirement includes a navigation requirement, the traffic information may consist of multiple navigation paths to the destination. The result of screening the traffic information may be a target navigation path selected from those navigation paths.
The screening may be based on at least one of the driving mileage, the driving time, and the travel cost of the navigation path. The travel cost may include fuel costs, electricity costs, tolls, and the like. For example, weights can be assigned to the different screening factors. Illustratively, the weight of the driving time is greater than the weight of the driving mileage, and the weight of the driving mileage is greater than the weight of the travel cost. For the determined target path, only its travel time, mileage, and the like may be retained.
With reference to FIG. 5, the interactive content for the target path may be: "Based on the current traffic conditions, more and more vehicles are expected to converge on the congested section ahead. I have found a new route for you via XXX Road. It is 2 minutes faster and about the same distance. Would you like to switch?" A traditional navigation interface, when recommending a route to the user, usually also shows navigation guidance information, toll station information, remaining mileage, remaining time, the recommended congestion-avoidance route, and a comparison of the routes' time, tolls, traffic lights, and mileage. For a driver who wants to know whether the congestion ahead is worsening and whether to change route, such information is too scattered to support a quick decision.
In the current implementation, more concise information helps the driver decide quickly.
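The weighted screening described above can be illustrated with a small scoring function. This is a sketch under stated assumptions: the `Route` type, the concrete weight values (chosen only so that time > mileage > cost), and the normalization by the largest value among the candidates are all hypothetical choices for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    name: str
    minutes: float   # driving time
    km: float        # driving mileage
    cost: float      # travel cost (fuel, electricity, tolls)


# Assumed weights: driving time > driving mileage > travel cost.
WEIGHTS = {"minutes": 0.6, "km": 0.3, "cost": 0.1}


def select_target_route(routes):
    """Return the candidate with the lowest weighted, normalized score."""
    if not routes:
        raise ValueError("no candidate routes")
    # Normalize each factor by its maximum so the factors are comparable.
    maxima = {key: max(getattr(r, key) for r in routes) for key in WEIGHTS}

    def score(route):
        return sum(
            weight * (getattr(route, key) / maxima[key] if maxima[key] else 0.0)
            for key, weight in WEIGHTS.items()
        )

    return min(routes, key=score)
```

With two toll-free candidates of 30 min / 12 km and 28 min / 12.5 km, the second one is selected because the time saving outweighs the slightly longer distance, which matches the "2 minutes faster, about the same distance" style of message in the example dialogue.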
As shown in FIG. 6, in one implementation, the way the navigation requirement is confirmed may include the following process:
S601: According to the vehicle parameters, determine the speed condition of the current vehicle within a predetermined time period;
S602: According to the speed condition, determine the congestion level;
S603: When the congestion level meets a preset congestion standard, determine that the interaction requirement is a navigation requirement.
The navigation requirement can be confirmed based on the vehicle speed contained in the vehicle parameters. For example, if during driving the vehicle speed stays at 0 for longer than a predetermined time (for example, 10 seconds) for a reason other than a traffic light, or stays below a corresponding first speed threshold (for example, 5 km/h) for longer than the predetermined time, the congestion level can be determined to be "severe". As another example, if the vehicle speed stays between the first speed threshold and a second speed threshold (for example, 15 km/h) for longer than the predetermined time, the congestion level can be determined to be "moderate".
The navigation requirement can be determined based on the congestion level. For example, when the congestion level is higher than "moderate", the interaction requirement can be determined to be a navigation requirement.
Through the above process, the navigation requirement can be determined based on the vehicle speed.
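Steps S601 to S603 can be sketched as follows, using the example values from the text (10 seconds, 5 km/h, 15 km/h). The sample format, the function names, and the simplification of classifying a window by its highest observed speed are assumptions made for this illustration.

```python
from enum import IntEnum


class Congestion(IntEnum):
    NONE = 0
    MODERATE = 1
    SEVERE = 2


FIRST_SPEED_KMH = 5.0    # first speed threshold (example value from the text)
SECOND_SPEED_KMH = 15.0  # second speed threshold (example value from the text)
MIN_DURATION_S = 10.0    # predetermined time (example value from the text)


def congestion_level(samples, at_traffic_light=False):
    """Classify a window of (timestamp_s, speed_kmh) samples, oldest first."""
    if at_traffic_light or len(samples) < 2:
        return Congestion.NONE
    if samples[-1][0] - samples[0][0] < MIN_DURATION_S:
        return Congestion.NONE  # window shorter than the predetermined time
    top_speed = max(speed for _, speed in samples)
    if top_speed < FIRST_SPEED_KMH:
        return Congestion.SEVERE    # includes standing still at 0 km/h
    if top_speed < SECOND_SPEED_KMH:
        return Congestion.MODERATE
    return Congestion.NONE


def is_navigation_requirement(level):
    """Assumed congestion standard: any level higher than MODERATE."""
    return level > Congestion.MODERATE
```

Using an ordered enumeration keeps the "higher than moderate" comparison explicit, so a different congestion standard only requires changing one line.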
As shown in FIG. 7, in one implementation, when the interaction requirement includes a companionship requirement, step S102 may include the following process:
S701: According to the target navigation path in the traffic information and the corresponding road conditions, determine a safe driving section, the safe driving section being a section whose road-condition complexity is below a corresponding complexity threshold;
S702: Determine the driving time of the current vehicle in the safe driving section;
S703: Use artificial-intelligence-based content generation technology (AIGC) to generate interactive content that matches the driving time.
When the interaction requirement includes a companionship requirement, the target navigation path and the road conditions corresponding to it can be obtained from the traffic information. For example, when driving on a highway or expressway, the exit can be identified in the remaining route. If the distance to the exit exceeds a corresponding distance threshold, the driver only needs to stay on the main road (the highway or expressway) and does not need to perform complex operations. The road-condition complexity can therefore be considered low, and the section can be treated as a safe driving section whose road-condition complexity is below the corresponding complexity threshold. The driving time in the safe driving section can be estimated from the distance and the road conditions.
As another example, when the vehicle is waiting at a red light, the remaining duration of the red light can be obtained from the road conditions. If the remaining duration is greater than a corresponding duration threshold, the vehicle can be regarded as being in a safe driving section whose road-condition complexity is below the corresponding complexity threshold. That is, when the red light lasts a long time, the driver does not need to perform driving operations, so the road-condition complexity can be considered low. The section corresponding to the current stopping position can thus be determined to be a safe driving section, and the time until the red light ends can be taken as the driving time in the safe driving section.
As yet another example, when the traffic jam is relatively severe and no candidate path exists, the congested section can also be determined to be a safe driving section whose road-condition complexity is below the corresponding complexity threshold. Correspondingly, the time needed to reach the end of the traffic jam can be taken as the driving time in the safe driving section.
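The three scenarios above (long distance to the exit, long red light, severe jam with no alternative) can be sketched as one function that returns the estimated driving time in the safe driving section. The `RoadState` fields, the threshold values, and the constant-speed time estimate are hypothetical and serve only to make steps S701 and S702 concrete.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RoadState:
    """Road conditions along the target navigation path (hypothetical fields)."""
    on_highway: bool = False
    km_to_exit: float = 0.0
    speed_kmh: float = 0.0
    red_light_remaining_s: float = 0.0
    jam_clear_s: float = 0.0            # estimated time to reach the end of the jam
    has_alternative_route: bool = True


EXIT_DISTANCE_KM = 5.0   # assumed distance threshold
RED_LIGHT_S = 30.0       # assumed duration threshold


def safe_driving_seconds(state: RoadState) -> Optional[float]:
    """Return the time available in a safe driving section, or None if there is none."""
    if state.red_light_remaining_s > RED_LIGHT_S:
        return state.red_light_remaining_s
    if state.jam_clear_s > 0 and not state.has_alternative_route:
        return state.jam_clear_s
    if state.on_highway and state.km_to_exit > EXIT_DISTANCE_KM and state.speed_kmh > 0:
        # Rough estimate: distance to the exit at the current speed.
        return state.km_to_exit * 3600 / state.speed_kmh
    return None
```

The returned duration is what a content generator would then be asked to fill in step S703; a `None` result means no companionship content should be started.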
According to the driving time in the safe driving section, AIGC can be used to generate topics for interacting with the driver. The topic may be a currently trending search topic, a topic determined from the driver's user portrait, or the like. Alternatively, with reference to FIG. 8, the topic can also be determined from a question raised by the driver, for example a question about the result of a sports event or about celebrity anecdotes. AIGC refers to the use of artificial intelligence technology to generate multimedia content automatically, for example writing poetry, composing music, or generating news. AIGC can generate multimedia content based on trending events on the Internet or based on the inquiries received from the user.
During multi-round interaction with the user, the interaction strategy can also be determined according to the user portrait. The user portrait may include the user's long-term, mid-term, and short-term preferences. For example, long-term preferences may include the user's interests and occupation. The mid-term preference may be the purpose of the current trip, such as tourism or attending a conference. The short-term preference may be the user's current emotional needs, such as the current mood and the tolerance for being disturbed. When generating content, priority can be given to the short-term, mid-term, and long-term preferences in that order, which improves the user experience.
Through the above process, when the driver needs companionship, topics of interest to the driver can be generated to help the user get through a relatively monotonous drive. Moreover, the companionship can be based on the road conditions, thereby safeguarding the driver's safe driving to the greatest extent.
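The short-term, mid-term, long-term priority order can be illustrated with a simple fallback chain. The `UserProfile` type, its string-valued fields, and the default value are assumptions for this sketch; a real user portrait would be considerably richer.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class UserProfile:
    """Simplified user portrait (hypothetical structure)."""
    long_term: Optional[str] = None    # e.g. interests, occupation
    mid_term: Optional[str] = None     # e.g. purpose of the current trip
    short_term: Optional[str] = None   # e.g. current mood, tolerance for disturbance


def pick_content_basis(profile: UserProfile, default: str = "trending topic") -> str:
    """Choose the preference that content generation should prioritize."""
    # Priority order from the text: short-term, then mid-term, then long-term.
    for preference in (profile.short_term, profile.mid_term, profile.long_term):
        if preference:
            return preference
    return default
```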
As shown in FIG. 9, in one implementation, when the interaction requirement includes a point-of-interest recommendation requirement, step S102 may include the following process:
S901: Determine the point-of-interest type corresponding to the point-of-interest recommendation requirement;
S902: According to the point-of-interest type, screen out a target point of interest from the candidate points of interest contained in the traffic information;
S903: Generate the interactive content according to the target point of interest.
Point-of-interest recommendation requirements may be of several kinds. For example, detecting that the driver yawns or blinks frequently may indicate that the driver is fatigued. On this basis, the point-of-interest recommendation requirement can be determined to be a need for rest. With reference to FIG. 10, the point-of-interest type may be a rest-and-leisure type, such as a service area. Correspondingly, the point-of-interest retrieval function can be used to obtain from the network, and display, information about the shops in the service area. In the example shown in FIG. 10, the service area information can be introduced to the user according to the gas station brand the user often visits. The introduction can also take the user's preferences into account, such as favorite types of food and preferred restaurants.
As another example, when a vehicle abnormality is detected, the kind of point-of-interest recommendation requirement can be determined from the abnormality. For example, when a fuel vehicle is low on fuel, gas stations can be used as the point-of-interest type. When a new energy vehicle is low on battery, charging piles (stations) can be used as the point-of-interest type. When certain vehicle components are faulty, 4S stores or repair points can be used as the point-of-interest type.
According to the point-of-interest type, the corresponding target point of interest is screened out from the candidate points of interest contained in the traffic information. For example, when the point-of-interest type is gas station, the target gas station can be selected as the target point of interest according to the driver's preferred gas station brand. Similarly, when the point-of-interest type is hotel, the target hotel can be selected according to the hotel's reputation and reviews, the driver's preferred hotel brand, and the like. Other target points of interest are selected in a similar way, which is not repeated here.
Through the above process, the most suitable point of interest can be selected automatically as the interaction result according to the detected interaction requirement.
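Steps S901 and S902 can be sketched as a filter by type followed by a preference-aware ranking. The `Poi` fields and the tie-breaking by detour distance are assumptions for this example; the text only states that the driver's preferred brand (or reputation and reviews) guides the choice.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Poi:
    name: str
    kind: str          # point-of-interest type, e.g. "gas station"
    brand: str
    detour_km: float   # extra distance needed to reach it (hypothetical field)


def select_target_poi(candidates, kind, preferred_brands=()):
    """Pick the target point of interest of the given type, or None if none matches."""
    matching = [poi for poi in candidates if poi.kind == kind]
    if not matching:
        return None
    # Preferred brands sort first (False < True); the shorter detour breaks ties.
    return min(matching, key=lambda poi: (poi.brand not in preferred_brands, poi.detour_km))
```

For a hotel recommendation, the same shape would apply with a rating or review score added to the sort key.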
As shown in FIG. 11, in one implementation, the way the point-of-interest recommendation requirement is determined may include the following process:
S1101: According to the user information, determine the type of the user's physiological state;
S1102: When the type of the physiological state is a predetermined type, determine that the interaction requirement is a point-of-interest recommendation requirement, the predetermined type including at least one of fatigue, hunger, an urgent need for a restroom, and illness.
In one implementation, the point-of-interest recommendation requirement can be determined on the basis of driver information. For example, different physiological states of the driver (that is, the user) can be determined from the results of detecting the driver's images, video, voice, and the like.
For example, when the driver blinks or yawns frequently, the type of the driver's physiological state can be determined to be fatigue. In this case, the interaction requirement can be determined to include a point-of-interest recommendation requirement, with hotels, service areas, and the like as the points of interest to be recommended.
When the driver expresses hunger or similar information, the type of the driver's physiological state can be determined to be hunger. In this case, the interaction requirement can be determined to include a point-of-interest recommendation requirement, with restaurants and the like as the points of interest to be recommended.
When the driver expresses a wish to go to the restroom, the type of the driver's physiological state can be determined to be an urgent need for a restroom. In this case, the interaction requirement can be determined to include a point-of-interest recommendation requirement, with service areas, shopping malls, and the like as the points of interest to be recommended.
When the driver is detected frequently frowning, drawing in breath, repeatedly touching the same part of the body with a hand, or the like, the type of the driver's physiological state can be determined to be illness. In this case, the interaction requirement can be determined to include a point-of-interest recommendation requirement, with service areas, hospitals, and the like as the points of interest to be recommended.
Through the above process, the driver's state can be determined from the detection results for the driver, and the point-of-interest recommendation requirement can then be confirmed.
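The mapping from physiological state to recommended point-of-interest types listed above can be written as a lookup table. The state keys and the function name are labels chosen for this sketch; the type lists themselves follow the examples in the text.

```python
# Predetermined physiological states and the point-of-interest types
# that the text gives as examples for each of them.
POI_TYPES_BY_STATE = {
    "fatigue": ("hotel", "service area"),
    "hunger": ("restaurant",),
    "restroom": ("service area", "shopping mall"),
    "illness": ("service area", "hospital"),
}


def poi_types_for_state(state: str) -> tuple:
    """Return the POI types to recommend; an empty tuple means no requirement."""
    return POI_TYPES_BY_STATE.get(state, ())


def is_poi_requirement(state: str) -> bool:
    """S1102: the requirement exists only for a predetermined state type."""
    return state in POI_TYPES_BY_STATE
```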
As shown in FIG. 12, in one implementation, the way the point-of-interest recommendation requirement is determined may include the following process:
S1201: According to the vehicle parameters, determine the condition of the current vehicle;
S1202: When the vehicle condition reaches a fault standard, determine that the interaction requirement is a point-of-interest recommendation requirement.
The vehicle parameters can be used to determine whether the current vehicle's condition is normal. When a vehicle parameter is outside its predetermined normal operating range, the vehicle condition can be determined to have reached the fault standard. On this basis, the interaction requirement can be determined to be a point-of-interest recommendation requirement.
Further, the corresponding point of interest can be selected according to the fault type. For example, as mentioned above, if the fuel level is too low, gas stations are used as the points of interest; if the battery level is too low, charging piles (stations) are used as the points of interest.
Through the above process, possible vehicle faults can be anticipated, thereby safeguarding the normal driving of the vehicle.
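Steps S1201 and S1202, together with the fault-type-specific selection, can be sketched as a range check over the vehicle parameters. The parameter names and normal ranges below are invented for illustration and are not values from the disclosure; the fallback to a repair shop for other faults follows the 4S store or repair point example given earlier.

```python
# Assumed normal operating ranges (inclusive); names and values are hypothetical.
NORMAL_RANGES = {
    "fuel_pct": (15.0, 100.0),
    "battery_pct": (20.0, 100.0),
    "tire_pressure_kpa": (200.0, 280.0),
}

# Fault-specific POI types from the text; any other fault maps to a repair shop.
POI_BY_PARAM = {"fuel_pct": "gas station", "battery_pct": "charging station"}


def poi_types_for_vehicle(params: dict) -> list:
    """Return POI types for every parameter outside its normal range."""
    types = []
    for name, value in params.items():
        limits = NORMAL_RANGES.get(name)
        if limits is None:
            continue  # no normal range known for this parameter
        low, high = limits
        if not low <= value <= high:
            poi_type = POI_BY_PARAM.get(name, "repair shop")
            if poi_type not in types:
                types.append(poi_type)
    return types
```

An empty result means the vehicle condition has not reached the fault standard, so no point-of-interest recommendation requirement is raised.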
As shown in FIG. 13, the present disclosure relates to an intelligent interaction method that is executed by an input layer, a computing layer, and an output layer.
The main function of the input layer is to receive triggers, which are either input by the user (driver) or raised proactively based on detected information. The input layer may include a speech recognition module, an audio-video fusion emotion recognition module, a scene discrimination module, and the like.
The speech recognition module is based on speech recognition technology (Automatic Speech Recognition, ASR). In scenarios where the user actively wakes the system, it recognizes the user's speech and converts it into text. The speech recognition module supports full-duplex, multi-round communication.
The audio-video fusion emotion recognition module is used, with the user's authorization, to understand the user's current state through voice-based and vision-based emotion recognition, for example fatigue recognition and emotion classification (anger, relaxation, joy, boredom, and so on).
The scene discrimination module decides on proactive triggering based on preset rules, such as whether the user is in congestion, whether the congestion is severe, whether the user is passing a scenic spot or rest area, and whether the user has been driving for a long time and needs a rest.
Between the input layer and the computing layer there is also requirement parsing. Requirement parsing relies on building user portrait capabilities, in order to understand the user's long-term (interests, occupation, and so on), mid-term (purpose of the trip, and so on), and short-term (current mood and tolerance for being disturbed) needs and preferences. The requirement is understood based on the user's input or on the triggering of preset rules.
The main function of the computing layer is to serve the user's requirements and carry out the background computation. The computing layer may include a traffic brain module, a knowledge graph module, a map point-of-interest module, a retrieval module, and the like.
The traffic brain module can be used for road condition prediction, dynamic event sensing, and the like, providing the user with rich traffic information for the route. Dynamic event sensing may cover road construction, road control, road accidents, and the like.
The knowledge graph module is used to provide chat information. For example, when the user wants to learn about certain news, gossip, entertainment, or sports information, the knowledge graph module integrates, computes, and outputs content based on that information.
The map point-of-interest module is used to provide information related to points of interest. When the user's requirement relates to a geographic location, for example when the user wants information about the destination, the map point-of-interest module provides relevant content such as business hours, price levels, and reviews.
The retrieval module can be used to search for points of interest. When the user's requirement relates to finding a geographic location, for example when the user wants to refuel or rest nearby, the retrieval module serves the user.
The main function of the output layer is to present content to the user according to the result of the requirement parsing and the results output by the computing layer. The output layer may include a navigation product service module, an automatic content generation module, a voice broadcast module, a 3D virtual human synthesis module, and the like.
The navigation product service module is used, when the user's requirement relates to operating the navigation product page, to adjust and change the product service directly for the user, for example enlarging the base map or switching to heading-up mode.
The automatic content generation module produces content automatically, taking the user's preference settings into account, when spoken content needs to be provided to the user.
The voice broadcast module is used to convert the text generated by the automatic content generation module into speech. To make the spoken content sound more human and give it emotional and personality traits, the voice broadcast module can use voice samples from real speakers for model training, and then convert the text generated by the automatic content generation module into speech.
The 3D virtual human synthesis module can drive the 3D virtual human according to the content to be played, so that the 3D virtual human broadcasts the content.
As shown in FIG. 14, the present disclosure relates to an intelligent interaction apparatus, which may include:
a traffic information acquisition module 1401, configured to acquire corresponding traffic information according to a detected interaction requirement;
an interactive content generation module 1402, configured to generate interactive content according to the interaction requirement and the result of screening the traffic information;
a digital human driving module 1403, configured to generate driving parameters based on the interactive content, the driving parameters including parameters for driving the digital human in the interaction interface to broadcast the interactive content.
In one implementation, the traffic information acquisition module 1401 may include:
a parsing sub-module, configured to parse the acquired reference information to obtain a parsing result; the reference information includes at least one of user information and driving information of the current vehicle, the driving information includes vehicle parameters detected by vehicle sensors, and the user information includes at least one of voice information and action information of the user;
an interaction requirement determination execution sub-module, configured to determine that an interaction requirement is detected when the parsing result satisfies a predetermined condition, the interaction requirement including at least one of a navigation requirement, a companionship requirement, or a point-of-interest recommendation requirement.
In one implementation, when the interaction requirement includes a navigation requirement, the interactive content generation module 1402 may include:
a target navigation path determination sub-module, configured to screen out from the traffic information, according to the navigation requirement, a target navigation path that satisfies a specified condition, the specified condition being determined according to at least one of the driving mileage of the navigation path and the driving time of the navigation path;
an interactive content generation execution sub-module, configured to generate the interactive content according to the target navigation path.
In one implementation, the interaction requirement determination execution sub-module may include:
a vehicle speed condition determination unit, configured to determine, according to the vehicle parameters, the speed condition of the current vehicle within a predetermined time period;
a congestion level determination unit, configured to determine the congestion level according to the speed condition;
an interaction requirement determination unit, configured to determine that the interaction requirement is a navigation requirement when the congestion level meets a preset congestion standard.
In one implementation, when the interaction requirement includes a companionship requirement, the interactive content generation module 1402 may include:
A safe driving section determination sub-module, configured to determine a safe driving section based on the target navigation path and the corresponding road conditions in the traffic information, where a safe driving section is a road section whose road condition complexity is lower than the corresponding complexity threshold;
A driving time determination sub-module, configured to determine the driving time of the current vehicle on the safe driving section;
An interactive content generation execution sub-module, configured to generate interactive content that matches the driving time by using AI-generated content (AIGC) technology.
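As a non-limiting illustration, the determination of safe driving sections and the matching of content to the driving time might be sketched as follows. The complexity score, its threshold, and the data types are assumptions for this sketch. The AIGC generation step is replaced here by a simple stand-in that picks the longest pre-existing item fitting the available time; it is not the generation technique of the disclosure.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class RoadSegment:
    length_km: float
    expected_speed_kmh: float
    complexity: float  # hypothetical road-condition complexity score in [0, 1]


@dataclass(frozen=True)
class ContentItem:
    title: str
    duration_min: float


def safe_segments(route: List[RoadSegment], threshold: float = 0.4) -> List[RoadSegment]:
    """Keep segments whose complexity is below the (assumed) threshold."""
    return [s for s in route if s.complexity < threshold]


def travel_time_min(segment: RoadSegment) -> float:
    """Estimate the driving time on a segment in minutes."""
    if segment.expected_speed_kmh <= 0:
        raise ValueError("expected speed must be positive")
    return segment.length_km / segment.expected_speed_kmh * 60.0


def pick_content(candidates: List[ContentItem], available_min: float) -> Optional[ContentItem]:
    """Stand-in for AIGC: choose the longest item that fits the available time."""
    fitting = [c for c in candidates if c.duration_min <= available_min]
    return max(fitting, key=lambda c: c.duration_min) if fitting else None
```

A generative model could instead be prompted with the available duration as a length constraint; the selection function above only shows where that constraint enters.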
In one implementation, when the interaction requirement includes a point-of-interest recommendation requirement, the interactive content generation module 1402 may include:
A point-of-interest type determination sub-module, configured to determine the point-of-interest type corresponding to the point-of-interest recommendation requirement;
A target point-of-interest screening sub-module, configured to screen out target points of interest, according to the point-of-interest type, from the candidate points of interest included in the traffic information;
An interactive content generation execution sub-module, configured to generate the interactive content according to the target points of interest.
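As a non-limiting illustration, the screening of target points of interest by type might be sketched as follows. The `PointOfInterest` fields, the ranking by detour distance, and the result limit are assumptions for this sketch; the disclosure only requires that target points of interest be screened from the candidates according to the point-of-interest type.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class PointOfInterest:
    name: str
    poi_type: str      # e.g. "restaurant", "rest_area" (hypothetical labels)
    detour_km: float   # hypothetical extra distance from the current route


def filter_target_pois(
    candidates: List[PointOfInterest], poi_type: str, limit: int = 3
) -> List[PointOfInterest]:
    """Keep candidates of the requested type, nearest detour first."""
    matching = [p for p in candidates if p.poi_type == poi_type]
    return sorted(matching, key=lambda p: p.detour_km)[:limit]


def describe_pois(pois: List[PointOfInterest]) -> str:
    """Turn the screened points of interest into text for the interactive content."""
    if not pois:
        return "No matching places were found nearby."
    names = ", ".join(f"{p.name} ({p.detour_km:.1f} km)" for p in pois)
    return f"Nearby options: {names}."
```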
In one implementation, the interaction requirement determination execution sub-module may include:
A type determination unit, configured to determine the type of the user's physiological state based on the user information;
An interaction requirement determination unit, configured to determine that the interaction requirement is a point-of-interest recommendation requirement when the type of the physiological state is a predetermined type, where the predetermined type includes at least one of fatigue, hunger, an urgent need to use a restroom, and illness.
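As a non-limiting illustration, the check against the predetermined types might be sketched as a lookup table. The state labels and the point-of-interest type assigned to each state are assumptions for this sketch, and the recognition of the physiological state from voice or action information is outside its scope.

```python
from typing import Optional

# Hypothetical mapping from a predetermined physiological state to a POI type.
PREDETERMINED_STATE_TO_POI_TYPE = {
    "fatigue": "rest_area",
    "hunger": "restaurant",
    "restroom_need": "restroom",
    "illness": "hospital",
}


def poi_type_for_state(state: str) -> Optional[str]:
    """Return a POI type if the state is a predetermined type, otherwise None."""
    return PREDETERMINED_STATE_TO_POI_TYPE.get(state)


def is_poi_recommendation_requirement(state: str) -> bool:
    """A predetermined state triggers a point-of-interest recommendation requirement."""
    return poi_type_for_state(state) is not None
```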
In one implementation, the interaction requirement determination execution sub-module may include:
A vehicle condition determination unit, configured to determine the vehicle condition of the current vehicle based on the vehicle parameters;
An interaction requirement determination unit, configured to determine that the interaction requirement is a point-of-interest recommendation requirement when the vehicle condition meets a fault standard.
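As a non-limiting illustration, the vehicle condition check might be sketched as follows. The chosen parameters, the thresholds, and the fault labels are assumptions for this sketch; the disclosure does not specify which vehicle parameters or fault standard are used.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class VehicleParameters:
    """Hypothetical subset of sensor readings."""
    tire_pressure_kpa: float
    fuel_fraction: float      # remaining fuel or charge, from 0.0 to 1.0
    coolant_temp_c: float


def detect_faults(params: VehicleParameters) -> List[str]:
    """Return the faults whose (illustrative) threshold is crossed."""
    faults = []
    if params.tire_pressure_kpa < 180:
        faults.append("low_tire_pressure")
    if params.fuel_fraction < 0.1:
        faults.append("low_fuel")
    if params.coolant_temp_c > 110:
        faults.append("engine_overheating")
    return faults


def needs_poi_recommendation(params: VehicleParameters) -> bool:
    """Any detected fault is treated as meeting the (assumed) fault standard."""
    return bool(detect_faults(params))
```

The detected fault could then select the point-of-interest type, for example a repair shop or a fuel station, before the screening step described earlier.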
In the technical solutions of the present disclosure, the acquisition, storage, and application of the user personal information involved comply with relevant laws and regulations and do not violate public order and good customs.
According to embodiments of the present disclosure, the present disclosure further provides an electronic device, a readable storage medium, and a computer program product.
FIG. 15 shows a schematic block diagram of an example electronic device 1500 that may be used to implement embodiments of the present disclosure. The electronic device is intended to represent various forms of digital computers, such as laptop computers, desktop computers, workstations, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers. The electronic device may also represent various forms of mobile devices, such as personal digital assistants, cellular phones, smartphones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions are examples only and are not intended to limit implementations of the disclosure described and/or claimed herein.
As shown in FIG. 15, the device 1500 includes a computing unit 1510, which can perform various appropriate actions and processing according to a computer program stored in a read-only memory (ROM) 1520 or a computer program loaded from a storage unit 1580 into a random access memory (RAM) 1530. The RAM 1530 may also store various programs and data required for the operation of the device 1500. The computing unit 1510, the ROM 1520, and the RAM 1530 are connected to one another via a bus 1540. An input/output (I/O) interface 1550 is also connected to the bus 1540.
Multiple components of the device 1500 are connected to the I/O interface 1550, including: an input unit 1560, such as a keyboard or a mouse; an output unit 1570, such as various types of displays and speakers; a storage unit 1580, such as a magnetic disk or an optical disc; and a communication unit 1590, such as a network card, a modem, or a wireless communication transceiver. The communication unit 1590 allows the device 1500 to exchange information/data with other devices through a computer network such as the Internet and/or various telecommunication networks.
The computing unit 1510 may be any of various general-purpose and/or special-purpose processing components having processing and computing capabilities. Some examples of the computing unit 1510 include, but are not limited to, a central processing unit (CPU), a graphics processing unit (GPU), various dedicated artificial intelligence (AI) computing chips, various computing units that run machine learning model algorithms, a digital signal processor (DSP), and any appropriate processor, controller, or microcontroller. The computing unit 1510 performs the methods and processing described above, such as the intelligent interaction method. For example, in some embodiments, the intelligent interaction method may be implemented as a computer software program that is tangibly embodied in a machine-readable medium, such as the storage unit 1580. In some embodiments, part or all of the computer program may be loaded and/or installed onto the device 1500 via the ROM 1520 and/or the communication unit 1590. When the computer program is loaded into the RAM 1530 and executed by the computing unit 1510, one or more steps of the intelligent interaction method described above may be performed. Alternatively, in other embodiments, the computing unit 1510 may be configured to perform the intelligent interaction method in any other suitable manner (for example, by means of firmware).
Various implementations of the systems and techniques described above may be realized in digital electronic circuit systems, integrated circuit systems, field programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), application-specific standard products (ASSPs), systems on a chip (SOCs), complex programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various implementations may include implementation in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor. The programmable processor may be a special-purpose or general-purpose programmable processor, and may receive data and instructions from a storage system, at least one input device, and at least one output device, and transmit data and instructions to the storage system, the at least one input device, and the at least one output device.
Program code for implementing the methods of the present disclosure may be written in any combination of one or more programming languages. The program code may be provided to a processor or controller of a general-purpose computer, a special-purpose computer, or another programmable data processing apparatus, such that the program code, when executed by the processor or controller, causes the functions/operations specified in the flowcharts and/or block diagrams to be implemented. The program code may execute entirely on a machine, partly on a machine, as a stand-alone software package partly on a machine and partly on a remote machine, or entirely on a remote machine or server.
In the context of the present disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. The machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of the machine-readable storage medium include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide interaction with a user, the systems and techniques described herein may be implemented on a computer having: a display device (for example, a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user; and a keyboard and a pointing device (for example, a mouse or a trackball) through which the user can provide input to the computer. Other kinds of devices may also be used to provide interaction with the user; for example, the feedback provided to the user may be any form of sensory feedback (for example, visual feedback, auditory feedback, or tactile feedback), and input from the user may be received in any form (including acoustic input, voice input, or tactile input).
The systems and techniques described herein may be implemented in a computing system that includes a back-end component (for example, a data server), or a computing system that includes a middleware component (for example, an application server), or a computing system that includes a front-end component (for example, a user computer having a graphical user interface or a web browser through which the user can interact with implementations of the systems and techniques described herein), or a computing system that includes any combination of such back-end, middleware, or front-end components. The components of the system may be interconnected by any form or medium of digital data communication (for example, a communication network). Examples of communication networks include a local area network (LAN), a wide area network (WAN), and the Internet.
A computer system may include a client and a server. The client and the server are generally remote from each other and typically interact through a communication network. The client-server relationship arises from computer programs that run on the respective computers and have a client-server relationship with each other. The server may be a cloud server, a server of a distributed system, or a server combined with a blockchain.
It should be understood that steps may be reordered, added, or deleted using the various forms of flows shown above. For example, the steps described in the present disclosure may be executed in parallel, sequentially, or in a different order, as long as the desired results of the technical solutions disclosed in the present disclosure can be achieved; no limitation is imposed herein.
The above specific implementations do not limit the scope of protection of the present disclosure. Those skilled in the art should understand that various modifications, combinations, sub-combinations, and substitutions may be made according to design requirements and other factors. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present disclosure shall fall within the scope of protection of the present disclosure.

Claims (19)

  1. An intelligent interaction method, comprising:
    acquiring corresponding traffic information according to a detected interaction requirement;
    generating interactive content according to the interaction requirement and a screening result of the traffic information; and
    generating driving parameters based on the interactive content, the driving parameters comprising parameters for driving a digital human in an interactive interface to broadcast the interactive content.
  2. The method according to claim 1, wherein the interaction requirement is determined by:
    parsing acquired reference information to obtain a parsing result, wherein the reference information comprises at least one of user information and driving information of a current vehicle, the driving information comprises vehicle parameters detected by vehicle sensors, and the user information comprises at least one of voice information and action information of a user; and
    determining that an interaction requirement is detected when the parsing result satisfies a predetermined condition, wherein the interaction requirement comprises at least one of a navigation requirement, a companionship requirement, or a point-of-interest recommendation requirement.
  3. The method according to claim 2, wherein, when the interaction requirement comprises a navigation requirement, generating the interactive content according to the interaction requirement and the screening result of the traffic information comprises:
    screening out, from the traffic information and according to the navigation requirement, a target navigation path that satisfies a specified condition, wherein the specified condition is determined based on at least one of a driving mileage of the navigation path and a driving time of the navigation path; and
    generating the interactive content according to the target navigation path.
  4. The method according to claim 2 or 3, wherein the navigation requirement is determined by:
    determining, according to the vehicle parameters, a vehicle speed condition of the current vehicle within a predetermined time period;
    determining a congestion level according to the vehicle speed condition; and
    determining that the interaction requirement is a navigation requirement when the congestion level meets a preset congestion standard.
  5. The method according to claim 2, wherein, when the interaction requirement comprises a companionship requirement, generating the interactive content according to the interaction requirement and the screening result of the traffic information comprises:
    determining a safe driving section according to a target navigation path and corresponding road conditions in the traffic information, wherein the safe driving section is a road section whose road condition complexity is lower than a corresponding complexity threshold;
    determining a driving time of the current vehicle on the safe driving section; and
    generating, by using AI-generated content (AIGC) technology, interactive content that matches the driving time.
  6. The method according to claim 2, wherein, when the interaction requirement comprises a point-of-interest recommendation requirement, generating the interactive content according to the interaction requirement and the screening result of the traffic information comprises:
    determining a point-of-interest type corresponding to the point-of-interest recommendation requirement;
    screening out target points of interest, according to the point-of-interest type, from candidate points of interest included in the traffic information; and
    generating the interactive content according to the target points of interest.
  7. The method according to claim 2 or 6, wherein the point-of-interest recommendation requirement is determined by:
    determining a type of a physiological state of the user according to the user information; and
    determining that the interaction requirement is a point-of-interest recommendation requirement when the type of the physiological state is a predetermined type, wherein the predetermined type comprises at least one of fatigue, hunger, an urgent need to use a restroom, and illness.
  8. The method according to claim 2 or 6, wherein the point-of-interest recommendation requirement is determined by:
    determining a vehicle condition of the current vehicle according to the vehicle parameters; and
    determining that the interaction requirement is a point-of-interest recommendation requirement when the vehicle condition meets a fault standard.
  9. An intelligent interaction apparatus, comprising:
    a traffic information acquisition module, configured to acquire corresponding traffic information according to a detected interaction requirement;
    an interactive content generation module, configured to generate interactive content according to the interaction requirement and a screening result of the traffic information; and
    a digital human driving module, configured to generate driving parameters based on the interactive content, the driving parameters comprising parameters for driving a digital human in an interactive interface to broadcast the interactive content.
  10. The apparatus according to claim 9, wherein the traffic information acquisition module comprises:
    a parsing sub-module, configured to parse acquired reference information to obtain a parsing result, wherein the reference information comprises at least one of user information and driving information of a current vehicle, the driving information comprises vehicle parameters detected by vehicle sensors, and the user information comprises at least one of voice information and action information of a user; and
    an interaction requirement determination execution sub-module, configured to determine that an interaction requirement is detected when the parsing result satisfies a predetermined condition, wherein the interaction requirement comprises at least one of a navigation requirement, a companionship requirement, or a point-of-interest recommendation requirement.
  11. The apparatus according to claim 10, wherein, when the interaction requirement comprises a navigation requirement, the interactive content generation module comprises:
    a target navigation path determination sub-module, configured to screen out, from the traffic information and according to the navigation requirement, a target navigation path that satisfies a specified condition, wherein the specified condition is determined based on at least one of a driving mileage of the navigation path and a driving time of the navigation path; and
    an interactive content generation execution sub-module, configured to generate the interactive content according to the target navigation path.
  12. The apparatus according to claim 10 or 11, wherein the interaction requirement determination execution sub-module comprises:
    a vehicle speed condition determination unit, configured to determine, according to the vehicle parameters, a vehicle speed condition of the current vehicle within a predetermined time period;
    a congestion level determination unit, configured to determine a congestion level according to the vehicle speed condition; and
    an interaction requirement determination unit, configured to determine that the interaction requirement is a navigation requirement when the congestion level meets a preset congestion standard.
  13. The apparatus according to claim 10, wherein, when the interaction requirement comprises a companionship requirement, the interactive content generation module comprises:
    a safe driving section determination sub-module, configured to determine a safe driving section according to a target navigation path and corresponding road conditions in the traffic information, wherein the safe driving section is a road section whose road condition complexity is lower than a corresponding complexity threshold;
    a driving time determination sub-module, configured to determine a driving time of the current vehicle on the safe driving section; and
    an interactive content generation execution sub-module, configured to generate, by using AI-generated content (AIGC) technology, interactive content that matches the driving time.
  14. The apparatus according to claim 10, wherein, when the interaction requirement comprises a point-of-interest recommendation requirement, the interactive content generation module comprises:
    a point-of-interest type determination sub-module, configured to determine a point-of-interest type corresponding to the point-of-interest recommendation requirement;
    a target point-of-interest screening sub-module, configured to screen out target points of interest, according to the point-of-interest type, from candidate points of interest included in the traffic information; and
    an interactive content generation execution sub-module, configured to generate the interactive content according to the target points of interest.
  15. The apparatus according to claim 10 or 14, wherein the interaction requirement determination execution sub-module comprises:
    a type determination unit, configured to determine a type of a physiological state of the user according to the user information; and
    an interaction requirement determination unit, configured to determine that the interaction requirement is a point-of-interest recommendation requirement when the type of the physiological state is a predetermined type, wherein the predetermined type comprises at least one of fatigue, hunger, an urgent need to use a restroom, and illness.
  16. The apparatus according to claim 10 or 14, wherein the interaction requirement determination execution sub-module comprises:
    a vehicle condition determination unit, configured to determine a vehicle condition of the current vehicle according to the vehicle parameters; and
    an interaction requirement determination unit, configured to determine that the interaction requirement is a point-of-interest recommendation requirement when the vehicle condition meets a fault standard.
  17. An electronic device, comprising:
    at least one processor; and
    a memory communicatively connected to the at least one processor; wherein
    the memory stores instructions executable by the at least one processor, and the instructions, when executed by the at least one processor, enable the at least one processor to perform the method according to any one of claims 1 to 8.
  18. A non-transitory computer-readable storage medium storing computer instructions, wherein the computer instructions are used to cause a computer to perform the method according to any one of claims 1 to 8.
  19. A computer program product, comprising a computer program/instructions, wherein the computer program/instructions, when executed by a processor, implement the steps of the method according to any one of claims 1 to 8.
PCT/CN2022/110117 2022-03-18 2022-08-03 Intelligent interaction method and apparatus, device, and storage medium WO2023173657A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210273184.3 2022-03-18
CN202210273184.3A CN116798254A (en) 2022-03-18 2022-03-18 Intelligent interaction method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2023173657A1 true WO2023173657A1 (en) 2023-09-21

Family

ID=88022194

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/110117 WO2023173657A1 (en) 2022-03-18 2022-08-03 Intelligent interaction method and apparatus, device, and storage medium

Country Status (2)

Country Link
CN (1) CN116798254A (en)
WO (1) WO2023173657A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117422002A (en) * 2023-12-19 2024-01-19 利尔达科技集团股份有限公司 AIGC-based embedded product generation method, system and storage medium
CN118051944A (en) * 2024-02-19 2024-05-17 浙江掌心互动信息技术有限公司 Method and system for realizing personalized customization of generated content by AIGC technology

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118293927B (en) * 2024-06-06 2024-08-20 青岛理工大学 Visual-voice navigation method and system with enhanced knowledge graph

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8022831B1 (en) * 2008-01-03 2011-09-20 Pamela Wood-Eyre Interactive fatigue management system and method
CN109766405A (en) * 2019-03-06 2019-05-17 路特迩科技(杭州)有限公司 Traffic and travel information service system and method based on electronic map
CN110126843A (en) * 2019-05-17 2019-08-16 北京百度网讯科技有限公司 Driving service recommendation method, device, equipment and medium
CN111611402A (en) * 2020-05-15 2020-09-01 广东新快易通智能信息发展有限公司 Driving behavior knowledge graph generation method, device and system based on position
CN113212448A (en) * 2021-04-30 2021-08-06 恒大新能源汽车投资控股集团有限公司 Intelligent interaction method and device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117422002A (en) * 2023-12-19 2024-01-19 利尔达科技集团股份有限公司 AIGC-based embedded product generation method, system and storage medium
CN117422002B (en) * 2023-12-19 2024-04-19 利尔达科技集团股份有限公司 AIGC-based embedded product generation method, AIGC-based embedded product generation system and storage medium
CN118051944A (en) * 2024-02-19 2024-05-17 浙江掌心互动信息技术有限公司 Method and system for realizing personalized customization of generated content by AIGC technology

Also Published As

Publication number Publication date
CN116798254A (en) 2023-09-22

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application. Ref document number: 22931679; Country of ref document: EP; Kind code of ref document: A1