WO2024051592A1 - Vehicle control method and control apparatus - Google Patents

Vehicle control method and control apparatus

Info

Publication number
WO2024051592A1
Authority
WO
WIPO (PCT)
Prior art keywords
vehicle
sound source
voice command
voice
located inside
Prior art date
Application number
PCT/CN2023/116496
Other languages
French (fr)
Chinese (zh)
Inventor
彭超
李凌飞
沈波
肖龙帅
任亮亮
张跃
石子娟
吴雪晨
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Publication of WO2024051592A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0464 Convolutional networks [CNN, ConvNet]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/16 Speech classification or search using artificial neural networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue

Definitions

  • the present application relates to the field of intelligent terminal technology, and in particular, to a vehicle control method and control device.
  • The vehicle's voice collection system collects the voice commands issued by the user, performs semantic recognition to obtain voice control commands, and then sends the voice control commands to the control unit, which controls the vehicle to perform the corresponding actions.
  • Once a voice command is collected, the vehicle performs semantic recognition on it and directly executes the recognized control command. For example, if the driver and passengers park the vehicle on the roadside and rest inside it, and a person outside the vehicle gives the voice command "open the door", the vehicle will open the door based on that command, even though the occupants did not want the door opened at that time. Situations like this pose a serious risk to the personal and property safety of the driver and passengers in the vehicle.
  • This application provides a vehicle control method and control device, which can control the vehicle more intelligently and improve the safety of the vehicle.
  • A vehicle control method is provided, including: obtaining a voice instruction to control the vehicle; determining whether the sound source position of the voice instruction is located inside the vehicle; and, when the sound source position is located inside the vehicle, controlling the vehicle to perform the operation corresponding to the voice instruction.
  • In the vehicle control method, after a voice instruction for controlling the vehicle is received, the instruction is not executed directly; instead, it is first determined whether the sound source position of the voice instruction is located inside the vehicle.
  • the control operation corresponding to the voice command is only executed when the sound source of the voice command is located inside the vehicle. This makes the control of the vehicle more intelligent, improves the safety of the vehicle, and ensures the personal and property safety of passengers in the vehicle.
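The gating step described above can be sketched as follows. This is a minimal illustration only: the function name `handle_voice_instruction` and the two callables `is_source_inside` and `execute` are hypothetical and stand in for whatever localization and control logic a real system would use.

```python
from typing import Callable, Optional


def handle_voice_instruction(
    audio: bytes,
    is_source_inside: Callable[[bytes], bool],
    execute: Callable[[bytes], str],
) -> Optional[str]:
    """Execute a voice instruction only if its sound source is inside the vehicle.

    `is_source_inside` and `execute` are hypothetical hooks supplied by the caller.
    Returns the result of `execute`, or None when the instruction is not executed.
    """
    if not is_source_inside(audio):
        # Sound source is outside the vehicle: do not respond to the instruction.
        return None
    return execute(audio)
```

For example, `handle_voice_instruction(b"open the door", lambda a: False, lambda a: "door opened")` returns `None`, because the hypothetical localization hook reports an exterior source.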
  • This application can help users in the car effectively avoid control and interference of the vehicle by strangers outside the car, without causing misoperation of the vehicle, further ensuring the safety of people in the car, especially in resting scenes.
  • the voice command can perform status control, media control, air conditioning control, seat control, etc. on the vehicle.
  • The voice command can be a vehicle status control instruction such as opening/closing windows (including sunroofs), opening/closing doors (locking/unlocking doors), opening/closing the trunk, ambient light adjustment, vehicle locking, vehicle ignition/stop, vehicle speed adjustment, or rearview mirror adjustment.
  • the voice command can also be media control commands such as opening navigation, adjusting the central control screen, playing music, tuning the radio, answering/making calls, etc.
  • the voice command can also be air conditioning control commands such as raising the temperature, lowering the temperature, starting internal circulation, purifying the air in the car, etc.
  • the voice command can also be a seat control command such as turning on seat heating, raising/lowering the seat height, moving the seat forward/backward, adjusting the seat back angle, etc.
  • the voice command can also be a wake-up command used to wake up the voice assistant of the vehicle.
  • the wake-up command can be a default command set by the manufacturer.
  • The voice command can be a wake-up phrase such as "Yi Xiaoyi", used to wake up the intelligent voice assistant Xiaoyi.
  • The wake-up command can also be customized by the user according to personal preferences.
  • The vehicle's voice assistant can also be in wake-up-free mode, that is, the voice assistant remains awake at all times so that the vehicle can promptly obtain instructions for controlling the vehicle's status, such as "open the door", which is convenient for voice control of the vehicle.
  • Various data related to the vehicle can be obtained through sensors such as in-car cameras, exterior cameras, radars, laser rangefinders, or seat pressure sensors, and whether the sound source position is located inside the vehicle can be determined based on one or more of these data.
  • This application does not specifically limit the specific implementation method of determining whether the sound source position of the voice command is located inside the vehicle.
  • the acquired data can be combined with a pre-built-in algorithm to determine whether the sound source is located inside the vehicle.
  • the above data can be input into a neural network model, which analyzes the data and determines whether the sound source location is inside the vehicle.
  • the neural network model is obtained by model training with historical data (training data) and a deep learning algorithm.
  • the neural network model can be, for example, a convolutional neural network (CNN) model.
  • The deep learning algorithm can be, for example, a machine learning algorithm or a meta-learning algorithm.
  • Whether the sound source position of the voice instruction is located inside the vehicle can be determined based on one or more of the following data, without limitation: audio characteristic data of the voice instruction collected by the microphone; pickup intensity data of the voice instruction collected by the microphone; in-car image data collected by the in-car camera; in-car sound data collected by the microphone; seat pressure data collected by the seat sensor; in-car vibration data collected by the vibration sensor; touch data of the in-car touch screen collected by the touch sensor; vehicle speed data collected by the vehicle speed sensor; window opening and closing data detected by the glass position detection sensor; exterior image data collected by the exterior camera; and obstacle data outside the vehicle collected by the radar or laser rangefinder. With the further development of intelligent vehicles and sensor technology, the content and types of such data may expand.
  • determining whether the sound source position of the voice instruction is located inside the vehicle includes: determining whether the sound source position is located inside the vehicle according to audio characteristics of the voice instruction.
  • certain characteristic parameters of the audio characteristics of the voice command can be compared with a preset parameter threshold, and whether the sound source position is located inside the vehicle is determined based on the comparison result.
  • the acquired audio characteristics can be combined with a pre-built-in algorithm to determine whether the sound source is located inside the vehicle.
  • the audio feature data of the voice instruction can be input into a neural network model, and the neural network model analyzes the feature data and determines whether the sound source position is located inside the vehicle.
  • the neural network model is obtained by model training using historical data (training data) and a deep learning algorithm.
  • the neural network model may be, for example, a CNN model, and the deep learning algorithm may be, for example, a machine learning algorithm or a meta-learning algorithm.
  • the audio characteristics here may include any characteristic parameters such as spectral structure, energy, amplitude, frequency, period, etc., but are not limited to this.
  • the audio features include spectral structure.
  • When sound passes through an obstacle, the energy in the high-frequency band of its spectrum structure is absorbed, and the high-frequency part is significantly attenuated.
  • By contrast, the high-frequency part of the spectrum structure of normal sound emitted inside the car, which has not passed through obstacles, is preserved relatively completely. Under the same circumstances, there will therefore be differences in spectrum structure between voice commands issued outside the car and voice commands issued inside the car, and these differences can be used to determine whether the sound source position is located inside the vehicle.
  • the advantage of using the spectrum structure to determine whether the sound source position is located inside the vehicle is that the confirmation method is simple, efficient and easy to implement without resorting to other data, which can reduce the frequency of use of other sensors in the vehicle.
  • Determining whether the sound source position of the voice instruction is located inside the vehicle includes: determining, based on the characteristics of the high-frequency part of the spectrum structure of the voice instruction, whether the sound source position is located inside the vehicle; and, when the integrity of the high-frequency part meets the preset conditions, determining that the sound source position is located inside the vehicle.
  • Voice commands issued outside the car and voice commands issued inside the car differ noticeably in the integrity of the high-frequency part. The characteristics of the high-frequency part can therefore be used to determine whether the sound source position of the voice command is located inside the vehicle: when the integrity of the high-frequency part meets the preset conditions, it is determined that the sound source position is located inside the vehicle.
  • the advantage of determining whether the sound source position is located inside the vehicle through the characteristics of the high-frequency part of the spectrum structure is that the confirmation method is simple, efficient and easy to implement without resorting to other data, which can reduce the frequency of use of other sensors in the vehicle.
  • The high-frequency part of the spectrum structure can be input into a pre-trained neural network model (such as a CNN model). The neural network model analyzes the spectrum structure and outputs a judgment result indicating whether the sound source position is located inside the vehicle.
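As a simpler alternative to a trained model, the threshold comparison on the high-frequency part can be sketched as below. This is only an illustration under stated assumptions: "integrity" is approximated here as the share of spectral energy at or above a cutoff frequency, and the 4000 Hz cutoff and 0.1 minimum ratio are made-up values, not figures from the application. A naive DFT is used so that the sketch needs only the standard library.

```python
import cmath
import math
from typing import Sequence


def high_frequency_ratio(samples: Sequence[float], sample_rate: int, cutoff_hz: float) -> float:
    """Return the share of spectral energy at or above cutoff_hz (naive DFT)."""
    n = len(samples)
    total = 0.0
    high = 0.0
    # For a real signal, bins 1..n/2 are sufficient; bin 0 (DC) is skipped.
    for k in range(1, n // 2 + 1):
        coeff = sum(
            samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)
        )
        energy = abs(coeff) ** 2
        total += energy
        if k * sample_rate / n >= cutoff_hz:
            high += energy
    return high / total if total > 0 else 0.0


def source_is_inside(
    samples: Sequence[float],
    sample_rate: int,
    cutoff_hz: float = 4000.0,  # assumed cutoff, not from the application
    min_ratio: float = 0.1,     # assumed "preset condition"
) -> bool:
    """Treat a well-preserved high-frequency part as evidence of an in-cabin source."""
    return high_frequency_ratio(samples, sample_rate, cutoff_hz) >= min_ratio
```

In practice the threshold would have to be calibrated per vehicle and microphone, and an FFT library would replace the quadratic-time DFT.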
  • Determining whether the sound source position of the voice instruction is located inside the vehicle includes: determining whether the sound source position is located inside the vehicle based on the pickup intensity of the voice instruction by a microphone of the vehicle.
  • The pickup intensity is related to the distance from the sound source location, so it can be used to determine whether the sound source location is inside the vehicle. For example, the specific location or the specific orientation of the sound source can be determined based on the pickup intensity of the voice command by the microphone, and it can then be further determined whether the sound source location is located inside the vehicle.
  • the vehicle may be equipped with only one microphone, and whether the sound source position is located inside the vehicle may be determined based on the pickup intensity of the voice command by the only microphone.
  • a vehicle may usually be equipped with multiple microphones, and the sound source location can be determined based on the pickup strength of each of the multiple microphones for voice commands, combined with a preset algorithm (such as a trained neural network model).
  • the plurality of microphones can be installed in the car. For example, they can be centrally installed near the center console of the vehicle in the form of a microphone array, or they can be distributed at different seats in the car.
  • the plurality of microphones may also include at least one external microphone for picking up voices outside the vehicle, so that the user can perform voice interaction with the vehicle outside the vehicle and obtain a better vehicle experience.
  • the vehicle is equipped with multiple in-vehicle microphones.
  • Based on the pickup strength of each of the multiple in-vehicle microphones for the voice command, combined with a preset algorithm, it can be directly determined whether the sound source position is located inside the vehicle.
  • Determining whether the sound source position is located inside the vehicle based on the pickup intensity of the voice instruction by a microphone of the vehicle includes: determining the orientation of the sound source location based on the pickup intensity of the voice instruction by multiple microphones; and determining, based on the orientation, whether the sound source location is located inside the vehicle.
  • the orientation of the sound source is determined based on the pickup intensity of the voice command by a pair of microphones, and with the help of a preset algorithm or logic, it is determined whether the sound source is located inside the vehicle through the orientation.
  • the sound source location is located outside the vehicle.
  • For example, if the microphone is installed on the center console of the car and the orientation points to the front of the vehicle, there are no seats in the cabin space at this orientation and it cannot accommodate passengers. Therefore, it can be determined that the sound source is located outside the vehicle.
  • After the orientation of the sound source location is determined, it can be further determined whether the car window at this orientation is closed. For example, if the car window has a strong sound insulation effect and the window is currently completely closed, it can be determined that the sound source location is inside the vehicle.
  • After the orientation of the sound source location is determined, the distribution of people inside and outside the vehicle at this orientation can be further determined, and whether the sound source location is located inside the vehicle can be determined based on that distribution. For example, if there are no people inside the vehicle at this orientation, it can be determined that the sound source is located outside the vehicle; if there are no people outside the vehicle at this orientation, it can be determined that the sound source is located inside the vehicle; and if there are people both inside and outside the vehicle at this orientation, other sensor data can be used to determine whether the sound source is located inside the vehicle.
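One way to picture the orientation step is an intensity-weighted direction estimate followed by a check for seats at that bearing. The sketch below is an assumption-laden illustration, not the application's algorithm: it assumes each microphone is associated with a bearing measured from the center console (0 degrees is the vehicle front), that stronger pickup pulls the estimate toward that microphone, and that seat bearings and the 30-degree tolerance are known calibration values. All names are hypothetical.

```python
import math
from typing import Iterable, Sequence, Tuple


def estimate_bearing(readings: Sequence[Tuple[float, float]]) -> float:
    """Estimate the source bearing in degrees from (mic_bearing_deg, intensity) pairs.

    Uses an intensity-weighted vector sum; 0 degrees is the vehicle front.
    """
    x = sum(w * math.cos(math.radians(b)) for b, w in readings)
    y = sum(w * math.sin(math.radians(b)) for b, w in readings)
    return math.degrees(math.atan2(y, x)) % 360.0


def seat_at_bearing(
    bearing: float, seat_bearings: Iterable[float], tolerance: float = 30.0
) -> bool:
    """Return True if any seat lies within `tolerance` degrees of the bearing."""
    for seat in seat_bearings:
        # Smallest angular difference, handling wrap-around at 360 degrees.
        diff = abs((bearing - seat + 180.0) % 360.0 - 180.0)
        if diff <= tolerance:
            return True
    return False
```

If no seat lies at the estimated bearing, the source can be treated as exterior, as in the center-console example above. A seat at that bearing does not by itself prove an in-cabin source; the window state and the distribution of people would still need to be checked.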
  • multiple factors can be combined to determine whether the sound source location of the voice command is located in the car.
  • The spectral structure of the aforementioned voice command can be combined with the pickup intensity to determine whether the sound source position of the voice command is located in the car. For example, if the spectrum structure of the voice command shows that the high-frequency part is poorly preserved and the pickup intensity of the voice command is weak, it is jointly determined that the sound source position of the voice command is outside the car. As another example, priorities can be set for the two factors: when the results determined by the two factors differ, the result determined by the factor with the higher priority prevails. Other methods that combine multiple factors can also refer to the examples here.
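The priority rule for conflicting factors can be sketched as follows. The representation is an assumption made for illustration: each factor contributes a `(priority, verdict)` pair, where the verdict is `True` for inside, `False` for outside, and `None` when that factor cannot decide.

```python
from typing import Optional, Sequence, Tuple


def combine_factors(votes: Sequence[Tuple[int, Optional[bool]]]) -> Optional[bool]:
    """Combine per-factor verdicts; on conflict, the highest-priority factor prevails.

    Each vote is (priority, verdict); larger priority numbers rank higher.
    Returns None when no factor reaches a verdict.
    """
    decided = [(p, v) for p, v in votes if v is not None]
    if not decided:
        return None
    verdicts = {v for _, v in decided}
    if len(verdicts) == 1:
        # All deciding factors agree.
        return verdicts.pop()
    # Factors disagree: take the verdict of the factor with the highest priority.
    return max(decided, key=lambda pv: pv[0])[1]
```

For instance, if the spectrum structure (priority 2) says outside and the pickup intensity (priority 1) says inside, `combine_factors([(2, False), (1, True)])` returns `False`.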
  • whether the sound source is located inside the vehicle can be determined by the distribution of people inside the vehicle.
  • Relevant data can be obtained through detection devices such as in-car cameras, seat pressure sensors, touch sensors, vibration sensors, magnetometers, and infrared sensors. Based on this data, personnel distribution information can be determined, such as whether there are passengers in the car, their orientation or specific location, and the specific seats they occupy. Whether the sound source position is located inside the vehicle is then determined based on the personnel distribution information.
  • If the personnel distribution information indicates that there are currently no drivers or passengers in the vehicle, it may be determined that the sound source location is located outside the vehicle rather than inside.
  • If the personnel distribution information indicates that there are drivers and passengers in the vehicle, other information can be used to jointly determine whether the sound source position is located inside the vehicle.
  • the determination can be further made in combination with the audio feature data of the voice command collected by the microphone or the pickup (sound pickup) intensity data of the voice command collected by the microphone, which will be further described below.
  • whether the sound source is located inside the vehicle can be determined by the distribution of people outside the vehicle.
  • relevant data can be obtained through detection devices such as cameras, radars, and laser rangefinders outside the vehicle, and based on this data, it can be determined whether there are people within the preset range near the outside of the vehicle, as well as personnel distribution information such as their orientation or specific location. Finally, it is determined whether the sound source is located inside the vehicle based on the personnel distribution information.
  • This preset range can be determined based on the sound pickup capability of the microphone equipped in the vehicle. The stronger the sound pickup capability, it means that voice commands issued by people at a greater distance may also be picked up, so the preset range should also be larger. The manufacturer can reasonably set the preset range based on the specific configuration of the microphone before the vehicle leaves the factory.
  • If the personnel distribution information indicates that there are no people within the current preset range outside the vehicle, it can be determined that the sound source position is located inside the vehicle.
  • If the personnel distribution information indicates that there are people within the current preset range outside the vehicle, other information can be used to jointly determine whether the sound source location is located inside the vehicle.
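The two occupancy rules above can be read as a three-valued decision. The sketch below is a hypothetical illustration; in particular, checking the cabin first when nobody is detected in either place is an assumption of this sketch, since the text does not say which rule takes precedence.

```python
from typing import Optional


def inside_by_occupancy(people_inside: bool, people_outside_in_range: bool) -> Optional[bool]:
    """Return True (inside), False (outside), or None (undecided) from occupancy data.

    `people_outside_in_range` refers to the preset range around the vehicle.
    """
    if not people_inside:
        # Nobody in the cabin: the source is taken to be outside.
        return False
    if not people_outside_in_range:
        # Occupants present and nobody nearby outside: the source is inside.
        return True
    # People both inside and outside: other information is needed.
    return None
```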
  • the driving speed of the vehicle can be used to determine whether the sound source is located inside the vehicle.
  • For example, if the driving speed is greater than or equal to 80 kilometers per hour (km/h) when the voice command for controlling the vehicle is obtained, it can be determined that the sound source position of the voice command is located inside the vehicle.
  • the vehicle speed threshold can be 60 to 100km/h, for example, 70km/h or 80km/h.
  • the manufacturer can reasonably set the vehicle speed threshold based on the microphone's sound pickup ability before the vehicle leaves the factory.
  • If the vehicle's driving speed is less than the vehicle speed threshold, other information can be used to jointly determine whether the sound source position is located inside the vehicle.
  • the microphone in the car may not be able to pick up the voice command outside the car, so it can be determined that the sound source position of the voice command is located inside the vehicle.
  • Whether the sound source is located inside the vehicle can be determined through the opening and closing of the vehicle's windows.
  • Relevant data can be obtained through the window position detection sensor to determine the current status (ie, opening and closing status) of each window of the vehicle, for example, it can be fully open, half open, or fully closed.
  • Car window glass has a certain sound insulation effect. When the car window is closed, the microphone in the car may not be able to accurately pick up voice commands from outside the car. However, when the car window is open, the microphone in the car can pick up voice commands from outside the car. It can therefore be determined whether the sound source is located inside the vehicle based on the opening and closing of the vehicle's windows.
  • the microphone cannot receive the voice outside the vehicle at this time, and it can be determined that the sound source position of the voice command is located inside the vehicle.
  • the vehicle can be, for example, some high-end cars, and its sound insulation effect is particularly outstanding.
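The speed and window cues can each be sketched as a check that either confirms an in-cabin source or leaves the question open. The names below are hypothetical, the 80 km/h default is taken from the example threshold in the text, and the `strong_insulation` flag is an assumed per-vehicle configuration value.

```python
from enum import Enum
from typing import Iterable, Optional


class WindowState(Enum):
    FULLY_OPEN = "fully_open"
    HALF_OPEN = "half_open"
    FULLY_CLOSED = "fully_closed"


def inside_by_speed(speed_kmh: float, threshold_kmh: float = 80.0) -> Optional[bool]:
    """At or above the speed threshold the source is taken to be inside; else undecided."""
    return True if speed_kmh >= threshold_kmh else None


def inside_by_windows(windows: Iterable[WindowState], strong_insulation: bool) -> Optional[bool]:
    """With all windows fully closed and strong sound insulation, the source is inside."""
    all_closed = all(w is WindowState.FULLY_CLOSED for w in windows)
    return True if (all_closed and strong_insulation) else None
```

Both functions return `None` rather than `False` when the cue is inconclusive, which matches the text's instruction to fall back on other information in that case.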
  • When the voice instruction includes a wake-up instruction, controlling the vehicle to perform operations corresponding to the voice instruction includes: waking up a voice assistant of the vehicle.
  • Controlling the vehicle to perform operations corresponding to the voice instruction includes: performing semantic recognition on the voice instruction to obtain a control instruction; and controlling the vehicle according to the control instruction.
  • the voice command when it is determined that the sound source is located inside the vehicle, the voice command will be semantically parsed to generate the corresponding control command.
  • When the sound source is located outside the vehicle, the voice command may not be semantically parsed, which helps save the computing power of the processor.
  • the method further includes: controlling the vehicle not to execute the voice instruction when the sound source position is located outside the vehicle.
  • the voice command may have been spoken by someone outside the vehicle (such as a stranger, a passerby, or a criminal) and accidentally picked up by the microphone.
  • the voice command may not be semantically recognized or executed, that is, the voice command may not be responded to, thereby improving the safety of vehicle control and ensuring the personal and property safety of passengers in the vehicle.
  • The method further includes: when the sound source position is located outside the vehicle, if it is determined that the preset trigger condition is met, controlling the vehicle to execute the operation corresponding to the voice command.
  • the triggering conditions can be pre-set by the manufacturer, or can be set by the user.
  • the user can set the conditions based on the scenarios in which the user may voice control the vehicle outside the vehicle.
  • The triggering condition may be that the electronic key of the vehicle is located near the vehicle, that the vehicle owner is determined to be near the vehicle through the positioning function of a mobile terminal such as a mobile phone, that the current time falls within a special time period (such as commuting hours), or that the vehicle has entered a specific mode (for example, camping mode); this application does not limit this.
  • For example, if the vehicle can establish a Bluetooth connection with the electronic key, it can be determined that the electronic key is near the vehicle, that is, the owner of the vehicle may also be located near the vehicle. The trigger condition is then satisfied, and the vehicle can be controlled to perform the operation corresponding to the control instruction.
  • the method further includes: when the sound source is located outside the vehicle, sending inquiry information as to whether to execute the voice command.
  • an inquiry message is sent as to whether to execute the voice command, instead of directly not executing the voice command as in the previous embodiment.
  • the control method provided by the embodiments of this application can not only improve the safety of vehicle voice control, but also increase the control flexibility, which is beneficial to improving the user experience.
  • the query information here may be sound information, image information, text information, etc.
  • The inquiry information here can be voice inquiry information, issued through the vehicle's speaker device, asking whether to execute the voice command, or it can be image inquiry information displayed on the central control screen, instrument screen, or head-up display (HUD) device inside the vehicle; this application does not limit this.
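The handling options for an exterior sound source (ignore, execute on a trigger condition, or ask the occupants) can be summarized in one decision function. This is a sketch under assumptions: the `Action` enum and the `ask_when_outside` switch are hypothetical names, and how the embodiments are combined in a real vehicle is not specified in the text.

```python
from enum import Enum


class Action(Enum):
    EXECUTE = "execute"  # perform the operation corresponding to the voice command
    IGNORE = "ignore"    # do not respond to the voice command
    ASK = "ask"          # send inquiry information on whether to execute


def decide_action(inside: bool, trigger_met: bool = False, ask_when_outside: bool = False) -> Action:
    """Choose how to handle a voice command given where its sound source is.

    `trigger_met` models a preset trigger condition such as the electronic key
    being near the vehicle; `ask_when_outside` selects the inquiry embodiment.
    """
    if inside or trigger_met:
        return Action.EXECUTE
    return Action.ASK if ask_when_outside else Action.IGNORE
```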
  • A vehicle control device is provided, including: an acquisition unit, used to acquire a voice instruction for controlling the vehicle; a determination unit, used to determine whether the sound source position of the voice instruction is located inside the vehicle; and a control unit, used to control the vehicle to perform operations corresponding to the voice instruction when the sound source is located inside the vehicle.
  • the determining unit is specifically configured to determine whether the sound source position is located inside the vehicle according to the audio characteristics of the voice instruction.
  • the audio features include spectral structure.
  • The determining unit is specifically configured to: determine whether the sound source position of the voice command is located inside the vehicle according to the characteristics of the high-frequency part of the spectrum structure of the voice command; and, when the integrity of the high-frequency part meets the preset conditions, determine that the sound source position is located inside the vehicle.
  • the determining unit is specifically configured to determine whether the sound source position is located inside the vehicle according to the pickup intensity of the voice command by the vehicle's microphone.
  • The determining unit is specifically configured to: determine the orientation of the sound source location based on the pickup intensity of the voice instruction by multiple microphones; and determine, based on the orientation, whether the sound source location is located inside or outside the vehicle.
  • The determining unit determines whether the sound source position is located inside the vehicle through at least one of the following information: the audio characteristics of the voice instruction; the pickup intensity of the voice command; the distribution of people inside the vehicle; the distribution of people within the preset range outside the vehicle; the driving speed of the vehicle; the opening and closing conditions of the windows of the vehicle.
  • When the voice command includes a wake-up command, the control unit is specifically configured to wake up the voice assistant of the vehicle.
  • The control unit is specifically configured to: perform semantic analysis on the voice instruction to obtain a control instruction; and control the vehicle according to the control instruction.
  • When the sound source is located outside the vehicle, the control unit is further configured to control the vehicle not to execute the voice instruction.
  • When the sound source is located outside the vehicle, if it is determined that the preset trigger condition is met, the control unit is further configured to control the vehicle to execute the operation corresponding to the voice command.
  • The control device further includes: a sending unit configured to send query information on whether to execute the voice command when the sound source is located outside the vehicle.
  • A vehicle control device is provided, including at least one processor, the at least one processor being coupled to a memory and reading and executing instructions in the memory to implement the method in any one of the possible implementations of the foregoing first aspect.
  • control device further includes the memory.
  • A computer-readable storage medium is provided.
  • A computer program is stored on the computer-readable storage medium; when the computer program is executed, the method provided in any one of the possible implementations of the first aspect can be realized.
  • A chip system is provided, including a processor for calling and running a computer program from a memory, so that a computer installed with the chip system executes the method provided in any one of the possible implementations of the first aspect.
  • A computer program product is provided, including computer program code. When the computer program code is run on a computer, it causes the computer to execute the method provided in any implementation of the first aspect.
  • The above computer program code can be stored in whole or in part on a first storage medium, where the first storage medium can be packaged together with the processor or packaged separately from the processor. This application does not specifically limit this.
  • a seventh aspect provides a vehicle, including the vehicle control device provided by any of the possible implementations of the second aspect and the third aspect.
  • the vehicles are all used to execute the control method provided in the first aspect. Therefore, the beneficial effects that can be achieved can be referred to the beneficial effects in the corresponding methods provided above, and will not be described again here.
  • FIG. 1 is a schematic diagram of an application scenario provided by an embodiment of the present application.
  • FIG. 2 is a logic block diagram of the current voice control of the vehicle.
  • FIG. 3 is a functional block diagram of a vehicle provided by an embodiment of the present application.
  • FIG. 4 is a schematic flowchart of an example of a vehicle control method provided by an embodiment of the present application.
  • FIG. 5 is a schematic diagram of an application scenario of the control method shown in FIG. 4.
  • FIG. 6 is a schematic diagram of the propagation process of control instructions.
  • FIG. 7 is a schematic flowchart of another example of a vehicle control method provided by an embodiment of the present application.
  • FIG. 8 is a schematic diagram of an application scenario of the control method shown in FIG. 7.
  • FIG. 9 is a schematic flowchart of another example of a vehicle control method provided by an embodiment of the present application.
  • FIG. 10 is a schematic diagram of an application scenario of the control method shown in FIG. 9.
  • FIG. 11 is a schematic block diagram of a vehicle control device provided by an embodiment of the present application.
  • FIG. 12 is a schematic structural diagram of a vehicle control device provided by an embodiment of the present application.
  • the terms "first" and "second" are used for descriptive purposes only and cannot be understood as indicating or implying relative importance or implicitly indicating the quantity of the indicated technical features. Therefore, features defined as "first" and "second" may explicitly or implicitly include one or more of these features. In the description of the embodiments of this application, unless otherwise specified, "plurality" means two or more.
  • FIG. 1 is a schematic diagram of an application scenario provided by an embodiment of the present application.
  • the application scenario includes a vehicle 11 with a driver 12 inside. If the voice control function of the vehicle 11 is turned on, the occupants, including the driver 12, can control the vehicle through voice commands.
  • the driver 12 can set the navigation or change the FM radio station through voice commands while driving, without looking at the center console to operate it manually. This frees the driver's hands and keeps the driver's attention on driving, which helps improve driving safety.
  • FIG. 2 is a logic block diagram of the current voice control of the vehicle.
  • the vehicle's voice collection system 21 collects the voice instructions issued by the user, performs semantic recognition to obtain the voice control instructions, and then sends the voice control instructions to the control unit 22.
  • the control unit 22 will form a control instruction that can be recognized by the control object 23 based on the voice control instruction, and control the control object 23 according to the control instruction to control the vehicle to perform corresponding actions.
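  • The flow in FIG. 2 can be pictured with a minimal Python sketch. The phrase table, the `ControlInstruction` type, and the function names are hypothetical stand-ins for semantic recognition and the control unit; they do not come from the application. Note that this baseline flow never checks where the voice came from.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass(frozen=True)
class ControlInstruction:
    target: str  # control object, e.g. "window"
    action: str  # action for that object, e.g. "open"


# Hypothetical phrase table standing in for real semantic recognition.
PHRASES: Dict[str, ControlInstruction] = {
    "open the window": ControlInstruction("window", "open"),
    "play music": ControlInstruction("media", "play"),
}


def recognize(utterance: str) -> Optional[ControlInstruction]:
    """Voice collection system 21: turn recognized text into a control instruction."""
    return PHRASES.get(utterance.strip().lower())


def handle_utterance(
    utterance: str, control_objects: Dict[str, Callable[[str], str]]
) -> Optional[str]:
    """Control unit 22: dispatch the instruction to the matching control object 23."""
    instruction = recognize(utterance)
    if instruction is None:
        return None  # nothing recognized, so nothing is executed
    handler = control_objects.get(instruction.target)
    if handler is None:
        return None  # no such control object on this vehicle
    return handler(instruction.action)


control_objects = {"window": lambda action: f"window:{action}"}
print(handle_utterance("Open the window", control_objects))  # window:open
```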
  • the inventor of the present application discovered in practice that, in some cases, a voice command picked up by the vehicle's voice collection system should not be executed directly. Executing it directly may cause misoperation of the vehicle, creating hidden dangers to the personal and property safety of the occupants.
  • the voice commands obtained by the vehicle are not differentiated: they are executed whether they come from inside or outside the vehicle, which may cause misoperation of the vehicle and create hidden dangers to the personal and property safety of the occupants.
  • embodiments of the present application provide a vehicle control method. After the vehicle receives a voice command, it can determine whether the person who issued the voice command is located inside the vehicle and decide accordingly whether to perform the control operation corresponding to the voice command, making the control of the vehicle more intelligent and improving the safety of the vehicle.
  • This application can help users in the car effectively avoid control of and interference with the vehicle by strangers outside the car, prevent misoperation of the vehicle, and further ensure the safety of people in the car, especially in resting scenarios.
  • the vehicle control method provided by the embodiment of the present application is applied in the vehicle, or in other devices with the function of controlling the vehicle (such as cloud servers, mobile phone terminals, etc.).
  • the vehicle may specifically be an internal combustion engine vehicle, an intelligent electric vehicle, or a hybrid vehicle, or a vehicle of another power type, which is not limited in the embodiments of the present application.
  • the vehicle may be an autonomous vehicle, which may be a vehicle with partial autonomous driving capabilities or a vehicle with full autonomous driving capabilities. That is, the vehicle's autonomous driving level can refer to the classification standard of the Society of Automotive Engineers (SAE), which is divided into no automation (L0), driving support (L1), partial automation (L2), conditional automation (L3), high automation (L4), and complete automation (L5).
  • a vehicle or other device can implement the vehicle control method provided by the embodiments of the present application through its components (including hardware and software). It determines whether the sound source position of the voice command used to control the vehicle is located inside the vehicle, that is, whether the person who spoke the voice command is inside or outside the vehicle, and decides based on the determination result whether to perform the control operation corresponding to the voice command. This makes the control of the vehicle more intelligent, improves the safety of the vehicle, and protects the personal and property safety of the occupants.
  • the vehicle may be the vehicle 100 in FIG. 3 , which is a functional block diagram of the vehicle 100 provided by the embodiment of the present application.
  • vehicle 100 may include various subsystems such as travel system 102 , sensor system 104 , control system 106 , one or more peripheral devices 108 as well as power supply 110 , computer system 112 and user interface 116 .
  • vehicle 100 may include more or fewer subsystems, and each subsystem may include multiple elements. Additionally, each subsystem and element of vehicle 100 may be interconnected via wires or wirelessly.
  • the travel system 102 may include components that provide powered motion for the vehicle 100 .
  • the travel system 102 may include an engine 118 , an energy source 119 , a transmission 120 and wheels 121 .
  • the engine 118 may be an internal combustion engine, an electric motor, an air compression engine, or a combination of other types of engines, such as a hybrid engine composed of a gasoline engine and an electric motor, or a hybrid engine composed of an internal combustion engine and an air compression engine.
  • Engine 118 converts energy source 119 into mechanical energy.
  • Examples of energy sources 119 include gasoline, diesel, other petroleum-based fuels, propane, other compressed gas-based fuels, ethanol, solar panels, batteries, and other sources of electricity. Energy source 119 may also provide energy to other systems of vehicle 100 .
  • Transmission 120 may transmit mechanical power from engine 118 to wheels 121 .
  • Transmission 120 may include a gearbox, differential, and driveshaft.
  • the transmission device 120 may also include other components, such as a clutch.
  • the drive shaft may include one or more axles that may be coupled to one or more wheels 121 .
  • Sensor system 104 may include a number of sensors that sense information about the environment surrounding vehicle 100 .
  • the sensor system 104 may include a positioning system 122 (which may be a GPS system, a Beidou system, or another positioning system), an inertial measurement unit (IMU) 124, a radar 126, a laser rangefinder 128, and a camera 130.
  • Positioning system 122 may be used to estimate the geographic location of vehicle 100 .
  • the IMU 124 is used to sense changes in position and orientation of the vehicle 100 based on inertial acceleration.
  • IMU 124 may be a combination of accelerometer and gyroscope.
  • Radar 126 may utilize radio signals to sense objects within the environment surrounding vehicle 100 .
  • in addition to sensing objects, the radar 126 (for example, a millimeter-wave radar or a lidar) may also be used to sense the speed and/or heading of an object.
  • Laser rangefinder 128 may utilize laser light to sense objects in the environment in which vehicle 100 is located.
  • laser rangefinder 128 may include one or more laser sources, laser scanners, and one or more detectors, among other system components.
  • Camera 130 may be used to capture multiple images of the surrounding environment of vehicle 100 .
  • the camera 130 may be a static camera or a video camera, or may be a visual perception camera.
  • the surrounding environment can be detected through radar 126, laser range finder 128 or camera 130 to determine whether there are people around the vehicle.
  • the sensor system 104 also includes sensors of internal systems of the vehicle 100 , including, for example, sensors of an advanced driving assistance system (ADAS).
  • the sensors of the internal system of the vehicle 100 include sensors such as seat pressure sensor 1041, in-vehicle camera 1042, touch sensor 1043, vibration sensor 1044, and vehicle speed sensor.
  • the seat pressure sensor 1041 can be used to monitor the pressure data on each seat in the car.
  • the in-car camera 1042 can be used to capture multiple images of the occupants in the car and multiple images of the in-car environment.
  • the touch sensor 1043 can be used to monitor touch data on the display screen of the car's center console.
  • the vibration sensor 1044 is used to capture vibration data occurring in the vehicle.
  • the processor can determine whether there is a passenger in the vehicle and the specific location of the passenger based on at least one of the pressure data, image, touch data, vibration data, etc.
  • the vehicle speed sensor is used to monitor the vehicle's speed data to determine whether the vehicle is stationary.
  • sensors for vehicle internal systems can also include air quality sensors, fuel gauges, oil temperature gauges, etc.
  • one or more kinds of sensor data collected by these sensors can be used to detect objects and their corresponding characteristics (position, shape, temperature, speed, etc.). This detection and identification is key to the safe operation of the vehicle 100 and to ensuring the safety of the vehicle occupants.
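  • As an illustration of how a processor might combine in-vehicle sensor data to decide whether and where occupants are present, the following Python sketch treats a seat as occupied if either its pressure reading or the in-vehicle camera indicates a person. The `SeatReading` type, the unit, and the threshold value are assumptions made for this example only.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class SeatReading:
    pressure_n: float      # seat pressure sensor 1041 reading (newtons; unit assumed)
    person_in_image: bool  # whether analysis of in-car camera 1042 images found a person here


PRESSURE_THRESHOLD_N = 200.0  # hypothetical threshold, not taken from the application


def occupied_seats(readings: Dict[str, SeatReading]) -> List[str]:
    """Return the seats judged occupied from at least one kind of sensor data."""
    return sorted(
        seat
        for seat, reading in readings.items()
        if reading.pressure_n >= PRESSURE_THRESHOLD_N or reading.person_in_image
    )


readings = {
    "driver": SeatReading(pressure_n=650.0, person_in_image=True),
    "front_passenger": SeatReading(pressure_n=0.0, person_in_image=False),
    "rear_left": SeatReading(pressure_n=30.0, person_in_image=True),
}
print(occupied_seats(readings))  # ['driver', 'rear_left']
```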
  • Control system 106 controls the operation of vehicle 100 and its components.
  • Control system 106 may include various elements, including steering system 132 , throttle 134 , braking unit 136 , computer vision system 140 , route control system 142 , and obstacle avoidance system 144 .
  • Steering system 132 is operable to adjust the forward direction of vehicle 100 .
  • it may be a steering wheel system.
  • Throttle 134 is used to control the operating speed of engine 118 and thereby the speed of vehicle 100 .
  • the braking unit 136 is used to control the deceleration of the vehicle 100 .
  • Braking unit 136 may use friction to slow wheel 121 .
  • braking unit 136 may convert kinetic energy of wheel 121 into electrical current.
  • the braking unit 136 may also take other forms to slow down the rotation speed of the wheels 121 to control the speed of the vehicle 100 .
  • the computer vision system 140 can process and analyze the images captured by the camera 130 and the in-vehicle camera 1042 to identify the distribution of people in the surrounding environment outside the vehicle 100 and the distribution of people inside the vehicle.
  • the distribution of people outside the vehicle includes information such as whether there are people within the preset range outside the vehicle, the location or orientation of people outside the vehicle, and other information.
  • the distribution of people in the car includes information such as whether there are people in the car, the location or orientation of the people in the car, and which seat(s) the people in the car are sitting in.
  • the computer vision system 140 can use at least one of a human body recognition algorithm, a structure from motion (SFM) algorithm, video tracking, and other computer vision technologies to perform operations such as mapping the environment, tracking objects, estimating the speed of objects, and determining the current conditions inside and outside the car.
  • the route control system 142 is used to determine the driving route of the vehicle 100 .
  • route control system 142 may combine sensor data from sensor system 104 and data from one or more predetermined maps to determine a route for vehicle 100 .
  • Obstacle avoidance system 144 is used to identify, evaluate, and avoid or otherwise negotiate potential obstacles in the environment of vehicle 100 .
  • control system 106 may additionally or alternatively include components other than those shown and described. Alternatively, some of the components shown above may be omitted.
  • Peripheral devices 108 may include a wireless communication system 146 , an onboard computer 148 , a microphone 150 and a speaker 152 .
  • peripheral device 108 provides a means for a user of vehicle 100 to interact with user interface 116 .
  • onboard computer 148 may provide information to a user of vehicle 100 .
  • the user interface 116 may also operate the onboard computer 148 to receive user input.
  • the onboard computer 148 can be operated via a touch screen.
  • peripheral device 108 may provide a means for vehicle 100 to communicate with other devices located within the vehicle.
  • microphone 150 may receive audio (eg, voice commands or other audio input) from a user of vehicle 100 .
  • speakers 152 may output audio to a user of vehicle 100 .
  • the microphone 150 can pick up sounds inside the car and can also pick up sounds outside the car. When the car windows are not closed or the sounds outside the car are loud, the microphone 150 can also accurately pick up various sounds outside the car, including voice commands for controlling the vehicle.
  • the microphone 150 may be composed of multiple microphone units.
  • the multiple microphone units may be disposed near the center console to form a microphone array, or may be distributed next to each seat in the car. At least one of them may also be disposed on the outer wall of the car (outside the car) to pick up sound outside the car.
  • whether the sound source position is located inside the vehicle can be determined by the pickup intensity of the voice instruction by the microphone 150 .
  • the orientation or position of the sound source position can be determined based on the pickup intensity of the voice instructions by multiple microphone units, and whether the sound source position is located inside the vehicle can be determined based on the orientation or position.
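  • A minimal Python sketch of the intensity-based idea: it assumes, for illustration only, that the microphone unit with the highest pickup intensity is the one closest to the sound source, so the source is judged to be inside the vehicle when that unit is mounted inside. The `MicReading` type and the decibel values are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class MicReading:
    name: str
    inside_vehicle: bool  # True if this microphone unit is mounted inside the car
    intensity_db: float   # pickup intensity of the voice instruction at this unit


def source_is_inside(readings: List[MicReading]) -> bool:
    """Judge the sound source location from the loudest microphone unit."""
    if not readings:
        raise ValueError("at least one microphone reading is required")
    loudest = max(readings, key=lambda r: r.intensity_db)
    return loudest.inside_vehicle


readings = [
    MicReading("center_console", inside_vehicle=True, intensity_db=48.0),
    MicReading("rear_seat", inside_vehicle=True, intensity_db=45.0),
    MicReading("outer_wall", inside_vehicle=False, intensity_db=62.0),
]
print(source_is_inside(readings))  # False
```

  • A real system would likely combine intensity with other cues, since open windows or loud exterior sound can blur the difference between units.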
  • Wireless communication system 146 may wirelessly communicate with one or more devices directly or via a communication network.
  • wireless communication system 146 may use 3G cellular communications, such as CDMA, EVDO, or GSM/GPRS; 4G cellular communications, such as LTE; or 5G cellular communications.
  • the wireless communication system 146 can communicate with a wireless local area network (WLAN) using WiFi.
  • wireless communication system 146 may utilize infrared links, Bluetooth, or ZigBee to communicate directly with the device.
  • other wireless protocols may also be used, such as various vehicle communication systems. For example, wireless communication system 146 may include one or more dedicated short range communications (DSRC) devices, which may carry public and/or private data communications between vehicles and/or roadside stations.
  • Power supply 110 may provide power to various components of vehicle 100 .
  • the power source 110 may be a rechargeable lithium-ion, sodium-ion, or lead-acid battery.
  • One or more packs of such batteries may be configured as a power source to provide power to various components of the vehicle 100 .
  • power source 110 and energy source 119 may be implemented together, such as in some all-electric vehicles.
  • Computer system 112 may include at least one processor 113 that executes instructions 115 stored in a non-transitory computer-readable medium such as data storage device 114.
  • Computer system 112 may also be a plurality of computing devices that control individual components or subsystems of vehicle 100 in a distributed manner.
  • Processor 113 may be any conventional processor, such as a commercially available central processing unit (CPU). Alternatively, the processor may be a dedicated device such as an application specific integrated circuit (ASIC) or other hardware-based processor.
  • although FIG. 3 functionally illustrates processor 113, data storage device 114, and other elements of computer system 112 in the same block, one of ordinary skill in the art will understand that processor 113, computer system 112, or data storage device 114 may actually include multiple processors, computers, or memories that are not housed within the same physical enclosure.
  • the memory may be a hard drive or other storage medium located in a housing different from computer system 112 .
  • a reference to a processor or computer will be understood to include a reference to a collection of processors or computers or memories that may or may not operate in parallel.
  • some components, such as the steering component and the deceleration component, may each have their own processor that only performs calculations related to component-specific functionality.
  • the processor may be located remotely from the vehicle and in wireless communication with the vehicle. In other aspects, some of the processes described herein are performed on a processor disposed within the vehicle and others are performed by a remote processor, including taking the steps necessary to perform a single maneuver.
  • data storage 114 may contain instructions 115 (eg, program logic) that may be executed by processor 113 to perform various functions of vehicle 100 , including those described above.
  • Data storage 114 may also contain additional instructions, including instructions to send data to, receive data from, interact with, and/or control one or more of travel system 102 , sensor system 104 , control system 106 , and peripheral devices 108 .
  • the data storage device 114 may store data such as road maps, route information, vehicle position, direction, speed and other such vehicle data, as well as other information. This information may be used by vehicle 100 and computer system 112 during operation of vehicle 100 in autonomous, semi-autonomous and/or manual modes.
  • the data storage device 114 obtains, from the sensor system 104 or other components of the vehicle 100, the voice instructions for controlling the vehicle collected by the microphone 150, the distribution information of people in the vehicle, the distribution information of people outside the vehicle, vehicle status information, and other information.
  • the voice command can be issued by a person inside the vehicle or by a person outside the vehicle.
  • the distribution information of people in the car includes whether there are people in the car, the orientation or specific location of the people in the car.
  • the distribution information of people outside the vehicle includes whether there are people within the preset range outside the vehicle, the orientation or specific location of the people outside the vehicle.
  • Vehicle status information includes vehicle driving speed, window opening and closing status and other information.
  • the data storage device 114 can store the above-mentioned related information.
  • the processor 113 can determine whether the sound source position of the voice instruction is located inside the vehicle based on the above information. If it is determined that the sound source position of the voice command is located inside the vehicle, the vehicle 100 may be further controlled to perform operations corresponding to the voice command.
  • User interface 116 is used to provide information to or receive information from a user of vehicle 100 .
  • user interface 116 may include one or more input/output devices within a collection of peripheral devices 108 , such as wireless communications system 146 , onboard computer 148 , microphone 150 , and speaker 152 .
  • Computer system 112 may control functions of vehicle 100 based on input received from various subsystems (eg, travel system 102 , sensor system 104 , and control system 106 ) and from user interface 116 .
  • computer system 112 may utilize input from control system 106 to control steering system 132 to avoid obstacles detected by sensor system 104 and obstacle avoidance system 144 .
  • computer system 112 is operable to provide control of many aspects of vehicle 100 and its subsystems.
  • one or more of these components described above may be installed separately or associated with vehicle 100 .
  • data storage device 114 may exist partially or completely separate from vehicle 100 .
  • the components described above may be communicatively coupled together in wired and/or wireless manners.
  • a self-driving car, such as the vehicle 100 above, can determine whether the sound source position of the voice command is located inside the vehicle based on information such as the audio characteristics of the voice command collected by the microphone 150, the pickup intensity of the voice command at the microphone 150, the distribution of people inside the vehicle, the distribution of people within the preset range outside the vehicle, the driving speed of the vehicle, and the opening and closing state of the vehicle's windows.
  • the vehicle 100 as an autonomous vehicle, or a computing device associated with it (such as the computer system 112, the computer vision system 140, and the data storage device 114 in FIG. 3), can decide based on the determination result whether to execute the control operation corresponding to the voice instruction. For example, when it is determined that the sound source position is inside the vehicle, the vehicle is controlled to perform the operation corresponding to the voice instruction. For another example, when it is determined that the sound source position is not located inside the vehicle (that is, it is located outside the vehicle), the voice command may not be executed; or it may be further determined whether a preset trigger condition is met, in which case the vehicle is controlled to perform the operation corresponding to the voice command when the trigger condition is satisfied and the voice command is not executed when it is not satisfied; or an inquiry message is sent to the user asking whether to execute the voice command.
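  • The alternatives described above can be summarized in a short Python sketch. The `Decision` enum, the function name, and the two flags are hypothetical; the application describes the outside-the-vehicle options as alternatives and does not prescribe how they are selected.

```python
from enum import Enum
from typing import Optional


class Decision(Enum):
    EXECUTE = "execute"    # perform the operation corresponding to the voice command
    IGNORE = "ignore"      # do not execute the voice command
    ASK_USER = "ask_user"  # send an inquiry on whether to execute the voice command


def decide(
    source_inside: bool,
    trigger_met: Optional[bool] = None,
    ask_when_outside: bool = False,
) -> Decision:
    """Choose how to handle a voice command given where its sound source is."""
    if source_inside:
        return Decision.EXECUTE
    if ask_when_outside:
        return Decision.ASK_USER
    if trigger_met:  # a preset trigger condition was evaluated and is satisfied
        return Decision.EXECUTE
    return Decision.IGNORE


print(decide(source_inside=False, trigger_met=True))  # Decision.EXECUTE
```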
  • the computing device may also provide instructions for modifying the status of various devices within the vehicle 100, so that the self-driving car follows given emergency control measures and controls the status of those devices to ensure the safety of the vehicle occupants.
  • the above-mentioned vehicle 100 can be a car, a truck, an engineering vehicle, a motorcycle, a bus, a boat, an airplane, a helicopter, a lawn mower, an entertainment vehicle, a playground vehicle, construction equipment, a tram, a golf cart, a train, a trolley, or the like, which is not particularly limited in the embodiments of this application.
  • FIG. 4 is a schematic flowchart of a vehicle control method 400 provided by an embodiment of the present application.
  • the control method 400 can be applied to the vehicle 100 shown in FIG. 3 .
  • the execution subject of the control method 400 can be the vehicle 100 or a processor or chip in the vehicle 100. The control method 400 can also be applied to other devices capable of controlling the vehicle 100 (such as cloud servers and mobile terminals); for example, the execution subject of the control method 400 may be the cloud server or a processor or chip inside the cloud server.
  • the control method 400 may include the following steps 410 to 440.
  • Step 410 Obtain voice instructions for controlling the vehicle.
  • vehicles such as smart cars are usually equipped with microphones and other sound pickup devices that can collect speech. For example, they can obtain voice instructions for controlling the vehicle, and they can also pick up irrelevant content such as chat.
  • microphones can collect speech not only inside the car but often also outside the car (especially when the windows are open). Therefore, the voice command can be issued by an occupant of the car (such as the driver or a rear-seat passenger) or by a person outside the car. In some cases, the voice command can also be issued by smart devices with a voice broadcast function, such as robots and smart speakers. This application does not specifically limit this.
  • the voice command can control the vehicle. After the vehicle collects the voice command, it performs semantic recognition, generates the corresponding control instruction, and controls the related control object through the control instruction. This application does not limit which aspect of the vehicle the voice command controls. Optionally, the voice command can perform status control, media control, air conditioning control, seat control, and so on.
  • the voice command can be a vehicle status control instruction such as opening/closing windows (including sunroofs), opening/closing doors (locking/unlocking doors), opening/closing the trunk, ambient light adjustment, vehicle locking, vehicle ignition/stop, vehicle speed adjustment, or rearview mirror adjustment.
  • the voice command can also be media control commands such as opening navigation, adjusting the central control screen, playing music, tuning the radio, answering/making calls, etc.
  • the voice command can also be air conditioning control commands such as raising the temperature, lowering the temperature, starting internal circulation, purifying the air in the car, etc.
  • the voice command can also be a seat control command such as turning on seat heating, raising/lowering the seat height, moving the seat forward/backward, adjusting the seat back angle, etc.
  • the voice command can also be a wake-up command used to wake up the voice assistant of the vehicle.
  • the wake-up command can be a default command set by the manufacturer.
  • the voice command can be a wake-up phrase used to wake up the intelligent voice assistant Xiaoyi.
  • the wake-up command can also be customized by the user according to personal preferences.
  • the vehicle's voice assistant can also be in wake-up-free mode, that is, the voice assistant remains awake at all times to ensure that the vehicle can promptly obtain instructions for controlling the vehicle's status, such as "open the door", which makes voice control of the vehicle more convenient.
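  • The command categories listed in step 410 can be illustrated with a small Python lookup. The keyword lists and the `categorize` function are hypothetical and only loosely based on the examples above; a real system would rely on semantic recognition rather than keyword matching.

```python
from typing import Optional

# Hypothetical keyword lists for the four categories named in the text.
CATEGORY_KEYWORDS = {
    "status": ("window", "door", "trunk", "ambient light", "mirror"),
    "media": ("navigation", "music", "radio", "call"),
    "air_conditioning": ("temperature", "circulation", "purify"),
    "seat": ("seat",),
}


def categorize(command: str) -> Optional[str]:
    """Return the first category whose keyword appears in the command text."""
    text = command.lower()
    for category, keywords in CATEGORY_KEYWORDS.items():
        if any(keyword in text for keyword in keywords):
            return category
    return None  # not a recognized control command (for example, chat content)


print(categorize("Turn on seat heating"))  # seat
```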
  • Step 420 Determine whether the sound source position of the voice command is located inside the vehicle.
  • the inside of the vehicle is defined relative to the outside of the vehicle.
  • the vehicle interior may refer to the interior of the cab.
  • for passenger cars such as sedans, the cab is equipped with seats for passengers in addition to the driver's seat. Therefore, the vehicle interior of a passenger car such as a sedan is clearly defined as the interior of the vehicle body including the seats.
  • for vehicles such as trucks (for example, pickup trucks), in addition to the cab, there are usually also spaces such as cargo beds, compartments, or buckets for hauling goods or for construction work. This part of the space is included in this application.
  • the vehicle may be equipped with ADAS sensors such as in-car cameras, exterior cameras, radars, or seat pressure sensors.
  • various data related to the vehicle can be obtained through these sensors, and whether the sound source position of the voice command is located inside the vehicle can be determined based on one or more kinds of this data. This application does not specifically limit how it is determined whether the sound source position of the voice command is located inside the vehicle.
  • the acquired data can be combined with a pre-built-in algorithm to determine whether the sound source is located inside the vehicle.
  • the above data can be input into a neural network model, which analyzes the data and determines whether the sound source location is inside the vehicle.
  • the neural network model is obtained by model training with historical data (training data) and a deep learning algorithm.
  • the neural network model can be, for example, a convolutional neural network (CNN) model.
  • the deep learning algorithm can be, for example, a machine learning algorithm or a meta-learning algorithm.
  • for example, whether the sound source position of the voice instruction is located inside the vehicle can be determined based on one or more of the following data: audio feature data of the voice instruction collected by the microphone; pickup intensity data of the voice instruction collected by the microphone; in-car image data collected by the in-car camera; in-car sound data collected by the microphone; in-car seat pressure data collected by the seat sensor; in-car vibration data collected by the vibration sensor; touch data of the in-car touch screen collected by the touch sensor; vehicle speed data collected by the vehicle speed sensor; window opening and closing data detected by the glass position detection sensor; exterior image data collected by the exterior camera; and obstacle data outside the vehicle collected by the radar or laser rangefinder. The data is not limited to these; with the further development of intelligent vehicles and sensor technology, the content and types of such data may grow.
  • any one of the multiple methods, or a combination of multiple methods can be used to comprehensively determine whether the sound source position is located inside the vehicle.
  • when multiple methods are combined, they can be evaluated at the same time and the final result derived from the multiple determination results; or the methods can be evaluated in a certain priority order, so that when one or more of the earlier methods can accurately determine the result, the later methods need not be executed, which saves processor computing power.
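  • The two ways of combining methods can be sketched in Python as follows. Each method is modeled, as an assumption of this example, as a function that returns `True` (inside), `False` (outside), or `None` (cannot decide). The majority vote is only one possible way to derive a final result from several determination results; the application does not specify the combination rule.

```python
from typing import Callable, Iterable, Optional

# A method returns True (inside), False (outside), or None (undetermined).
Method = Callable[[], Optional[bool]]


def decide_by_priority(methods: Iterable[Method]) -> Optional[bool]:
    """Run methods in priority order and stop at the first definite answer."""
    for method in methods:
        result = method()
        if result is not None:
            return result  # later methods are skipped, saving computation
    return None


def decide_by_vote(methods: Iterable[Method]) -> Optional[bool]:
    """Run all methods and take a simple majority of the definite answers."""
    results = [r for r in (method() for method in methods) if r is not None]
    inside = sum(results)             # True counts as 1
    outside = len(results) - inside
    if inside == outside:
        return None                   # no answers, or a tie
    return inside > outside


print(decide_by_priority([lambda: None, lambda: False, lambda: True]))  # False
print(decide_by_vote([lambda: True, lambda: True, lambda: False]))      # True
```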
  • whether the sound source is located inside the vehicle can be determined by the distribution of people inside the vehicle.
  • relevant data can be obtained through detection devices such as in-car cameras, seat pressure sensors, touch sensors, vibration sensors, magnetometers, and infrared sensors. Based on this data, personnel distribution information can be determined, such as whether there are passengers in the car, their orientation or specific location, and the specific seats they occupy. Finally, whether the sound source position is located inside the vehicle is determined based on the personnel distribution information.
  • if the personnel distribution information indicates that there are currently no drivers or passengers in the vehicle, it may be determined that the sound source location is outside the vehicle rather than inside.
  • FIG. 5 is a schematic diagram of an application scenario of the control method 400 shown in FIG. 4 .
  • a person 50 outside the car (for example, the car owner)
  • the microphone 51 inside the car collects the voice command.
  • it can be determined that the sound must come from outside the vehicle (there is no need to turn on the exterior camera for further confirmation); that is, it is determined that the sound source position of the voice command is located outside the vehicle rather than inside.
  • the personnel distribution information indicates that there are drivers and passengers in the current vehicle
  • other information can be used to jointly determine whether the sound source position is located inside the vehicle.
  • the determination can be further made in combination with the audio feature data of the voice command collected by the microphone or the pickup (sound pickup) intensity data of the voice command collected by the microphone, which will be further described below.
  • whether the sound source is located inside the vehicle can be determined by the distribution of people outside the vehicle.
  • relevant data can be obtained through detection devices such as cameras, radars, and laser rangefinders outside the vehicle, and based on this data, it can be determined whether there are people within the preset range near the outside of the vehicle, as well as personnel distribution information such as their orientation or specific location. Finally, it is determined whether the sound source is located inside the vehicle based on the personnel distribution information.
  • This preset range can be determined based on the sound pickup capability of the microphone equipped in the vehicle. The stronger the sound pickup capability, it means that voice commands issued by people at a greater distance may also be picked up, so the preset range should also be larger. The manufacturer can reasonably set the preset range based on the specific configuration of the microphone before the vehicle leaves the factory.
  • if the personnel distribution information indicates that there are currently no people within the preset range outside the vehicle, it can be determined that the sound source position is located inside the vehicle.
  • the person 50 in the car (for example, the driver) speaks the voice command "open the window”.
  • through the data detected by the vehicle's exterior camera 54, radar, laser rangefinder and other devices, it can be determined that there are no people within the preset range outside the vehicle at this time. It can then be determined that the sound must come from inside the vehicle (there is no need to additionally turn on the in-car camera for further confirmation); that is, it is determined that the sound source position of the voice command is located inside the vehicle. Only one exterior camera 54 is shown in the figure.
  • multiple exterior cameras 54 can be set up along the periphery of the vehicle body to obtain a 360-degree panoramic image of the exterior periphery of the vehicle body, which facilitates more efficient and accurate determination of whether there is anyone within the preset range outside the vehicle.
  • the personnel distribution information indicates that there are people within the current preset range outside the vehicle
  • other information can be used to jointly determine whether the sound source location is inside the vehicle.
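  • The occupancy-based rules described above (interior and exterior personnel distribution) can be condensed into a small decision function. This is only a sketch under stated assumptions: the function name and the integer head-count inputs are hypothetical, and the case where nobody is detected anywhere is not covered by the text, so the sketch leaves it undecided.

```python
from typing import Optional


def locate_by_occupancy(people_inside: int,
                        people_outside_in_range: int) -> Optional[bool]:
    """Return True (inside), False (outside), or None (other data needed).

    people_inside: occupants detected by in-car sensors (hypothetical input).
    people_outside_in_range: people detected within the preset range outside.
    """
    if people_inside == 0 and people_outside_in_range == 0:
        # Not covered by the text; this sketch treats it as undecided.
        return None
    if people_inside == 0:
        return False  # nobody in the cabin: the voice came from outside
    if people_outside_in_range == 0:
        return True   # nobody near the vehicle: the voice came from inside
    return None       # people on both sides: combine with other information
```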
  • the driving speed of the vehicle can be used to determine whether the sound source is located inside the vehicle.
  • if the driving speed is greater than or equal to 80 kilometers per hour (km/h) when the voice command for controlling the vehicle is acquired, it can be determined that the sound source position of the voice command is located inside the vehicle.
  • the vehicle speed threshold can be 60 to 100km/h, for example, 70km/h or 80km/h.
  • the manufacturer can reasonably set the vehicle speed threshold based on the microphone's sound pickup ability before the vehicle leaves the factory.
  • the person 50 in the car (for example, the driver) speaks the voice command "Adjust the temperature in the car to 22 degrees Celsius", and the microphone 51 in the car collects the voice command.
  • the vehicle speed data obtained by the vehicle speed sensor can be used to determine that the current vehicle speed is 110km/h, which is greater than the preset vehicle speed threshold of 75km/h.
  • the microphone 51 cannot pick up the voice outside the vehicle, so it can be determined that the sound source location of the voice command is inside the vehicle.
  • the vehicle's driving speed is less than the vehicle speed threshold
  • other information can be used to jointly determine whether the sound source position is located inside the vehicle.
  • the microphone in the car may not be able to pick up the voice command outside the car, so it can be determined that the sound source position of the voice command is located inside the vehicle.
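  • The speed-based rule can be sketched as a single comparison. The 75 km/h default below is the example threshold used in the scenario above; the function name and the three-valued return convention are assumptions of this sketch.

```python
from typing import Optional

# Example threshold from the scenario above; in practice the manufacturer
# would set it according to the microphone's sound pickup capability.
SPEED_THRESHOLD_KMH = 75.0


def locate_by_speed(speed_kmh: float,
                    threshold_kmh: float = SPEED_THRESHOLD_KMH) -> Optional[bool]:
    """Return True if the speed alone implies an in-vehicle source, else None."""
    if speed_kmh >= threshold_kmh:
        return True  # at this speed an outside voice is assumed not to be picked up
    return None      # below the threshold: other information is needed
```

  • For the scenario above, `locate_by_speed(110.0)` yields True, while a stationary vehicle yields None and falls through to the other methods.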
  • whether the sound source is located inside the vehicle can be determined through the opening and closing state of the vehicle's windows.
  • Relevant data can be obtained through the window position detection sensor to determine the current status (ie, opening and closing status) of each window of the vehicle, for example, it can be fully open, half open, or fully closed.
  • Car window glass has a certain sound insulation effect. When the car window is closed, the microphone in the car may not be able to accurately pick up voice commands from outside the car. However, when the car window is open, the microphone in the car can pick up voice commands from outside the car, so whether the sound source is located inside the vehicle can be determined based on the opening and closing state of the vehicle's windows.
  • the microphone cannot receive the voice outside the vehicle at this time, and it can be determined that the sound source position of the voice command is located inside the vehicle.
  • the vehicle can be, for example, some high-end cars, and its sound insulation effect is particularly outstanding.
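  • A minimal sketch of the window-state rule follows. The enum values mirror the three states named above; the `strong_insulation` flag is a hypothetical stand-in for a vehicle whose sound insulation is good enough that closed windows block outside voices.

```python
from enum import Enum
from typing import Iterable, Optional


class WindowState(Enum):
    FULLY_CLOSED = 0
    HALF_OPEN = 1
    FULLY_OPEN = 2


def locate_by_windows(states: Iterable[WindowState],
                      strong_insulation: bool) -> Optional[bool]:
    """Return True if closed windows imply an in-vehicle source, else None."""
    all_closed = all(s is WindowState.FULLY_CLOSED for s in states)
    if all_closed and strong_insulation:
        return True  # outside voices are assumed not to reach the microphone
    return None      # any open window, or weak insulation: undecided
```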
  • the audio characteristics of the voice command can be used to determine whether the sound source location is inside the vehicle. After sound passes through obstacles, some of its audio characteristics (such as waveform and energy) are affected and change. In other words, if the same person speaks the same voice command inside and outside the car, the audio characteristics of the two will differ after being picked up by the microphone. Whether the sound source position is located inside the vehicle can then be determined through this change or difference in the audio characteristics.
  • certain characteristic parameters of the audio characteristics of the voice command can be compared with a preset parameter threshold, and whether the sound source position is located inside the vehicle is determined based on the comparison result.
  • the acquired audio characteristics can be combined with a pre-built-in algorithm to determine whether the sound source is located inside the vehicle.
  • the audio feature data of the voice instruction can be input into a neural network model, and the neural network model analyzes the feature data and determines whether the sound source position is located inside the vehicle.
  • the neural network model is obtained by model training using historical data (training data) and a deep learning algorithm.
  • the neural network model may be, for example, a CNN model, and the deep learning algorithm may be, for example, a machine learning algorithm or a meta-learning algorithm.
  • Figure 6 is a schematic diagram of the propagation process of control instructions.
  • a microphone 61 is provided inside the vehicle, and the vehicle window 62 is in a completely closed state.
  • the person 60 outside the car speaks the voice command "open the door" outside the car.
  • This voice command propagates through the air in the form of sound waves, and needs to pass through the car body, including the window glass, to enter the car and eventually reach the microphone 61. Due to the absorption and refraction of sound waves by the car body, the audio characteristics of the voice command change; that is, the characteristic parameters of the sound wave 63 before passing through and the sound wave 64 after passing through in Figure 6 may differ.
  • the audio characteristics of the voice command picked up by the microphone 61 and combined with a preset algorithm such as a neural network model, it can be determined that the sound source position of the voice command is located outside the car, that is, the person speaks the voice command from outside the car.
  • the audio characteristics here may include any characteristic parameters such as spectral structure, energy, amplitude, frequency, period, etc., but are not limited to this.
  • the voice feature may include a spectrum structure, and whether the sound source position is located inside the vehicle may be determined based on the acquired spectrum structure of the voice instruction.
  • the energy in the high-frequency band in the spectrum structure is absorbed, and the high-frequency part will be significantly attenuated.
  • the high-frequency part of the spectrum structure of normal sound emitted in the car, which has not passed through obstacles, will be preserved relatively completely. Therefore, under the same circumstances, there will be differences in the spectrum structure between voice commands issued outside the car and voice commands issued inside the car. This difference can be used to determine whether the sound source position is located inside the vehicle.
  • the advantage of using the spectrum structure to determine whether the sound source position is located inside the vehicle is that the confirmation method is simple, efficient and easy to implement without resorting to other data, which can reduce the frequency of use of other sensors in the vehicle.
  • the spectral structure can be input into a pre-trained neural network model (such as a CNN model).
  • the neural network model analyzes and calculates the spectral structure, and finally outputs a judgment result of whether the sound source position is located inside the vehicle.
  • voice commands issued outside the car and voice commands issued inside the car differ most obviously in the integrity of the high-frequency part. Whether the sound source position of the voice command is located inside the vehicle can therefore be determined from the characteristics of the high-frequency part: when the integrity of the high-frequency part meets the preset conditions, it is determined that the sound source position is located inside the vehicle.
  • the advantage of determining whether the sound source position is located inside the vehicle through the characteristics of the high-frequency part of the spectrum structure is that the confirmation method is simple, efficient and easy to implement without resorting to other data, which can reduce the frequency of use of other sensors in the vehicle.
  • the high-frequency part of the spectrum structure can be input into a pre-trained neural network model (such as a CNN model).
  • the neural network model analyzes and calculates the spectrum structure, and finally outputs a judgment result of whether the sound source position is located inside the vehicle.
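  • As a simple, non-neural illustration of the high-frequency integrity idea, the sketch below computes the share of signal energy above a cutoff frequency and compares it with a preset ratio. It is not the application's method: the naive DFT, the 2000 Hz cutoff and the 0.2 ratio are placeholder assumptions chosen only to make the example self-contained.

```python
import cmath
import math
from typing import List, Sequence


def power_spectrum(samples: Sequence[float]) -> List[float]:
    """Naive O(n^2) DFT power for bins 0..n//2 (adequate for a short sketch)."""
    n = len(samples)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(samples))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def high_band_ratio(samples: Sequence[float], sample_rate_hz: float,
                    cutoff_hz: float) -> float:
    """Fraction of spectral energy at or above cutoff_hz (0.0 for silence)."""
    power = power_spectrum(samples)
    total = sum(power)
    if total == 0.0:
        return 0.0
    n = len(samples)
    high = sum(p for k, p in enumerate(power)
               if k * sample_rate_hz / n >= cutoff_hz)
    return high / total


def high_band_intact(samples: Sequence[float], sample_rate_hz: float,
                     cutoff_hz: float = 2000.0, min_ratio: float = 0.2) -> bool:
    """True when the high band is preserved, suggesting an in-vehicle source.

    cutoff_hz and min_ratio are hypothetical presets for this sketch.
    """
    return high_band_ratio(samples, sample_rate_hz, cutoff_hz) >= min_ratio
```

  • A signal whose high-frequency tone has been attenuated, as would happen after passing through the car body, produces a low ratio and is classified as coming from outside.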
  • Step 430 When it is determined that the sound source position is inside the vehicle, control the vehicle to perform operations corresponding to the voice instructions.
  • the vehicle can be controlled to perform corresponding operations based on the voice command.
  • the voice command may be a wake-up command used to wake up the voice assistant of the vehicle, such as "Xiaoyi Xiaoyi", in which case the voice assistant of the vehicle can be woken up.
  • the voice command can be a control command for status control, media control, air conditioning control, and seat control of the vehicle. In this case, the corresponding control object can be controlled to perform the corresponding operation.
  • controlling the vehicle to perform operations corresponding to the voice instructions specifically includes: performing semantic recognition on the acquired voice instructions to obtain control instructions, and controlling the vehicle accordingly according to the control instructions.
  • the voice command when it is determined that the sound source is located inside the vehicle, the voice command will be semantically parsed to generate the corresponding
  • the semantic analysis of the voice instruction when it is determined that the sound source position is not located inside the vehicle (that is, outside the vehicle), the semantic analysis of the voice instruction may not be performed, which is beneficial to saving the computing power of the processor.
  • Step 440 When it is determined that the sound source position is not located inside the vehicle, control the vehicle not to execute the voice command.
  • when it is determined in step 420 that the sound source position is not located inside the vehicle, that is, it is determined that the sound source position is located outside the vehicle, it means that the voice command may have been spoken by someone outside the vehicle (such as a stranger, a passerby, or a criminal) and accidentally picked up by the microphone. At this time, the voice command does not need to be semantically recognized and is not executed; that is, the voice command is not responded to. This can improve the safety of vehicle control and help ensure the personal and property safety of the passengers in the car.
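  • The gating in steps 430 and 440 can be sketched as follows: semantic parsing and execution happen only when the source is inside the vehicle. The `parse` and `execute` callables are hypothetical placeholders for the vehicle's semantic recognition and control functions.

```python
from typing import Callable, Optional


def handle_voice_command(audio: bytes,
                         source_inside: bool,
                         parse: Callable[[bytes], str],
                         execute: Callable[[str], None]) -> Optional[str]:
    """Return the executed control instruction, or None if the command was ignored."""
    if not source_inside:
        # Step 440: skip semantic recognition entirely, saving computation.
        return None
    control_instruction = parse(audio)  # step 430: semantic recognition
    execute(control_instruction)        # step 430: control the vehicle
    return control_instruction
```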
  • in the vehicle control method 400, after receiving a voice command for controlling the vehicle, the vehicle does not directly execute the command; it first determines whether the sound source position of the voice command is located inside the vehicle. The control operation corresponding to the voice command is executed only when it is determined that the sound source position of the voice command is located inside the vehicle. This makes the control of the vehicle more intelligent, improves the safety of the vehicle, and helps ensure the personal and property safety of passengers in the vehicle. This application can help users in the car avoid control of and interference with the vehicle by strangers outside the car, prevents misoperation of the vehicle, and further helps ensure the safety of people in the car, especially in resting scenes.
  • the control method 400 provided by the embodiment of the present application enables the vehicle 11 to first determine, according to the preset determination logic, that the sound source position of the voice command is located outside the vehicle. At this time, the vehicle 11 will not execute the voice command, that is, it will not open the car doors, thus helping ensure the personal and property safety of passengers in the car.
  • FIG. 7 is a schematic flowchart of a vehicle control method 700 provided by an embodiment of the present application.
  • the control method 700 may include the following steps 710 to 750.
  • step 710, step 730 and step 750 reference may be made to the descriptions of step 410, step 430 and step 440 in the foregoing embodiments respectively. Different parts will be emphasized here.
  • Step 710 Obtain voice instructions for controlling the vehicle.
  • Step 720 Determine whether the sound source is located inside the vehicle based on the microphone's pickup intensity of the voice command.
  • the intensity can reflect the distance from the sound source location. Based on the pickup intensity, it can be determined whether the sound source location is inside the vehicle. For example, the specific location of the sound source location, or the specific orientation of the sound source location, can be determined based on the pickup intensity of the voice command by the microphone, and then it can be further determined whether the sound source location is located inside the vehicle.
  • the vehicle may be equipped with only one microphone, and whether the sound source position is located inside the vehicle can be determined by the pickup intensity of the voice command by the only microphone.
  • Figure 8 is a schematic diagram of the application scenario of the control method shown in Figure 7. As shown in part (a) of Figure 8, the vehicle is equipped with an in-vehicle microphone 81, which is located adjacent to the driving position and is the vehicle's only microphone. The in-car microphone 81 can accurately pick up sounds only within a small surrounding range.
  • after the in-car microphone 81 receives the voice command "turn on the seat heating function" and it is determined that the pickup intensity of the voice command exceeds the threshold, it is determined that the voice command was issued by the person 80 in the driving seat, relatively close to the in-car microphone 81; that is, it is determined that the sound source location is in the car. At this time, the vehicle can execute the voice command, that is, turn on the seat heating function to heat the seat.
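  • The single-microphone case can be sketched as an intensity threshold test. The text does not define how pickup intensity is measured; this sketch assumes an RMS level in dB relative to full scale, and the -30 dB default threshold is a hypothetical preset.

```python
import math
from typing import Sequence


def rms_dbfs(samples: Sequence[float]) -> float:
    """RMS level in dB relative to full scale, for samples in [-1.0, 1.0]."""
    if not samples:
        return float("-inf")
    rms = math.sqrt(sum(x * x for x in samples) / len(samples))
    return 20.0 * math.log10(rms) if rms > 0.0 else float("-inf")


def source_near_microphone(samples: Sequence[float],
                           threshold_dbfs: float = -30.0) -> bool:
    """True when the pickup intensity exceeds the (assumed) preset threshold."""
    return rms_dbfs(samples) > threshold_dbfs
```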
  • the vehicle may usually be equipped with multiple microphones.
  • the pickup intensity of each of the multiple microphones for the voice command can be used in conjunction with a preset algorithm (such as a trained neural network model) to determine whether the sound source location is inside the vehicle.
  • the plurality of microphones can be installed in the car. For example, they can be centrally installed near the center console of the vehicle in the form of a microphone array, or they can be distributed at different seats in the car.
  • the plurality of microphones may also include at least one external microphone for picking up voices outside the vehicle, so that the user can perform voice interaction with the vehicle outside the vehicle and obtain a better vehicle experience.
  • the vehicle is equipped with at least one in-vehicle microphone 81 and at least one external microphone 82.
  • the in-vehicle microphone 81 and the out-of-vehicle microphone 82 pick up the voice of the instruction at the same time.
  • the pickup intensity of the out-of-vehicle microphone 82 is significantly greater than that of the in-vehicle microphone 81, so it can be determined that the sound source position (that is, the position of the person 80) is located outside the vehicle, and this instruction does not need to be executed.
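  • Comparing the in-vehicle and out-of-vehicle microphone levels can be sketched as a margin test. The text only states that a significantly stronger exterior pickup indicates an outside source; the symmetric rule for an inside source and the 6 dB margin are assumptions of this sketch.

```python
from typing import Optional


def locate_by_level_difference(inside_mic_db: float,
                               outside_mic_db: float,
                               margin_db: float = 6.0) -> Optional[bool]:
    """Return True (inside), False (outside), or None (difference too small)."""
    difference = inside_mic_db - outside_mic_db
    if difference <= -margin_db:
        return False  # exterior microphone is significantly louder
    if difference >= margin_db:
        return True   # assumed symmetric rule, not stated in the text
    return None       # levels too close to decide
```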
  • the vehicle is equipped with multiple in-vehicle microphones.
  • whether the sound source position is located inside the vehicle can be directly determined based on the pickup strength of each of the multiple in-vehicle microphones 81 for voice commands in combination with a preset algorithm.
  • the vehicle is equipped with three in-vehicle microphones 81 distributed at different seats in the vehicle.
  • the connection lines of these three in-vehicle microphones 81 generally form an isosceles triangle structure.
  • the person 80 in the middle seat of the back row speaks the voice command "Move the rear seat backwards", and the three in-car microphones 81 pick up the voice command with roughly the same intensity. It can then be determined that the sound source position is adjacent to the central position of the triangle; that is, the position of the sound source (ie, the position of the person 80) can be determined to be located inside the vehicle.
  • Step 730 If it is determined that the sound source is located inside the vehicle, control the vehicle to perform operations corresponding to the control instruction.
  • Step 740 if it is determined that the sound source is located outside the vehicle, it is determined whether the preset trigger condition is met. If the trigger condition is met, step 730 is entered, that is, the vehicle is controlled to perform operations corresponding to the control instruction.
  • Step 750 If the trigger condition is not met, the voice command is not executed.
  • the trigger conditions can be preset by the manufacturer, or can be set by the user.
  • the user can set the conditions based on the scenarios in which the user may voice control the vehicle outside the vehicle.
  • the triggering condition may be that the electronic key of the vehicle is located near the vehicle, that the vehicle owner is determined to be located near the vehicle through the positioning function of a mobile terminal such as a mobile phone, that the current time falls within a special time period (such as commuting hours), or that the vehicle has entered a specific mode (for example, camping mode); this application does not limit this.
  • if the vehicle can establish a Bluetooth connection with the electronic key, it can be determined that the electronic key is near the vehicle, that is, that the owner of the vehicle may also be located near the vehicle. At this time, the trigger condition is satisfied, and the vehicle can be controlled to perform operations corresponding to the control instruction.
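  • Steps 730 to 750 of method 700 can be sketched as below. The field names are hypothetical, and treating any single satisfied condition as sufficient is an assumption; the text lists the conditions as alternatives without specifying how they combine.

```python
from dataclasses import dataclass


@dataclass
class TriggerContext:
    """Hypothetical snapshot of the trigger conditions listed in the text."""
    key_bluetooth_connected: bool = False  # electronic key near the vehicle
    owner_phone_nearby: bool = False       # owner located via mobile terminal
    in_special_time_period: bool = False   # e.g. commuting hours
    special_mode_active: bool = False      # e.g. camping mode


def trigger_condition_met(ctx: TriggerContext) -> bool:
    """Assumption: any one satisfied condition is enough."""
    return any((ctx.key_bluetooth_connected, ctx.owner_phone_nearby,
                ctx.in_special_time_period, ctx.special_mode_active))


def decide_method_700(source_inside: bool, ctx: TriggerContext) -> str:
    """Return "execute" or "ignore" following steps 730, 740 and 750."""
    if source_inside:
        return "execute"  # step 730
    if trigger_condition_met(ctx):
        return "execute"  # step 740 leads back to step 730
    return "ignore"       # step 750
```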
  • FIG. 9 is a schematic flowchart of a vehicle control method 900 provided by an embodiment of the present application.
  • the control method 900 may include the following steps 910 to 960.
  • step 910, step 940 and step 950 reference may be made to the descriptions of step 410, step 420 and step 430 in the foregoing embodiments respectively. Different parts will be emphasized here.
  • Step 910 Obtain voice instructions for controlling the vehicle.
  • Step 920 Determine the orientation of the sound source location based on the pickup intensity of the voice command by each of the multiple microphones.
  • Step 930 Determine whether the sound source is located inside the vehicle based on the direction of the sound source.
  • the orientation of the sound source position can be determined based on the pickup intensity of the voice command by a pair of microphones, and with the help of a preset algorithm or logic, it can be determined whether the sound source position is located inside the vehicle through the orientation.
  • the sound source location is located outside the vehicle.
  • if the microphone is installed on the center console of the car and the orientation points to the front of the vehicle, the in-car space at this orientation has no seats and cannot accommodate passengers. Therefore, it can be determined that the sound source is located outside the vehicle.
  • the car window at this orientation is closed. For example, if the car window has a strong sound insulation effect and the current car window is completely closed, it can be determined that the sound source location is inside the vehicle.
  • after the orientation of the sound source location is determined, the distribution of people inside and outside the vehicle at this orientation can be further determined, and whether the sound source location is inside the vehicle is determined based on that distribution. For example, if there are no people inside the vehicle at this orientation, it can be determined that the sound source is located outside the vehicle; if there are no people outside the vehicle at this orientation, it can be determined that the sound source is located inside the vehicle; and if there are people both inside and outside the vehicle at this orientation, other sensor data can be used to determine whether the sound source is located inside the vehicle.
  • FIG. 10 is a schematic diagram of an application scenario of the control method 900 shown in FIG. 9 .
  • step 930 specifically includes:
  • Step 931 Determine whether there is a person inside the vehicle at the orientation. If not, determine that the sound source is located outside the vehicle. At this time, step 960 is entered to send a query message asking whether to execute the voice command; if so, step 932 is entered.
  • the in-car microphone 81 is installed near the center console of the vehicle.
  • the direction of the sound source position can first be determined based on the pickup intensity of the voice command by the in-car microphone 81 as indicated by the dotted arrow in the figure.
  • the person 80 located at this location speaks the voice command.
  • Step 932 Determine whether there is a person outside the vehicle at the orientation. If not, determine that the sound source position is inside the vehicle; at this time, step 950 is entered to control the vehicle to perform the control operation corresponding to the voice command. If so, this indicates that there are people both inside and outside the vehicle at the orientation; at this time, further judgment needs to be made with the help of other data, and step 940 can be entered.
  • the in-car microphone 81 is installed near the center console of the vehicle.
  • the direction of the sound source position can first be determined based on the pickup intensity of the voice command by the in-car microphone 81, as indicated by the dotted arrow in the figure. At this time, if it is determined that there is no person outside the vehicle in this direction with the help of data collected by the exterior camera 85, radar or laser range finder, it can be determined that the sound source position is inside the vehicle; for example, the person 80 in the main driving position of the vehicle speaks the voice command.
  • step 940 may be entered.
  • Step 940 Determine whether the sound source position is located in the vehicle according to the audio characteristics. If yes, proceed to step 950 to control the vehicle to perform a control operation corresponding to the voice command; if not, proceed to step 960 to send an inquiry message as to whether to execute the voice command.
  • step 960 is entered to send an inquiry message as to whether to execute the voice command, instead of directly not executing the voice command as in the previous embodiment.
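  • The branching of steps 931, 932, 940, 950 and 960 in method 900 can be summarized in a short decision function. The boolean inputs are hypothetical stand-ins for the sensor-based and audio-feature-based determinations described above.

```python
from enum import Enum


class Action(Enum):
    EXECUTE = "execute"  # step 950: perform the control operation
    ASK = "ask"          # step 960: send an inquiry before executing


def decide_method_900(person_inside_at_orientation: bool,
                      person_outside_at_orientation: bool,
                      audio_indicates_inside: bool) -> Action:
    """Follow steps 931 -> 932 -> 940 for a given sound source orientation."""
    if not person_inside_at_orientation:
        return Action.ASK      # step 931: source is outside the vehicle
    if not person_outside_at_orientation:
        return Action.EXECUTE  # step 932: source is inside the vehicle
    # Step 940: people on both sides, so fall back to the audio characteristics.
    return Action.EXECUTE if audio_indicates_inside else Action.ASK
```

  • The audio-feature check runs only in the ambiguous case, which matches the priority-order idea of skipping later methods once an earlier one is conclusive.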
  • the control method provided by the embodiment of the present application can not only improve the safety of vehicle voice control, but also increase the control flexibility, which is beneficial to improving the user experience.
  • the query information here may be sound information, image information, text information, etc.
  • the inquiry information here can be voice inquiry information, issued through the vehicle's speaker device, asking whether to execute the voice command, or it can be image inquiry information displayed on the central control screen, instrument screen or head-up display (HUD) device inside the vehicle; this application does not limit this.
  • the vehicle uses built-in judgment logic to determine that the voice command comes from outside the vehicle.
  • the speaker device in the car can broadcast the inquiry "Do you need to open the window?". If a positive reply is received from a person in the car through the touch screen or voice assistant, for example, the driver says "Yes, open the window", the "open window" operation can be performed; if the driver says "Don't open the window" or no further reply is received, the "open window" operation does not need to be performed.
  • FIG. 11 is a schematic block diagram of a vehicle control device 1100 provided by an embodiment of the present application. As shown in FIG. 11 , the control device 1100 includes an acquisition unit 1110 , a determination unit 1120 and a control unit 1130 .
  • Obtaining unit 1110 used to obtain voice instructions for controlling the vehicle
  • Determining unit 1120 used to determine whether the sound source position of the voice instruction is located inside the vehicle
  • the control unit 1130 is configured to control the vehicle to perform operations corresponding to the voice instructions when the sound source is located inside the vehicle.
  • the determining unit 1120 is specifically used to:
  • Whether the sound source position is located inside the vehicle is determined based on the audio characteristics of the voice instruction.
  • the audio features include spectral structure.
  • the determining unit 1120 is specifically configured to: determine whether the sound source position of the voice command is located inside the vehicle according to the characteristics of the high-frequency part in the spectrum structure of the voice command; and when the integrity of the high-frequency part meets the preset conditions, determine that the sound source position is located inside the vehicle.
  • the determining unit 1120 is specifically used to:
  • Whether the sound source position is located inside the vehicle is determined based on the pickup intensity of the voice command by the vehicle's microphone.
  • the determining unit 1120 is specifically used to:
  • Whether the sound source position is located inside or outside the vehicle is determined based on the orientation.
  • the determining unit 1120 determines whether the sound source position is located inside the vehicle through at least one of the following information:
  • the driving speed of the vehicle
  • the voice instruction includes a wake-up instruction
  • the control unit 1130 is specifically configured to:
  • control unit 1130 is specifically used to:
  • the vehicle is controlled according to the control instructions.
  • control unit 1130 when the sound source is located outside the vehicle, the control unit 1130 is also used to:
  • the vehicle is controlled not to execute the voice command.
  • control unit 1130 when the sound source position is located outside the vehicle, if it is determined that the preset trigger condition is met, the control unit 1130 is also used to:
  • the vehicle is controlled to perform operations corresponding to the voice instructions.
  • control device also includes:
  • the sending unit 1140 is configured to send query information on whether to execute the voice command when the sound source is located outside the vehicle.
  • control device 1100 may correspond to the vehicle or the device (such as a cloud server) that controls the vehicle in the methods 400, 700, and 900 according to the embodiments of the present application, or a chip configured in the vehicle or the vehicle control device.
  • the control device 1100 may include various units for executing the methods performed by the vehicle in FIGS. 4 , 7 and 9 .
  • each unit in the control device 1100, together with the above-mentioned other operations and/or functions, is intended to implement the corresponding processes of the method 400, the method 700 and the method 900.
  • the specific process by which each unit performs the above-mentioned corresponding steps has been explained in detail in the method 400, the method 700 and the method 900, and is not repeated here for the sake of brevity.
  • FIG. 12 is a schematic structural diagram of a vehicle control device 1200 provided by an embodiment of the present application.
  • the control device 1200 may be a vehicle or a vehicle control device, or may be a chip or circuit provided in the vehicle or a vehicle control device.
  • the control device 1200 includes: a processor 1210, a memory 1220 and a communication interface 1230. Instructions are stored in the memory 1220, and the processor 1210 is used to execute the instructions in the memory 1220. When the instructions are executed, the processor 1210 is used to execute the method provided by the above method embodiment, and the processor 1210 is also used to control the communication interface 1230 to communicate with the outside world.
  • the processor 1210, the memory 1220 and the communication interface 1230 can communicate with each other through internal connection channels to transmit control and/or data signals.
  • the memory 1220 can be integrated in the processor 1210 or can be provided separately from the processor 1210 .
  • the control device 1200 can be used to execute each step of the methods 400, 700 and 900 in FIG. 4, FIG. 7 and FIG. 9, and may include modules for the methods executed by the vehicle in the method 400, the method 700 and the method 900 in FIG. 4, FIG. 7 and FIG. 9. Moreover, each module in the control device 1200, together with the above-mentioned other operations and/or functions, is intended to implement the corresponding processes of the method 400, the method 700 and the method 900 in FIG. 4, FIG. 7 and FIG. 9 respectively. The specific process by which each module performs the above corresponding steps has been described in detail in the method 400, the method 700 and the method 900, and is not described again for the sake of brevity.
  • Embodiments of the present application also provide a computer-readable storage medium.
  • the computer-readable storage medium includes a computer program that, when run on a computer, causes the computer to execute the method provided by the above method embodiment.
  • Embodiments of the present application also provide a computer program product containing instructions, which when the computer program product is run on a computer, causes the computer to execute the method provided by the above method embodiment.
  • Embodiments of the present application also provide a chip system, including a memory and a processor.
  • the memory is used to store a computer program.
  • the processor is used to call and run the computer program from the memory, so that the network device installed with the chip system executes the method provided by the above method embodiment.
  • the chip system may include an input circuit or interface for sending information or data, and an output circuit or interface for receiving information or data.
  • the processor may be a central processing unit (CPU).
  • the processor may also be another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), or a field-programmable gate array (FPGA).
  • a general-purpose processor may be a microprocessor or the processor may be any conventional processor, etc.
  • the memory may be volatile memory or non-volatile memory, or may include both volatile and non-volatile memory.
  • non-volatile memory can be read-only memory (ROM), programmable ROM (PROM), erasable programmable read-only memory (erasable PROM, EPROM), electrically erasable programmable read-only memory (electrically EPROM, EEPROM) or flash memory.
  • Volatile memory can be random access memory (RAM), which is used as an external cache.
  • examples of RAM include static random access memory (static RAM, SRAM), dynamic random access memory (DRAM), synchronous dynamic random access memory (synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (double data rate SDRAM, DDR SDRAM), enhanced synchronous dynamic random access memory (enhanced SDRAM, ESDRAM), synchronous link dynamic random access memory (synchlink DRAM, SLDRAM), and direct rambus random access memory (direct rambus RAM, DR RAM).
  • the disclosed systems, devices and methods can be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of the units is only a logical function division. In actual implementation, there may be other division methods.
  • multiple units or components may be combined or can be integrated into another system, or some features can be ignored, or not implemented.
  • the coupling or direct coupling or communication connection between each other shown or discussed may be through some interfaces, and the indirect coupling or communication connection of the devices or units may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place, or they may be distributed to multiple network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
  • each functional unit in each embodiment of the present application can be integrated into one processing unit, each unit can exist physically alone, or two or more units can be integrated into one unit.
  • if the functions are implemented in the form of software functional units and sold or used as independent products, they can be stored in a computer-readable storage medium.
  • the technical solution of the present application, in essence, or the part that contributes to the existing technology, or a part of the technical solution, can be embodied in the form of a software product.
  • the computer software product is stored in a storage medium and includes several instructions used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in various embodiments of this application.
  • the aforementioned storage media include various media that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Human Computer Interaction (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)

Abstract

The present application relates to the technical field of intelligent terminals, and provides a vehicle control method and control apparatus. The control method comprises: obtaining a voice instruction for controlling a vehicle; determining whether a sound source in respect of the voice instruction is positioned inside the vehicle; and when the sound source is positioned inside the vehicle, controlling the vehicle to execute an operation corresponding to the voice instruction. According to the vehicle control method provided in embodiments of the present application, after the vehicle receives the voice instruction, whether a person giving the voice instruction is located inside the vehicle can be determined, so as to determine whether to execute a control operation corresponding to the voice instruction, such that control over the vehicle is more intelligent, and the use safety of the vehicle can be improved. According to the present application, a user in the vehicle can be helped to effectively avoid control and interference of strangers outside the vehicle on the vehicle, misoperations for the vehicle cannot be caused, and the safety of persons in the vehicle, especially in a rest scene, is further ensured.

Description

Vehicle control method and control device
This application claims priority to Chinese patent application No. 202211098498.0, filed with the State Intellectual Property Office on September 5, 2022 and entitled "Vehicle Control Method and Control Device", the entire content of which is incorporated into this application by reference.
Technical field
This application relates to the technical field of intelligent terminals, and in particular to a vehicle control method and control device.
Background
As the accuracy of speech recognition and the capability of semantic understanding continue to improve, and as connected vehicles become widespread, controlling vehicles through voice commands is increasingly common. This brings great convenience to users and improves the user experience.
Currently, when a vehicle is controlled by voice, the vehicle's voice collection system collects the voice command issued by the user and performs semantic recognition to obtain a voice control instruction. The voice control instruction is then sent to the control unit to control the vehicle to perform the corresponding action.
In this voice control process, whenever the voice collection system collects a voice command, it performs semantic recognition and executes the recognized control instruction. For example, if the occupants park the vehicle at the roadside and rest inside it, and a person outside the vehicle says "open the door", the vehicle opens the door directly in response to that voice command, even though the occupants do not want the door opened. Situations like this pose a serious risk to the personal and property safety of the vehicle's occupants.
Summary of the invention
This application provides a vehicle control method and control device that make control of the vehicle more intelligent and can improve the safety of using the vehicle.
In a first aspect, a vehicle control method is provided, including: obtaining a voice command for controlling a vehicle; determining whether the sound source position of the voice command is located inside the vehicle; and when the sound source position is located inside the vehicle, controlling the vehicle to perform the operation corresponding to the voice command.
According to the vehicle control method provided by the embodiments of this application, after a voice command for controlling the vehicle is received, the command is not executed directly. Instead, it is first determined whether the sound source position of the voice command is located inside the vehicle, and the control operation corresponding to the voice command is executed only when the sound source position is determined to be inside the vehicle. This makes control of the vehicle more intelligent, improves the safety of using the vehicle, and protects the personal and property safety of the passengers. This application can help users in the vehicle effectively avoid control of, and interference with, the vehicle by strangers outside it, prevents misoperation of the vehicle, and further ensures the safety of the occupants, especially when they are resting.
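The gating behavior of the first aspect can be illustrated with a minimal Python sketch. The names `VoiceCommand`, `handle_voice_command`, `is_source_inside` and `execute` are hypothetical and do not appear in the application; the localization step is passed in as a callable because the application leaves its implementation open.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoiceCommand:
    """A recognized voice command (hypothetical structure, for illustration)."""
    text: str            # e.g. "open the door"
    audio: list[float]   # raw samples captured by the microphone


def handle_voice_command(
    command: VoiceCommand,
    is_source_inside: Callable[[VoiceCommand], bool],
    execute: Callable[[str], None],
) -> bool:
    """Execute the command only if its sound source is judged to be inside.

    Returns True if the command was executed and False if it was ignored.
    """
    if is_source_inside(command):
        execute(command.text)
        return True
    return False
```

Keeping the localization decision separate from the execution step means that any of the determination methods described in this application could be plugged in without changing the control flow.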
Optionally, the voice command may be used for vehicle status control, media control, air conditioning control, seat control, and so on.
In some cases, the voice command may be a vehicle status control instruction such as opening/closing a window (including the sunroof), opening/closing a door (locking/unlocking a door), opening/closing the trunk, adjusting the ambient lighting, locking the vehicle, switching the ignition on/off, adjusting the vehicle speed, or adjusting the rearview mirrors. In some cases, the voice command may be a media control instruction such as opening navigation, adjusting the central control screen, playing music, tuning the radio, or answering/making a phone call. In some cases, the voice command may be an air conditioning control instruction such as raising the temperature, lowering the temperature, turning on internal circulation, or purifying the air in the vehicle. In some cases, the voice command may be a seat control instruction such as turning on seat heating, raising/lowering the seat, moving the seat forward/backward, or adjusting the seat back angle.
Optionally, the voice command may also be a wake-up command used to wake up the vehicle's voice assistant. The wake-up command may be a default command set by the manufacturer, for example "Xiaoyi Xiaoyi" (小艺小艺), which is used to wake up the intelligent voice assistant Xiaoyi, or it may be a wake-up command customized by the user according to personal preference. In some cases, the vehicle's voice assistant may also operate in a wake-up-free mode, in which the voice assistant remains awake at all times so that the vehicle can promptly obtain status control commands such as "open the door", which makes voice control of the vehicle more convenient.
Optionally, various data related to the vehicle may be obtained through sensors such as an in-vehicle camera, an exterior camera, a radar, a laser rangefinder, or a seat pressure sensor, and whether the sound source position of the voice command is located inside the vehicle may be determined based on one or more of these data. This application does not limit the specific implementation of determining whether the sound source position of the voice command is located inside the vehicle.
Optionally, whether the sound source position is located inside the vehicle may be determined from the acquired data in combination with a built-in algorithm.
For example, the above data may be input into a neural network model, which analyzes the data and determines whether the sound source position is located inside the vehicle. The neural network model is obtained through model training using historical data (training data) and a deep learning algorithm. The neural network model may be, for example, a convolutional neural network (CNN) model, and the deep learning algorithm may be, for example, a machine learning algorithm or a meta-learning algorithm.
It is worth noting that, in some cases, determining whether the sound source position of the voice command is located inside the vehicle does not require determining exactly where the sound source is; it is only necessary to determine whether it is inside the vehicle. The benefit is that less data is needed, so there is no need to install many additional sensors, or the sensors can operate less frequently. In addition, the calculation is simpler, which saves processor computing power and lowers the performance requirements on the processor.
Optionally, whether the sound source position of the voice command is located inside the vehicle may be determined based on one or more of the following data: audio feature data of the voice command collected by the microphone; pickup intensity data of the voice command collected by the microphone; in-vehicle image data collected by the in-vehicle camera; in-vehicle sound data collected by the microphone; seat pressure data collected by the seat sensors; in-vehicle vibration data collected by the vibration sensor; touch data of the in-vehicle touch screen collected by the touch sensor; vehicle speed data collected by the vehicle speed sensor; window open/closed state data detected by the glass position sensor; exterior image data collected by the exterior camera; and obstacle data outside the vehicle collected by the radar or laser rangefinder. The data are not limited to these; as vehicles become more intelligent and sensor technology develops further, the content and types of the data may expand.
In a possible implementation, determining whether the sound source position of the voice command is located inside the vehicle includes: determining, based on the audio features of the voice command, whether the sound source position is located inside the vehicle.
For example, certain feature parameters of the voice command's audio features may be compared with preset parameter thresholds, and whether the sound source position is located inside the vehicle is determined from the comparison result.
As another example, whether the sound source position is located inside the vehicle may be determined from the acquired audio features in combination with a built-in algorithm.
Optionally, the audio feature data of the voice command may be input into a neural network model, which analyzes the feature data and determines whether the sound source position is located inside the vehicle. The neural network model is obtained through model training using historical data (training data) and a deep learning algorithm. The neural network model may be, for example, a CNN model, and the deep learning algorithm may be, for example, a machine learning algorithm or a meta-learning algorithm.
Optionally, the audio features here may include any feature parameters such as spectral structure, energy, amplitude, frequency, and period, but are not limited to these.
In a possible implementation, the audio features include the spectral structure.
After sound passes through an obstacle such as window glass, part of the energy in the high-frequency band of its spectral structure is absorbed, so the high-frequency part is noticeably attenuated. By contrast, sound emitted inside the vehicle that has not passed through an obstacle retains a relatively complete high-frequency part. Under otherwise equal conditions, a voice command issued outside the vehicle and one issued inside the vehicle therefore differ in spectral structure, and this difference can be used to determine whether the sound source position is located inside the vehicle. The advantage of using the spectral structure is that the method is simple, efficient, and easy to implement, requires no other data, and can reduce how often the vehicle's other sensors are used.
In a possible implementation, determining whether the sound source position of the voice command is located inside the vehicle includes: determining, based on the features of the high-frequency part of the spectral structure of the voice command, whether the sound source position of the voice command is located inside the vehicle; and when the integrity of the high-frequency part satisfies a preset condition, determining that the sound source position is located inside the vehicle.
Because the high-frequency part of a sound is noticeably attenuated after the sound passes through an obstacle such as window glass, while sound emitted inside the vehicle retains a relatively complete high-frequency part, voice commands issued outside and inside the vehicle differ clearly, under otherwise equal conditions, in the integrity of the high-frequency part. The features of the high-frequency part can therefore be used to determine whether the sound source position of the voice command is located inside the vehicle: when the integrity of the high-frequency part satisfies the preset condition, the sound source position is determined to be inside the vehicle. The advantage of using the features of the high-frequency part of the spectral structure is that the method is simple, efficient, and easy to implement, requires no other data, and can reduce how often the vehicle's other sensors are used.
For example, the high-frequency part of the spectral structure may be input into a pre-trained neural network model (for example, a CNN model); the neural network model analyzes the spectral structure and outputs a determination of whether the sound source position is located inside the vehicle.
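As a simpler alternative to a neural network, the "integrity of the high-frequency part" could be approximated by the share of spectral energy above a cutoff frequency and compared with a threshold. The sketch below is only an illustration under stated assumptions: the function names, the 4000 Hz cutoff and the 0.1 threshold are hypothetical and are not taken from the application, and a naive discrete Fourier transform is used so that the example needs only the Python standard library.

```python
import cmath
import math


def high_frequency_ratio(samples, sample_rate, cutoff_hz):
    """Fraction of spectral energy at or above cutoff_hz (naive DFT, O(n^2))."""
    n = len(samples)
    total = 0.0
    high = 0.0
    for k in range(1, n // 2 + 1):  # skip the DC bin
        coeff = sum(
            samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)
        )
        energy = abs(coeff) ** 2
        total += energy
        if k * sample_rate / n >= cutoff_hz:
            high += energy
    return high / total if total > 0 else 0.0


def is_inside_by_spectrum(samples, sample_rate, cutoff_hz=4000.0, min_ratio=0.1):
    """Treat a well-preserved high-frequency part as evidence of an in-cabin source.

    cutoff_hz and min_ratio are illustrative values, not calibrated thresholds.
    """
    return high_frequency_ratio(samples, sample_rate, cutoff_hz) >= min_ratio
```

In practice such thresholds would have to be calibrated per vehicle and per microphone, and a production system would use an FFT rather than the quadratic-time transform shown here.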
In a possible implementation, determining whether the sound source position of the voice command is located inside the vehicle includes: determining, based on the pickup intensity of the voice command at the vehicle's microphone, whether the sound source position is located inside the vehicle.
The closer a microphone is to the sound source of the voice command, the stronger its pickup intensity for that command, that is, the louder the command as picked up. The pickup intensity therefore reflects the distance to the sound source and can be used to determine whether the sound source position is located inside the vehicle. For example, the specific location or the specific orientation of the sound source may be determined from the microphone's pickup intensity, and it may then be further determined whether the sound source position is located inside the vehicle.
Optionally, the vehicle may be equipped with only one microphone, in which case whether the sound source position is located inside the vehicle may be determined from that single microphone's pickup intensity for the voice command.
Optionally, the vehicle may be equipped with multiple microphones, in which case whether the sound source position is located inside the vehicle may be determined from each microphone's pickup intensity for the voice command in combination with a preset algorithm (for example, a trained neural network model). The microphones may all be installed inside the vehicle, for example grouped as a microphone array near the center console, or distributed among the seats. The microphones may also include at least one exterior microphone for picking up speech outside the vehicle, so that the user can interact with the vehicle by voice from outside and obtain a better experience.
Optionally, when the vehicle is equipped with multiple in-vehicle microphones, whether the sound source position is located inside the vehicle may be determined directly from each in-vehicle microphone's pickup intensity for the voice command in combination with a preset algorithm. Alternatively, information such as the specific location of the sound source (that is, its coordinates), its orientation, and its distance from each microphone may be determined first, and whether the sound source position is located inside the vehicle is then determined indirectly from that information.
In a possible implementation, determining, based on the pickup intensity of the voice command at the vehicle's microphone, whether the sound source position is located inside the vehicle includes: determining the orientation of the sound source position based on the pickup intensities of the voice command at multiple microphones; and determining, based on the orientation, whether the sound source position is located inside the vehicle.
In some cases, for example when there are too few microphones or the position of the sound source relative to the microphones is unfavorable, the specific position of the sound source may not be accurately determinable from the pickup intensity, or determining it may require considerable processor computing power. In such cases, the orientation of the sound source position may be determined from the pickup intensities of the voice command at multiple microphones, and, with the help of a preset algorithm or logic, this orientation is used to determine whether the sound source position is located inside the vehicle.
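One very rough way to turn per-microphone pickup intensities into an orientation is to take the direction of the intensity-weighted centroid of the microphone positions. The sketch below assumes this heuristic purely for illustration; the application does not specify the algorithm, and the coordinate convention (origin at the cabin center, +x toward the front, +y toward the left) and the function names are hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square level of one microphone's samples (its pickup intensity)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def estimate_bearing(mic_positions, mic_levels):
    """Estimate the bearing of the sound source in degrees.

    mic_positions: (x, y) of each microphone in a cabin frame whose origin is
                   the cabin center, with +x toward the front and +y toward
                   the left (a hypothetical convention).
    mic_levels:    pickup intensity of each microphone, e.g. from rms().

    Returns the angle of the intensity-weighted centroid of the microphones,
    in degrees counterclockwise from the front, or None if it is undefined.
    """
    total = sum(mic_levels)
    if total <= 0:
        return None
    cx = sum(x * w for (x, _), w in zip(mic_positions, mic_levels)) / total
    cy = sum(y * w for (_, y), w in zip(mic_positions, mic_levels)) / total
    if math.hypot(cx, cy) < 1e-9:
        return None  # levels are balanced: no preferred direction
    return math.degrees(math.atan2(cy, cx))
```

With one microphone per seat, a command that is loudest at the front-left microphone yields a bearing pointing toward the front left. A real system would more likely rely on time differences of arrival or a trained model, which this sketch does not attempt.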
Optionally, after the orientation of the sound source position is determined, if it is further determined that there is no seat inside the vehicle at that orientation, the sound source position can be determined to be outside the vehicle. For example, if the microphone is installed on the center console and the orientation points ahead of the front of the vehicle, the cabin space at that orientation has no seat and cannot accommodate a passenger, so the sound source position can be determined to be outside the vehicle.
Optionally, after the orientation of the sound source position is determined, it is further determined whether the window at that orientation is closed. For example, if the window provides strong sound insulation and is currently fully closed, the sound source position can be determined to be inside the vehicle.
Optionally, after the orientation of the sound source position is determined, the distribution of people inside and outside the vehicle at that orientation is further determined, and whether the sound source position is located inside the vehicle is determined in combination with that distribution. For example, if no one is inside the vehicle at that orientation, the sound source position can be determined to be outside the vehicle; if no one is outside the vehicle at that orientation, the sound source position can be determined to be inside the vehicle; and if there are people both inside and outside the vehicle at that orientation, other sensor data can be used to determine whether the sound source position is located inside the vehicle.
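The occupancy rules in the last example can be written as a small decision function. This is a sketch with hypothetical names (`Location`, `locate_by_occupancy`); the case in which nobody is detected on either side is not covered by the text and is treated here as undetermined, which is an assumption.

```python
from enum import Enum


class Location(Enum):
    INSIDE = "inside"
    OUTSIDE = "outside"
    UNDETERMINED = "undetermined"  # fall back to other sensor data


def locate_by_occupancy(person_inside: bool, person_outside: bool) -> Location:
    """Decide from who is present along the estimated orientation.

    person_inside:  a person is detected inside the vehicle at that orientation.
    person_outside: a person is detected outside the vehicle at that orientation.
    """
    if person_inside and not person_outside:
        return Location.INSIDE
    if person_outside and not person_inside:
        return Location.OUTSIDE
    # Both present (per the text) or neither detected (an assumption):
    # the orientation alone cannot settle the question.
    return Location.UNDETERMINED
```

Returning an explicit undetermined value lets the caller decide which additional data, such as the spectral structure or the window state, to consult next.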
In a possible implementation, whether the sound source position is inside the vehicle is determined from at least one of the following: an audio feature of the voice command; the pickup intensity of the voice command at the vehicle's microphone; the distribution of people inside the vehicle; the distribution of people within a preset range outside the vehicle; the driving speed of the vehicle; and the open or closed state of the vehicle's windows.
In some implementations, multiple factors may be combined to determine whether the sound source position of the voice command is inside the vehicle. In some implementations, the aforementioned spectral structure of the voice command may be combined with the pickup intensity. For example, if the spectral structure shows that the high-frequency part is poorly preserved and the pickup intensity of the voice command is also weak, the two together determine that the sound source position is outside the vehicle. As another example, priorities may be assigned to the two factors, and when the results determined by the two factors differ, the result of the higher-priority factor prevails. Other combinations of multiple factors may follow the examples given here.
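The priority-based combination described above might look like the following minimal sketch. The function name `combine_factors`, the factor names, and the use of `None` for "no verdict" are assumptions made for illustration and are not taken from the application.

```python
from typing import Dict, List, Optional


def combine_factors(verdicts: Dict[str, Optional[bool]],
                    priority: List[str]) -> Optional[bool]:
    """Combine per-factor verdicts: True = inside, False = outside, None = no verdict.

    `priority` lists factor names from highest to lowest priority.
    """
    decided = {name: v for name, v in verdicts.items() if v is not None}
    if not decided:
        return None  # no factor could decide
    if len(set(decided.values())) == 1:
        return next(iter(decided.values()))  # all deciding factors agree
    # Factors disagree: the highest-priority deciding factor prevails.
    for name in priority:
        if name in decided:
            return decided[name]
    return None  # disagreement among factors that have no assigned priority
```

For instance, with the spectrum ranked above the pickup intensity, a well-preserved high-frequency part would outweigh a weak pickup intensity.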
In some cases, whether the sound source position is inside the vehicle may be determined from the distribution of people inside the vehicle. Relevant data can be obtained from detection devices such as in-vehicle cameras, seat pressure sensors, touch sensors, vibration sensors, magnetometers and infrared sensors, and used to determine occupant distribution information such as whether there are occupants in the vehicle, their bearing or specific position, and the specific seats they occupy. Whether the sound source position is inside the vehicle is then determined from this occupant distribution information.
Optionally, if the occupant distribution information indicates that there is currently no driver or passenger in the vehicle, it can be determined that the sound source position is outside the vehicle rather than inside.
Optionally, if the occupant distribution information indicates that there is currently a driver or passenger in the vehicle, other information (for example, any one or more of the aforementioned types of data) may be used to jointly determine whether the sound source position is inside the vehicle.
For example, the determination may further draw on the audio feature data of the voice command collected by the microphone, or on the pickup intensity data of the voice command collected by the microphone, as further described below.
In some cases, whether the sound source position is inside the vehicle may be determined from the distribution of people outside the vehicle. Relevant data can be obtained from detection devices such as exterior cameras, radar and laser rangefinders, and used to determine distribution information such as whether anyone is within a preset range near the outside of the vehicle and the bearing or specific position of any such person. Whether the sound source position is inside the vehicle is then determined from this distribution information.
The preset range may be determined from the pickup capability of the microphones fitted to the vehicle: the stronger the pickup capability, the more likely it is that voice commands from people farther away are picked up, and so the larger the preset range should be. The manufacturer may set the preset range appropriately before the vehicle leaves the factory, based on the specific microphone configuration.
Optionally, if the distribution information indicates that no one is currently within the preset range outside the vehicle, it can be determined that the sound source position is inside the vehicle.
Optionally, if the distribution information indicates that someone is currently within the preset range outside the vehicle, other information (for example, any one or more of the aforementioned types of data) may be used to jointly determine whether the sound source position is inside the vehicle.
In some cases, whether the sound source position is inside the vehicle may be determined from the vehicle's driving speed. Once the driving speed reaches a certain value, for example 80 kilometers per hour (km/h) or more, speech from outside the vehicle may not be picked up even with the windows wide open. Therefore, if the microphone picks up a voice command for controlling the vehicle at that time, it can be determined that the sound source position of the voice command is inside the vehicle.
Optionally, when the driving speed of the vehicle is greater than or equal to a preset speed threshold, it can be determined that the sound source position of the voice command is inside the vehicle. The speed threshold may be 60 to 100 km/h, for example 70 km/h or 80 km/h, and the manufacturer may set it appropriately before the vehicle leaves the factory, based on the pickup capability of the microphones.
Optionally, when the driving speed of the vehicle is below the speed threshold, other information (for example, any one or more of the aforementioned types of data) may be used to jointly determine whether the sound source position is inside the vehicle.
For example, when the driving speed is below the speed threshold but above a preset lower limit (for example 25 km/h) and all windows of the vehicle are closed, the relatively high speed together with the blocking effect of the glass means that the in-vehicle microphone may be unable to pick up voice commands from outside the vehicle. It can therefore be determined that the sound source position of the voice command is inside the vehicle.
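The speed-based rules above can be sketched as follows. The constant and function names are hypothetical; the value 80 km/h is one of the example thresholds in the text, and 25 km/h is the example lower limit. Returning `None` to mean "defer to other information" is an assumption of this sketch.

```python
from typing import Optional

SPEED_THRESHOLD_KMH = 80.0  # example value; the text allows 60 to 100 km/h
LOWER_LIMIT_KMH = 25.0      # example lower limit from the text


def inside_by_speed(speed_kmh: float, all_windows_closed: bool) -> Optional[bool]:
    """Return True if speed (and window state) imply an in-vehicle source.

    Returns None when this factor alone cannot decide.
    """
    if speed_kmh >= SPEED_THRESHOLD_KMH:
        return True  # outside speech is unlikely to be picked up at this speed
    if speed_kmh > LOWER_LIMIT_KMH and all_windows_closed:
        return True  # moderate speed plus closed glass blocks outside speech
    return None      # fall back to other information
```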
In some cases, whether the sound source position is inside the vehicle may be determined from the open or closed state of the vehicle's windows. Data from window position sensors can be used to determine the current state of each window, for example fully open, half open or fully closed. Window glass provides a degree of sound insulation: when the windows are closed, the in-vehicle microphone may be unable to accurately pick up voice commands from outside the vehicle, whereas when a window is open it can. Whether the sound source position is inside the vehicle can therefore be determined from the state of the windows.
Optionally, when all windows of the vehicle are fully closed, which indicates that the microphone cannot receive speech from outside the vehicle, it can be determined that the sound source position of the voice command is inside the vehicle. The vehicle may be, for example, a high-end sedan with particularly good sound insulation.
In a possible implementation, the voice command includes a wake-up command, and controlling the vehicle to perform the operation corresponding to the voice command includes: waking up a voice assistant of the vehicle.
In a possible implementation, controlling the vehicle to perform the operation corresponding to the voice command includes: performing semantic recognition on the voice command to obtain a control instruction; and controlling the vehicle according to the control instruction.
That is, the voice command is semantically parsed to generate the corresponding control instruction only when the sound source position is determined to be inside the vehicle. When the sound source position is determined not to be inside the vehicle (that is, to be outside the vehicle), the voice command need not be semantically parsed, which saves processor computing power.
In a possible implementation, the method further includes: when the sound source position is outside the vehicle, controlling the vehicle not to execute the voice command.
When the sound source position is determined not to be inside the vehicle, that is, to be outside the vehicle, the voice command may have been spoken by someone outside the vehicle (for example a stranger, a passer-by or a wrongdoer) and happened to be picked up by the microphone. In that case the voice command need not be semantically recognized or executed, that is, the vehicle does not respond to it. This improves the security of vehicle control and protects the personal and property safety of the occupants.
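The gating described above, in which semantic parsing and execution happen only for in-vehicle sound sources, can be sketched as below. The function `handle_voice_command` and its `parse` and `execute` callbacks are hypothetical placeholders for the vehicle's semantic recognition and control stages; they are not interfaces defined by the application.

```python
from typing import Any, Callable


def handle_voice_command(utterance: str,
                         source_inside: bool,
                         parse: Callable[[str], Any],
                         execute: Callable[[Any], None]) -> bool:
    """Parse and execute a voice command only if it came from inside the vehicle.

    Returns True if the command was executed, False if it was ignored.
    """
    if not source_inside:
        # Outside source: skip semantic parsing entirely, saving compute,
        # and do not respond to the command.
        return False
    control_instruction = parse(utterance)  # semantic recognition
    execute(control_instruction)            # control the vehicle
    return True
```

Checking the source location before calling `parse` is what yields the saving in computing power noted above, since the parser never runs for commands that will be ignored.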
In a possible implementation, the method further includes: when the sound source position is outside the vehicle, if a preset trigger condition is determined to be met, controlling the vehicle to perform the operation corresponding to the voice command.
That is, if the sound source position is determined to be outside the vehicle, a further determination may be made, and whether to perform the operation corresponding to the control instruction is decided based on its result, rather than simply not executing the voice command as in the preceding embodiment. Specifically, if the trigger condition is met, the vehicle is controlled to perform the operation corresponding to the control instruction; if the trigger condition is not met, the voice command is not executed. This arrangement improves the security of vehicle voice control while also increasing control flexibility, which helps improve the user experience.
Optionally, the trigger condition may be preset by the manufacturer or set by the user. For example, the user may set conditions according to the scenarios in which they might voice-control the vehicle from outside.
Optionally, the trigger condition may be that the vehicle's electronic key is near the vehicle, that the owner is determined to be near the vehicle through the positioning function of a mobile terminal such as a mobile phone, that the current time falls within a special time period (for example commuting hours), that the vehicle has entered a specific mode (for example a camping mode), and so on. This application does not limit this.
Taking as an example the trigger condition that the vehicle's electronic key is near the vehicle: if the vehicle can establish a Bluetooth connection with the electronic key, it can be determined that the electronic key is near the vehicle, meaning that the vehicle's owner may also be nearby. The trigger condition is then met, and the vehicle can be controlled to perform the operation corresponding to the control instruction.
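A configurable set of trigger conditions could be sketched as a list of predicates, as below. All names here (`TriggerContext`, `key_nearby`, `in_camping_mode`, `may_execute`) are hypothetical, and only two of the example conditions from the text are modeled.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass
class TriggerContext:
    key_bluetooth_connected: bool = False  # electronic key reachable over Bluetooth
    camping_mode: bool = False             # vehicle is in a specific mode


def key_nearby(ctx: TriggerContext) -> bool:
    return ctx.key_bluetooth_connected


def in_camping_mode(ctx: TriggerContext) -> bool:
    return ctx.camping_mode


def may_execute(source_inside: bool,
                ctx: TriggerContext,
                triggers: Iterable[Callable[[TriggerContext], bool]]) -> bool:
    """Decide whether a voice command may be executed."""
    if source_inside:
        return True  # in-vehicle commands are executed as usual
    # Outside source: execute only if some enabled trigger condition is met.
    return any(trigger(ctx) for trigger in triggers)
```

Passing an empty `triggers` list reproduces the stricter embodiment in which commands from outside the vehicle are never executed.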
In a possible implementation, the method further includes: when the sound source position is outside the vehicle, sending inquiry information asking whether to execute the voice command.
In this embodiment of the application, when the sound source position of the voice command is determined to be outside the vehicle, inquiry information asking whether to execute the voice command is sent, rather than simply not executing the voice command as in the preceding embodiment. The control method provided by this embodiment can thus both improve the security of vehicle voice control and increase control flexibility, which helps improve the user experience.
Optionally, the inquiry information may be sound information, image information, text information, or the like.
For example, the inquiry information may be a spoken inquiry, issued through the vehicle's speakers, asking whether to execute the voice command, or an image inquiry displayed on the central control screen, instrument screen or head-up display (HUD) inside the vehicle. This application does not limit this.
In a second aspect, a vehicle control apparatus is provided, including: an acquisition unit, configured to acquire a voice command for controlling a vehicle; a determining unit, configured to determine whether the sound source position of the voice command is inside the vehicle; and a control unit, configured to control the vehicle to perform the operation corresponding to the voice command when the sound source position is inside the vehicle.
In a possible implementation, the determining unit is specifically configured to: determine, according to an audio feature of the voice command, whether the sound source position is inside the vehicle.
In a possible implementation, the audio feature includes a spectral structure.
In a possible implementation, the determining unit is specifically configured to: determine, according to a feature of the high-frequency part of the spectral structure of the voice command, whether the sound source position of the voice command is inside the vehicle; and when the integrity of the high-frequency part meets a preset condition, determine that the sound source position is inside the vehicle.
In a possible implementation, the determining unit is specifically configured to: determine, according to the pickup intensity of the voice command at the vehicle's microphone, whether the sound source position is inside the vehicle.
In a possible implementation, the determining unit is specifically configured to: determine the bearing of the sound source position according to the pickup intensities of the voice command at multiple microphones; and determine, according to the bearing, whether the sound source position is inside or outside the vehicle.
In a possible implementation, the determining unit determines whether the sound source position is inside the vehicle from at least one of the following: an audio feature of the voice command; the pickup intensity of the voice command at the vehicle's microphone; the distribution of people inside the vehicle; the distribution of people within a preset range outside the vehicle; the driving speed of the vehicle; and the open or closed state of the vehicle's windows.
In a possible implementation, the voice command includes a wake-up command, and the control unit is specifically configured to: wake up a voice assistant of the vehicle.
In a possible implementation, the control unit is specifically configured to: perform semantic parsing on the voice command to obtain a control instruction; and control the vehicle according to the control instruction.
In a possible implementation, when the sound source position is outside the vehicle, the control unit is further configured to: control the vehicle not to execute the voice command.
In a possible implementation, when the sound source position is outside the vehicle, if a preset trigger condition is determined to be met, the control unit is further configured to: control the vehicle to perform the operation corresponding to the voice command.
In a possible implementation, the control apparatus further includes: a sending unit, configured to send inquiry information asking whether to execute the voice command when the sound source position is outside the vehicle.
In a third aspect, a vehicle control apparatus is provided, including at least one processor configured to be coupled to a memory and to read and execute instructions in the memory, so as to implement the method provided by any possible implementation of the first aspect.
Optionally, the control apparatus further includes the memory.
In a fourth aspect, a computer-readable storage medium is provided. The computer-readable storage medium stores a computer program which, when run on a computer, implements the method provided by any possible implementation of the first aspect.
In a fifth aspect, a chip system is provided, including a processor configured to call and run a computer program from a memory, so that a computer in which the chip system is installed performs the method provided by any possible implementation of the first aspect.
In a sixth aspect, a computer program product is provided. The computer program product includes computer program code which, when run on a computer, causes the computer to perform the method provided by any implementation of the first aspect.
It should be noted that the computer program code may be stored in whole or in part on a first storage medium, which may be packaged together with the processor or packaged separately from it. This application does not specifically limit this.
In a seventh aspect, a vehicle is provided, including the vehicle control apparatus provided by any possible implementation of the second aspect or the third aspect.
It can be understood that the vehicle control apparatuses provided in the second and third aspects, the computer-readable storage medium provided in the fourth aspect, the chip system provided in the fifth aspect, the computer program product provided in the sixth aspect, and the vehicle provided in the seventh aspect are all used to perform the control method provided in the first aspect. For the beneficial effects they can achieve, refer to the beneficial effects of the corresponding method described above; details are not repeated here.
Brief Description of the Drawings
FIG. 1 is a schematic diagram of an application scenario provided by an embodiment of this application.
FIG. 2 is a logic block diagram of current voice control of a vehicle.
FIG. 3 is a functional block diagram of a vehicle provided by an embodiment of this application.
FIG. 4 is a schematic flowchart of an example of a vehicle control method provided by an embodiment of this application.
FIG. 5 is a schematic diagram of an application scenario of the control method shown in FIG. 4.
FIG. 6 is a schematic diagram of the propagation process of a control instruction.
FIG. 7 is a schematic flowchart of another example of a vehicle control method provided by an embodiment of this application.
FIG. 8 is a schematic diagram of an application scenario of the control method shown in FIG. 7.
FIG. 9 is a schematic flowchart of yet another example of a vehicle control method provided by an embodiment of this application.
FIG. 10 is a schematic diagram of an application scenario of the control method shown in FIG. 9.
FIG. 11 is a schematic block diagram of a vehicle control apparatus provided by an embodiment of this application.
FIG. 12 is a schematic structural diagram of a vehicle control apparatus provided by an embodiment of this application.
Detailed Description
The technical solutions in this application are described below with reference to the accompanying drawings. Obviously, the described embodiments are only some, not all, of the embodiments of this application.
In the following description, for purposes of explanation rather than limitation, specific details such as particular system structures and technologies are set forth to provide a thorough understanding of the embodiments of this application. However, it will be clear to those skilled in the art that this application may also be practiced in other embodiments without these specific details. In other instances, detailed descriptions of well-known systems, apparatuses, circuits and methods are omitted so that unnecessary detail does not obscure the description of this application.
As used herein, the term "including" indicates the presence of the described features, integers, steps, operations, elements and/or components, but does not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or sets thereof. The terms "including", "comprising", "having" and their variants all mean "including but not limited to", unless specifically emphasized otherwise.
Hereinafter, the terms "first" and "second" are used for descriptive purposes only and are not to be understood as indicating or implying relative importance or implicitly indicating the number of the technical features referred to. A feature qualified by "first" or "second" may therefore explicitly or implicitly include one or more such features. In the description of the embodiments of this application, unless otherwise specified, "multiple" means two or more.
The term "and/or" herein merely describes an association between associated objects and indicates that three relationships are possible. For example, "A and/or B" may mean: A alone, both A and B, or B alone. In addition, the character "/" herein generally indicates an "or" relationship between the objects before and after it.
FIG. 1 is a schematic diagram of an application scenario provided by an embodiment of this application. As shown in FIG. 1, the application scenario includes a vehicle 11 with a driver 12 inside. If the voice control function of the vehicle 11 is enabled, the occupants, including the driver 12, can control the vehicle through voice commands. For example, while driving, the driver 12 can set the navigation or change the FM radio station by voice command, without having to look at the center console and operate it by hand. This frees the driver's hands and, because the driver does not need to look at the center console, avoids distraction while driving, which helps improve driving safety.
FIG. 2 is a logic block diagram of current voice control of a vehicle. As shown in FIG. 2, in the current vehicle voice control process, the vehicle's voice collection system 21 collects the voice command issued by the user, performs semantic recognition to obtain a voice control instruction, and sends the voice control instruction to the control unit 22. Based on the voice control instruction, the control unit 22 forms a control instruction that the control object 23 can recognize, and controls the control object 23 according to that control instruction so that the vehicle performs the corresponding action.
However, the inventors of this application found in practice that, in some situations, a voice command picked up by the vehicle's voice collection system should not be executed directly. If it is executed directly, it may cause the vehicle to operate incorrectly, putting the personal and property safety of the occupants at risk.
For example, as shown in FIG. 1, suppose the occupants of the vehicle 11 have parked at the roadside and are resting in the vehicle. If a person 13 outside the vehicle then shouts the voice command "open the door", the vehicle 11 directly opens the door in response. This gives the person 13 outside the vehicle an opening to exploit and puts the personal and property safety of the occupants at risk.
As another example, as shown in FIG. 1, suppose the vehicle 11 is moving slowly on a crowded city road. If a person 13 outside the vehicle then shouts the voice command "open the door", the vehicle 11 directly opens the door in response. Because the door has been opened (unlocked), a child sitting in the back seat may be at risk of falling out of the vehicle.
In summary, with the current vehicle voice control method, the voice commands acquired by the vehicle are not distinguished: commands are executed whether they come from inside or outside the vehicle. This may cause the vehicle to operate incorrectly and put the personal and property safety of the occupants at risk.
In view of the above problem, an embodiment of this application provides a vehicle control method in which, after the vehicle receives a voice command, it can determine whether the person who issued the command is inside the vehicle and decide accordingly whether to perform the control operation corresponding to the command. This makes control of the vehicle more intelligent and improves the safety of vehicle use. This application can help users inside the vehicle effectively prevent strangers outside the vehicle from controlling or interfering with the vehicle, avoids incorrect operation of the vehicle, and further protects the safety of the people in the vehicle, especially when they are resting.
本申请实施例提供的车辆的控制方法应用在车辆中,或者应用于具有控制车辆的功能的其他设备(例如云端服务器、手机终端等)中。其中,所述车辆具体可以为内燃机车、智能电动车或者混合动力车,或者,该车辆也可以为其他动力类型的车辆等,本申请实施例对此并不限定。The vehicle control method provided by the embodiment of the present application is applied in the vehicle, or in other devices with the function of controlling the vehicle (such as cloud servers, mobile phone terminals, etc.). The vehicle may specifically be an internal combustion locomotive, an intelligent electric vehicle or a hybrid vehicle, or the vehicle may also be a vehicle of other power types, which is not limited in the embodiments of the present application.
所述车辆可以为自动驾驶车辆,该自动驾驶车辆可以是具备部分自动驾驶功能的车辆,也可 以是具备全部自动驾驶功能的车辆,也就是说,该车辆的自动驾驶的等级可以参照美国汽车工程师协会(society of automotive engineers,SAE)的分类标准,划分为无自动化(L0)、驾驶支援(L1)、部分自动化(L2)、有条件自动化(L3)、高度自动化(L4)或者完全自动化(L5)。车辆或者其他设备(例如云端服务器、手机终端等)可通过其包含的组件(包括硬件和软件)实施本申请实施例提供的车辆控制方法,判断用于控制车辆的语音指令的声源位置是否位于车辆内部,即判断说出该语音指令的人员是位于车辆内部还是外部,并根据判断结果决定是否执行该语音指令对应的控制操作,从而使得对车辆的控制更加智能化,能够提高车辆的使用安全性,并使得车内乘客的人身和财产安全得以保障。The vehicle may be an autonomous vehicle, which may be a vehicle with partial autonomous driving capabilities, or may be Therefore, a vehicle with all autonomous driving functions, that is to say, the vehicle’s autonomous driving level can refer to the classification standards of the Society of Automotive Engineers (SAE), which is divided into no automation (L0), driving support ( L1), partial automation (L2), conditional automation (L3), high automation (L4) or complete automation (L5). A vehicle or other device (such as a cloud server, a mobile phone terminal, etc.) can implement the vehicle control method provided by the embodiments of the present application through its components (including hardware and software), and determine whether the sound source position of the voice command used to control the vehicle is located Inside the vehicle, that is, it determines whether the person who spoke the voice command is located inside or outside the vehicle, and decides whether to perform the control operation corresponding to the voice command based on the judgment result, thereby making the control of the vehicle more intelligent and improving the safety of the vehicle. safety, and ensure the personal and property safety of passengers in the vehicle.
As a specific example, the vehicle may be the vehicle 100 in FIG. 3. FIG. 3 is a functional block diagram of the vehicle 100 provided by an embodiment of the present application.
As shown in FIG. 3, the vehicle 100 may include various subsystems, such as a travel system 102, a sensor system 104, a control system 106, one or more peripheral devices 108, a power supply 110, a computer system 112, and a user interface 116. Optionally, the vehicle 100 may include more or fewer subsystems, and each subsystem may include multiple elements. In addition, the subsystems and elements of the vehicle 100 may be interconnected by wire or wirelessly.
The travel system 102 may include components that provide powered motion for the vehicle 100. In one embodiment, the travel system 102 may include an engine 118, an energy source 119, a transmission 120, and wheels 121. The engine 118 may be an internal combustion engine, an electric motor, an air compression engine, or a combination of engine types, for example a hybrid engine composed of a gasoline engine and an electric motor, or a hybrid engine composed of an internal combustion engine and an air compression engine. The engine 118 converts the energy source 119 into mechanical energy.
Examples of the energy source 119 include gasoline, diesel, other petroleum-based fuels, propane, other compressed-gas-based fuels, ethanol, solar panels, batteries, and other sources of electricity. The energy source 119 may also provide energy to other systems of the vehicle 100.
The transmission 120 may transmit mechanical power from the engine 118 to the wheels 121. The transmission 120 may include a gearbox, a differential, and a drive shaft.
In one embodiment, the transmission 120 may also include other components, such as a clutch. The drive shaft may include one or more axles that can be coupled to one or more of the wheels 121.
The sensor system 104 may include several sensors that sense information about the environment surrounding the vehicle 100.
For example, the sensor system 104 may include a positioning system 122 (which may be a GPS system, a BeiDou system, or another positioning system), an inertial measurement unit (IMU) 124, a radar 126, a laser rangefinder 128, and a camera 130.
The positioning system 122 may be used to estimate the geographic location of the vehicle 100. The IMU 124 is used to sense changes in the position and orientation of the vehicle 100 based on inertial acceleration. In one embodiment, the IMU 124 may be a combination of an accelerometer and a gyroscope. The radar 126 may use radio signals to sense objects in the surroundings of the vehicle 100. In some embodiments, in addition to sensing objects, the radar 126 may also be used to sense the speed and/or heading of an object; the radar 126 may be, for example, a millimeter-wave radar or a lidar. The laser rangefinder 128 may use laser light to sense objects in the environment in which the vehicle 100 is located. In some embodiments, the laser rangefinder 128 may include one or more laser sources, a laser scanner, and one or more detectors, among other system components. The camera 130 may be used to capture multiple images of the surroundings of the vehicle 100. The camera 130 may be a still camera or a video camera, and may also be a visual perception camera. The surroundings may be detected by the radar 126, the laser rangefinder 128, or the camera 130 to determine whether any person is present around the vehicle.
The sensor system 104 also includes sensors of the internal systems of the vehicle 100, including, for example, sensors of an advanced driving assistance system (ADAS). In the embodiments of the present application, the sensors of the internal systems of the vehicle 100 include a seat pressure sensor 1041, an in-vehicle camera 1042, a touch sensor 1043, a vibration sensor 1044, a vehicle speed sensor, and the like. The seat pressure sensor 1041 may be used to monitor pressure data on each seat in the vehicle. The in-vehicle camera 1042 may be used to capture multiple images of the occupants and of the in-vehicle environment. The touch sensor 1043 is used to monitor touch data on the display screen of the central control unit in the vehicle. The vibration sensor 1044 is used to capture vibration data occurring in the vehicle. The processor can determine, based on at least one of the pressure data, the images, the touch data, and the vibration data, whether there is an occupant in the vehicle and where the occupant is located. The vehicle speed sensor is used to monitor the speed data of the vehicle to determine whether the vehicle is stationary.
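As a non-limiting illustration of how occupancy might be derived from the in-vehicle sensor data described above, the following minimal sketch combines seat pressure readings with camera detections. The type `CabinReadings`, the functions `occupied_seats` and `is_stationary`, and both numeric thresholds are hypothetical and are not taken from the present application.

```python
from dataclasses import dataclass
from typing import Dict, Set

# Hypothetical threshold; the application gives no numeric values.
SEAT_PRESSURE_THRESHOLD_N = 200.0


@dataclass
class CabinReadings:
    seat_pressure_n: Dict[str, float]  # seat name -> force on the seat, in newtons
    camera_seats: Set[str]             # seats where the in-vehicle camera sees a person
    speed_kmh: float                   # reading from the vehicle speed sensor


def occupied_seats(r: CabinReadings) -> Set[str]:
    """Treat a seat as occupied if pressure OR the camera indicates a person
    (one reading of "at least one of" the sensor data)."""
    by_pressure = {
        seat for seat, force in r.seat_pressure_n.items()
        if force >= SEAT_PRESSURE_THRESHOLD_N
    }
    return by_pressure | r.camera_seats


def is_stationary(r: CabinReadings, eps_kmh: float = 0.5) -> bool:
    """Treat the vehicle as stationary below a small, assumed speed tolerance."""
    return abs(r.speed_kmh) < eps_kmh
```

Taking the union of the two sources favors detecting an occupant over missing one; an implementation that needs fewer false positives could instead require agreement between sensors.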
In addition, the sensors of the internal systems of the vehicle may also include an air quality sensor, a fuel gauge, an oil temperature gauge, and the like. Sensor data collected by one or more of these sensors can be used to detect objects and their corresponding characteristics (position, shape, temperature, speed, and so on). Such detection and identification is key to the safe operation of the vehicle 100 and to the safety of its occupants.
The control system 106 controls the operation of the vehicle 100 and its components. The control system 106 may include various elements, including a steering system 132, a throttle 134, a braking unit 136, a computer vision system 140, a route control system 142, and an obstacle avoidance system 144.
The steering system 132 is operable to adjust the heading of the vehicle 100. For example, in one embodiment it may be a steering wheel system.
The throttle 134 is used to control the operating speed of the engine 118 and thereby the speed of the vehicle 100.
The braking unit 136 is used to decelerate the vehicle 100. The braking unit 136 may use friction to slow the wheels 121. In other embodiments, the braking unit 136 may convert the kinetic energy of the wheels 121 into electric current. The braking unit 136 may also take other forms to reduce the rotational speed of the wheels 121 and thereby control the speed of the vehicle 100.
In the embodiments of the present application, the computer vision system 140 may process and analyze the images captured by the camera 130 and the in-vehicle camera 1042 to identify the distribution of people in the surroundings outside the vehicle 100 and the distribution of people inside the vehicle. The distribution of people outside the vehicle includes information such as whether anyone is within a preset range outside the vehicle and the position or bearing of each person outside the vehicle. The distribution of people inside the vehicle includes information such as whether anyone is in the vehicle, the position or bearing of each occupant, and which seat or seats the occupants are sitting in. The computer vision system 140 may use at least one of a human body recognition algorithm, a structure from motion (SFM) algorithm, video tracking, and other computer vision techniques to map the environment, track objects, estimate the speed of objects, and determine the current conditions inside and outside the vehicle.
The route control system 142 is used to determine the driving route of the vehicle 100. In some embodiments, the route control system 142 may combine sensor data from the sensor system 104 with data from one or more predetermined maps to determine the driving route for the vehicle 100.
The obstacle avoidance system 144 is used to identify, evaluate, and avoid or otherwise negotiate potential obstacles in the environment of the vehicle 100.
Of course, in one example, the control system 106 may additionally or alternatively include components other than those shown and described, or some of the components shown above may be omitted.
The vehicle 100 interacts with external sensors, other vehicles, other computer systems, or users through the peripheral devices 108. The peripheral devices 108 may include a wireless communication system 146, an onboard computer 148, a microphone 150, and a speaker 152.
In some embodiments, the peripheral devices 108 provide a means for a user of the vehicle 100 to interact with the user interface 116. For example, the onboard computer 148 may provide information to a user of the vehicle 100. The user interface 116 may also operate the onboard computer 148 to receive user input. The onboard computer 148 may be operated through a touch screen. In other cases, the peripheral devices 108 may provide a means for the vehicle 100 to communicate with other devices located in the vehicle. For example, the microphone 150 may receive audio (for example, voice commands or other audio input) from a user of the vehicle 100. Similarly, the speaker 152 may output audio to a user of the vehicle 100.
The microphone 150 can pick up sound inside the vehicle and can also pick up sound outside the vehicle. When a window is not closed or the sound outside the vehicle is loud, the microphone 150 can accurately pick up various sounds from outside the vehicle, including voice commands for controlling the vehicle.
The microphone 150 may be composed of multiple microphone units. The microphone units may all be arranged near the center console to form a microphone array, or may be distributed beside the individual seats in the vehicle. At least one of the microphone units may also be arranged on the outer wall of the vehicle body (outside the vehicle) to pick up sound outside the vehicle.
For example, when a user issues a voice command for voice control of the vehicle, whether the sound source position is located inside the vehicle can be determined from the intensity with which the microphone 150 picks up the voice command. Specifically, the bearing or position of the sound source can be determined from the intensities with which the multiple microphone units pick up the voice command, and whether the sound source position is located inside the vehicle can then be determined from that bearing or position.
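The intensity comparison described above can be sketched as follows. This is a minimal illustration only, assuming that the microphone unit nearest the sound source records the highest root-mean-square (RMS) level and that at least one unit is mounted on the outer wall of the vehicle body. The function names and the unit labels are hypothetical.

```python
import math
from typing import Dict, Sequence, Set


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square level of one audio frame."""
    return math.sqrt(sum(x * x for x in samples) / len(samples))


def loudest_unit(frames: Dict[str, Sequence[float]]) -> str:
    """Name of the microphone unit that picked up the command most strongly."""
    return max(frames, key=lambda name: rms(frames[name]))


def source_inside(frames: Dict[str, Sequence[float]], exterior_units: Set[str]) -> bool:
    """Assumption: the source is nearest the loudest unit, so a loudest
    exterior unit implies a source outside the vehicle."""
    return loudest_unit(frames) not in exterior_units
```

A loudest-unit rule ignores window state and cabin acoustics; it is shown only to make the idea of comparing pickup intensities concrete.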
The wireless communication system 146 may communicate wirelessly with one or more devices, either directly or via a communication network. For example, the wireless communication system 146 may use 3G cellular communication such as CDMA, EV-DO, or GSM/GPRS, 4G cellular communication such as LTE, or 5G cellular communication. The wireless communication system 146 may communicate with a wireless local area network (WLAN) using Wi-Fi. In some embodiments, the wireless communication system 146 may communicate directly with a device using an infrared link, Bluetooth, or ZigBee. Other wireless protocols may also be used, such as various vehicle communication systems. For example, the wireless communication system 146 may include one or more dedicated short range communications (DSRC) devices, which may carry public and/or private data communication between vehicles and/or roadside stations.
The power supply 110 may provide power to various components of the vehicle 100. In one embodiment, the power supply 110 may be a rechargeable lithium-ion, sodium-ion, or lead-acid battery. One or more battery packs of such batteries may be configured as the power supply that provides power to the various components of the vehicle 100. In some embodiments, the power supply 110 and the energy source 119 may be implemented together, as in some all-electric vehicles.
Some or all of the functions of the vehicle 100 are controlled by the computer system 112. The computer system 112 may include at least one processor 113, which executes instructions 115 stored in a non-transitory computer-readable medium such as the data storage device 114. The computer system 112 may also be multiple computing devices that control individual components or subsystems of the vehicle 100 in a distributed manner.
The processor 113 may be any conventional processor, such as a commercially available central processing unit (CPU). Optionally, the processor may be a dedicated device such as an application specific integrated circuit (ASIC) or another hardware-based processor. Although FIG. 3 functionally illustrates the processor 113, the data storage device 114, and other elements of the computer system 112 in the same block, a person of ordinary skill in the art should understand that the processor 113, the computer system 112, or the data storage device 114 may actually include multiple processors, computers, or memories that are not housed within the same physical enclosure. For example, the memory may be a hard disk drive or another storage medium located in an enclosure different from that of the computer system 112. A reference to a processor or computer is therefore to be understood as including a reference to a collection of processors, computers, or memories that may or may not operate in parallel. Rather than using a single processor to perform the steps described here, some components, such as the steering component and the deceleration component, may each have their own processor that performs only the computations related to the function of that component.
In the various aspects described here, the processor may be located remotely from the vehicle and communicate with the vehicle wirelessly. In other aspects, some of the processes described here are executed on a processor arranged in the vehicle while others are executed by a remote processor, including taking the steps necessary to perform a single maneuver.
In some embodiments, the data storage device 114 may contain instructions 115 (for example, program logic) that can be executed by the processor 113 to perform various functions of the vehicle 100, including the functions described above. The data storage device 114 may also contain additional instructions, including instructions for sending data to, receiving data from, interacting with, and/or controlling one or more of the travel system 102, the sensor system 104, the control system 106, and the peripheral devices 108.
In addition to the instructions 115, the data storage device 114 may also store data such as road maps, route information, the position, direction, and speed of the vehicle, other such vehicle data, and other information. This information may be used by the vehicle 100 and the computer system 112 while the vehicle 100 operates in autonomous, semi-autonomous, and/or manual mode.
For example, in the embodiments of the present application, the data storage device 114 obtains, from the sensor system 104 or other components of the vehicle 100, the voice command for controlling the vehicle collected by the microphone 150, in-vehicle occupant distribution information, distribution information of people outside the vehicle, vehicle state information, and the like. The voice command may be issued by a person inside the vehicle or by a person outside the vehicle. The in-vehicle occupant distribution information includes whether anyone is in the vehicle and the bearing or specific position of each occupant. The distribution information of people outside the vehicle includes whether anyone is within a preset range outside the vehicle and the bearing or specific position of each such person. The vehicle state information includes the driving speed of the vehicle, whether the windows are open or closed, and the like. The data storage device 114 can store this information, so that the processor 113 can determine from it whether the sound source position of the voice command is located inside the vehicle. If the sound source position of the voice command is determined to be inside the vehicle, the vehicle 100 can further be controlled to perform the operation corresponding to the voice command.
The user interface 116 is used to provide information to, or receive information from, a user of the vehicle 100. Optionally, the user interface 116 may include one or more input/output devices within the set of peripheral devices 108, such as the wireless communication system 146, the onboard computer 148, the microphone 150, and the speaker 152.
The computer system 112 may control the functions of the vehicle 100 based on input received from various subsystems (for example, the travel system 102, the sensor system 104, and the control system 106) and from the user interface 116. For example, the computer system 112 may use input from the control system 106 to control the steering system 132 so as to avoid obstacles detected by the sensor system 104 and the obstacle avoidance system 144. In some embodiments, the computer system 112 is operable to control many aspects of the vehicle 100 and its subsystems.
Optionally, one or more of the above components may be installed separately from, or associated with, the vehicle 100. For example, the data storage device 114 may exist partially or completely separate from the vehicle 100. The above components may be communicatively coupled together in a wired and/or wireless manner.
Optionally, the above components are only an example. In practical applications, components in each of the above modules may be added or removed according to actual needs, and FIG. 3 should not be understood as limiting the embodiments of the present application.
In the embodiments of the present application, an autonomous vehicle such as the vehicle 100 above may determine whether the sound source position of a voice command is located inside the vehicle based on information such as the audio features of the voice command collected by the microphone 150, the intensity with which the microphone 150 picks up the voice command, the distribution of people inside the vehicle, the distribution of people within a preset range outside the vehicle, the driving speed of the vehicle, and whether the windows of the vehicle are open or closed.
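One way such information might be combined is sketched below. The rules, their ordering, the `Evidence` type, and the speed threshold are illustrative assumptions only; this passage lists the inputs but does not prescribe how they are fused.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Evidence:
    occupants_inside: int            # people detected in the cabin
    people_outside_in_range: int     # people detected within the preset range outside
    windows_closed: bool
    speed_kmh: float
    loudest_mic_is_exterior: Optional[bool]  # None if no exterior microphone unit


def source_is_inside(e: Evidence, moving_kmh: float = 5.0) -> bool:
    """Hypothetical rule cascade; moving_kmh is an assumed threshold."""
    # Nobody detected in the cabin: assume the command came from outside.
    if e.occupants_inside == 0:
        return False
    # Occupants present and nobody nearby outside: assume inside.
    if e.people_outside_in_range == 0:
        return True
    # Moving with windows closed: assume an outside speaker was not picked up.
    if e.windows_closed and e.speed_kmh >= moving_kmh:
        return True
    # Ambiguous case: fall back on pickup intensity when it is available.
    if e.loudest_mic_is_exterior is not None:
        return not e.loudest_mic_is_exterior
    # Conservative default when nothing settles the question.
    return False
```

The conservative default treats an unresolved command as coming from outside, which matches the stated aim of improving the safety of vehicle use.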
Further, the vehicle 100 as an autonomous vehicle, or a computing device associated with it (such as the computer system 112, the computer vision system 140, or the data storage device 114 in FIG. 3), may decide based on the determination result whether to perform the control operation corresponding to the voice command. For example, when the sound source position is determined to be inside the vehicle, the vehicle is controlled to perform the operation corresponding to the voice command. As another example, when the sound source position is determined not to be inside the vehicle (that is, the sound source position is outside the vehicle), the voice command may simply not be executed. Alternatively, it may further be determined whether a preset trigger condition is met: when the trigger condition is met, the vehicle is controlled to perform the operation corresponding to the voice command, and when it is not met, the voice command is not executed. As a further alternative, an inquiry asking whether to execute the voice command may be sent to the user.
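The alternatives described above can be summarized in a small decision function. This is a sketch under stated assumptions: the `Action` enumeration, the `decide` function, and the way the three outside-the-vehicle alternatives are selected through parameters are hypothetical, and the trigger condition itself is left abstract because this passage does not define it.

```python
from enum import Enum
from typing import Optional


class Action(Enum):
    EXECUTE = "execute"
    IGNORE = "ignore"
    ASK_USER = "ask_user"


def decide(source_inside: bool,
           trigger_met: Optional[bool] = None,
           ask_when_outside: bool = False) -> Action:
    """Map the inside/outside determination to one of the described outcomes.

    trigger_met is None when no preset trigger condition is configured.
    """
    if source_inside:
        return Action.EXECUTE
    if ask_when_outside:
        # Alternative: ask the user whether to execute the command.
        return Action.ASK_USER
    if trigger_met:
        # Alternative: a preset trigger condition is configured and met.
        return Action.EXECUTE
    # Default: a command from outside the vehicle is not executed.
    return Action.IGNORE
```

For instance, `decide(False, trigger_met=True)` yields `Action.EXECUTE`, while `decide(False)` yields `Action.IGNORE`.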
In addition to providing instructions that direct the autonomous vehicle to perform emergency communication, the computing device may also provide instructions for modifying the states of various devices in the vehicle 100, so that the autonomous vehicle follows given emergency control measures and adjusts the states of the in-vehicle devices to ensure the safety of the occupants.
The vehicle 100 may be a car, a truck, a train, an engineering vehicle, a motorcycle, a bus, a boat, an airplane, a helicopter, a lawn mower, a recreational vehicle, an amusement park vehicle, construction equipment, a tram, a golf cart, a trolley, or the like, which is not specifically limited in the embodiments of the present application.
Based on the structure of the vehicle 100 shown in FIG. 3, and in combination with the vehicle control method provided by the embodiments of the present application, the technical problems raised in the present application are analyzed and solved in detail below.
Referring to FIG. 4, FIG. 4 is a schematic flowchart of a vehicle control method 400 provided by an embodiment of the present application. The control method 400 may be applied in the vehicle 100 shown in FIG. 3; for example, the control method 400 may be executed by the vehicle 100 or by a processor or chip in the vehicle 100. The control method 400 may also be applied in another device capable of controlling the vehicle 100 (for example, a cloud server or a mobile phone terminal); for example, the control method 400 may be executed by a cloud server or by a processor or chip inside the cloud server. The control method 400 may include the following steps 410 to 440.
Step 410: obtain a voice command for controlling the vehicle.
As vehicles become increasingly intelligent, a vehicle (for example, a smart car) is usually equipped with sound pickup devices such as microphones and can collect speech. For example, it can obtain voice commands for controlling the vehicle, and it may also pick up irrelevant content such as conversation. A microphone can collect speech not only inside the vehicle but usually also outside the vehicle, especially when a window is open. The voice command may therefore be issued by a driver or passenger in the vehicle (for example, the driver or a passenger who takes a seat later), or by a person outside the vehicle. In some cases, the voice command may also be issued by a smart device with a voice broadcast function, such as a robot or a smart speaker, which is not specifically limited in the present application.
The voice command is able to control the vehicle. After collecting the voice command, the vehicle performs semantic recognition, generates a corresponding control instruction, and controls the relevant control object by means of that control instruction. The present application does not limit which aspect of the vehicle the voice command controls. Optionally, the voice command may perform vehicle state control, media control, air conditioning control, seat control, and so on.
In some cases, the voice command may be a vehicle state control command, such as opening or closing a window (including the sunroof), opening or closing a door (locking or unlocking a door), opening or closing the trunk, adjusting the ambient lighting, locking the vehicle, starting or switching off the vehicle, adjusting the vehicle speed, or adjusting the rearview mirrors. In some cases, the voice command may be a media control command, such as opening navigation, adjusting the central control screen, playing music, tuning the radio, or answering or making a phone call. In some cases, the voice command may be an air conditioning control command, such as raising the temperature, lowering the temperature, turning on internal circulation, or purifying the air in the vehicle. In some cases, the voice command may be a seat control command, such as turning on seat heating, raising or lowering the seat height, moving the seat forward or backward, or adjusting the seat back angle.
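The four command categories listed above can be illustrated with a small lookup. Keyword matching is used here only as a stand-in for the semantic recognition step, which the application does not detail. The `Category` enumeration, the keyword lists, and the `categorize` function are hypothetical.

```python
from enum import Enum
from typing import Optional


class Category(Enum):
    VEHICLE_STATE = "vehicle state control"
    MEDIA = "media control"
    CLIMATE = "air conditioning control"
    SEAT = "seat control"


# Hypothetical keyword lists drawn from the examples in the text above.
KEYWORDS = {
    Category.VEHICLE_STATE: ("window", "sunroof", "door", "trunk",
                             "ambient light", "lock", "mirror"),
    Category.MEDIA: ("navigation", "music", "radio", "call"),
    Category.CLIMATE: ("temperature", "circulation", "purif"),
    Category.SEAT: ("seat",),
}


def categorize(text: str) -> Optional[Category]:
    """Return the first category whose keyword occurs in the recognized text."""
    lowered = text.lower()
    for category, words in KEYWORDS.items():
        if any(word in lowered for word in words):
            return category
    return None  # not a recognized control command, e.g. ordinary conversation
```

Returning `None` for unmatched text reflects the point made earlier that the microphone also picks up content that is not a control command.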
Optionally, the voice command may also be a wake-up command used to wake the voice assistant of the vehicle. The wake-up command may be a default command set by the manufacturer; for example, the voice command may be "小艺小艺" ("Xiaoyi Xiaoyi"), which wakes the intelligent voice assistant Xiaoyi. The wake-up command may also be one customized by the user according to personal preference. In some cases, the voice assistant of the vehicle may operate in a wake-free mode, in which the voice assistant remains awake at all times. This ensures that the vehicle can promptly obtain commands that control the state of the vehicle, such as "open the door", and makes voice control of the vehicle more convenient.
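The difference between the wake-up mode and the wake-free mode can be sketched as a simple gate in front of command handling. The function `gate` and its return convention are assumptions made for illustration; only the wake-up phrase is taken from the text above.

```python
from typing import Tuple

DEFAULT_WAKE_WORD = "小艺小艺"  # default wake-up phrase named in the text


def gate(utterance: str,
         awake: bool,
         wake_word: str = DEFAULT_WAKE_WORD,
         wake_free: bool = False) -> Tuple[bool, bool]:
    """Return (treat_as_command, assistant_awake_afterwards).

    In wake-free mode every utterance is passed on for command handling.
    Otherwise an utterance containing the wake word only wakes the
    assistant, and later utterances are handled while it stays awake.
    """
    if wake_free:
        return True, True
    if wake_word in utterance:
        return False, True
    if awake:
        return True, True
    return False, False
```

A user-defined wake-up phrase can be supplied through the `wake_word` parameter.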
Step 420: determine whether the sound source position of the voice command is located inside the vehicle.
Here, the sound source position of the voice command is the position of the person or device that utters the voice command. Determining whether the sound source position of the voice command is located inside the vehicle may mean determining whether the person who spoke the voice command is inside the vehicle.
Here, the inside of the vehicle is defined in contrast to the outside of the vehicle: determining that the sound source position of the voice command is not inside the vehicle is equivalent to determining that it is outside the vehicle.
In some cases, the inside of the vehicle may refer to the inside of the cab. For example, in passenger vehicles such as cars, off-road vehicles, and buses, the cab contains seats for passengers in addition to the driver's seat. The inside of such a passenger vehicle is therefore clearly defined as the inside of the vehicle body that contains the seats. Trucks (such as pickups), vans, cranes, excavators, and other working vehicles, however, usually have, in addition to the cab, a space used for hauling goods or for construction work, such as a cargo bed, a cargo compartment, or a bucket. In the present application, that space is counted as outside the vehicle. For example, if a person speaks the voice command while standing in the bucket of a loader, the sound source position of the voice command can be determined to be outside the vehicle.
As vehicles become increasingly intelligent, they are equipped with more and more sensors (for example, ADAS sensors), such as in-vehicle cameras, exterior cameras, radars, and seat pressure sensors. These sensors provide various kinds of vehicle-related data, and whether the sound source position of the voice command is located inside the vehicle can be determined based on one or more kinds of that data. That is, this application does not limit the specific manner of determining whether the sound source position of the voice command is located inside the vehicle.
Optionally, whether the sound source position is located inside the vehicle may be determined based on the acquired data in combination with a built-in algorithm.
For example, the data may be input into a neural network model, which analyzes the data and determines whether the sound source position is located inside the vehicle. The neural network model is obtained by training on historical data (training data) with a deep learning algorithm. The neural network model may be, for example, a convolutional neural network (CNN) model, and the deep learning algorithm may be, for example, a machine learning algorithm or a meta-learning algorithm.
It is worth noting that, in some cases, determining whether the sound source position of the voice command is located inside the vehicle does not require determining exactly where the sound source is. It is only necessary to determine whether it is inside the vehicle. This reduces the amount of data required, avoids the need to install many additional sensors, or allows the sensors to operate less frequently. It also simplifies the computation, saves processor computing power, and lowers the performance requirements on the processor.
Optionally, whether the sound source position of the voice command is located inside the vehicle may be determined based on one or more of the following data: audio feature data of the voice command collected by a microphone; pickup intensity data of the voice command collected by a microphone; in-vehicle image data collected by an in-vehicle camera; in-vehicle sound data collected by a microphone; seat pressure data collected by a seat sensor; in-vehicle vibration data collected by a vibration sensor; touch data of the in-vehicle touchscreen collected by a touch sensor; vehicle speed data collected by a vehicle speed sensor; window open/closed status data detected by a glass position sensor; exterior image data collected by an exterior camera; and obstacle data outside the vehicle collected by a radar or laser rangefinder. The data is not limited to these items. As vehicles become more intelligent and sensor technology develops further, the content and types of such data may expand.
Several methods for determining whether the sound source position is located inside the vehicle are described below. In practice, any one of these methods, or a combination of several, may be used to make the determination. When several methods are combined, they may be run at the same time and the final result derived from the individual results. Alternatively, the methods may be run in a priority order: when one or more earlier methods already yield a reliable result, the later methods need not be executed, which saves processor computing power.
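The two ways of combining methods described above can be sketched as follows. This is only an illustrative sketch under stated assumptions: the function names, the three-valued result (True for inside, False for outside, None for inconclusive), and the use of a simple majority for the simultaneous case are assumptions, since the application does not specify how multiple results are merged.

```python
from typing import Callable, Optional, Sequence

# A check returns True (inside), False (outside), or None (inconclusive).
# This three-valued convention is an assumption made for the sketch.
Check = Callable[[dict], Optional[bool]]


def decide_by_priority(checks: Sequence[Check], data: dict) -> Optional[bool]:
    """Run checks in priority order; stop at the first conclusive verdict."""
    for check in checks:
        verdict = check(data)
        if verdict is not None:
            return verdict  # remaining checks are skipped to save computation
    return None


def decide_by_vote(checks: Sequence[Check], data: dict) -> Optional[bool]:
    """Run every check and take the majority of the conclusive verdicts."""
    verdicts = [v for v in (check(data) for check in checks) if v is not None]
    inside = sum(verdicts)
    outside = len(verdicts) - inside
    if inside == outside:
        return None  # no conclusive verdicts, or a tie
    return inside > outside
```

With the priority variant, inexpensive checks (for example, vehicle speed) could be placed first so that camera or model-based checks run only when needed.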
In some cases, whether the sound source position is located inside the vehicle may be determined from the distribution of people inside the vehicle. Relevant data may be obtained through detection devices such as in-vehicle cameras, seat pressure sensors, touch sensors, vibration sensors, magnetometers, and infrared sensors. Based on that data, occupant distribution information is determined, such as whether there are occupants in the vehicle, their orientation or specific position, and the specific seats they occupy. Finally, whether the sound source position is located inside the vehicle is determined based on the occupant distribution information.
Optionally, if the occupant distribution information indicates that there is currently no driver or passenger in the vehicle, it can be determined that the sound source position is located outside the vehicle rather than inside.
FIG. 5 is a schematic diagram of application scenarios of the control method 400 shown in FIG. 4. As shown in part (a) of FIG. 5, a person 50 outside the vehicle (for example, the owner) calls out the voice command "fold the rearview mirrors". After the microphone 51 inside the vehicle picks up the voice command, data collected by the in-vehicle camera 52, the seat sensor 53, or the like can be used to determine that there is no driver or passenger in the vehicle. It can then be determined that the sound must have come from outside the vehicle (without additionally turning on an exterior camera for confirmation), that is, the sound source position of the voice command is located outside the vehicle rather than inside.
Optionally, if the occupant distribution information indicates that there are currently occupants in the vehicle, other information (for example, any one or more of the kinds of data listed above) may additionally be used to determine whether the sound source position is located inside the vehicle.
For example, the determination may further take into account the audio feature data of the voice command collected by the microphone, or the pickup intensity data of the voice command collected by the microphone. This is described further below.
In some cases, whether the sound source position is located inside the vehicle may be determined from the distribution of people outside the vehicle. Relevant data may be obtained through detection devices such as exterior cameras, radars, and laser rangefinders. Based on that data, person distribution information is determined, such as whether there are people within a preset range around the outside of the vehicle and their orientation or specific position. Finally, whether the sound source position is located inside the vehicle is determined based on the person distribution information.
The preset range may be determined according to the pickup capability of the microphones installed in the vehicle. The stronger the pickup capability, the more likely it is that a voice command uttered by a person farther away will be picked up, so the preset range should be correspondingly larger. The manufacturer may set the preset range appropriately, according to the specific microphone configuration, before the vehicle leaves the factory.
Optionally, if the person distribution information indicates that there is currently no person within the preset range outside the vehicle, it can be determined that the sound source position is located inside the vehicle.
As shown in part (b) of FIG. 5, a person 50 inside the vehicle (for example, the driver) utters the voice command "open the window". After the microphone 51 inside the vehicle picks up the voice command, data detected by the exterior camera 54, a radar, a laser rangefinder, or a similar device can be used to determine that there is no person within the preset range outside the vehicle. It can then be determined that the sound must have come from inside the vehicle (without additionally turning on the in-vehicle camera for confirmation), that is, the sound source position of the voice command is located inside the vehicle. Only one exterior camera 54 is shown in the figure. In practice, multiple exterior cameras 54 may be arranged around the periphery of the vehicle body to obtain a 360-degree panoramic image of the surroundings, so that whether anyone is present within the preset range outside the vehicle can be determined more efficiently and accurately.
Optionally, if the person distribution information indicates that there are people within the preset range outside the vehicle, other information (for example, any one or more of the kinds of data listed above) may additionally be used to determine whether the sound source position is located inside the vehicle.
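The two occupancy-based rules above (people inside the vehicle and people within the preset range outside it) can be combined in a small decision function. The following is a minimal sketch, not the application's implementation: the function name and boolean inputs are hypothetical, and treating the case in which nobody is detected anywhere as inconclusive is an assumption, because the application does not discuss that case.

```python
from typing import Optional


def locate_by_occupancy(cabin_occupied: bool,
                        people_in_outside_range: bool) -> Optional[bool]:
    """Return True (inside), False (outside), or None (inconclusive).

    cabin_occupied: whether in-vehicle sensors detect any occupant.
    people_in_outside_range: whether exterior sensors detect anyone within
        the preset range around the vehicle.
    """
    if cabin_occupied and not people_in_outside_range:
        return True   # only the cabin holds a possible speaker
    if not cabin_occupied and people_in_outside_range:
        return False  # only the surroundings hold a possible speaker
    # People on both sides, or nobody detected at all (an assumption of
    # this sketch): other data is needed to decide.
    return None
```

A None result corresponds to the cases in which the text falls back on other information, such as audio features or pickup intensity.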
In some cases, whether the sound source position is located inside the vehicle may be determined from the driving speed of the vehicle. When the driving speed reaches a certain value, for example 80 kilometers per hour (km/h) or more, speech from outside the vehicle is unlikely to be picked up even if the windows are wide open. Therefore, if the microphone picks up a voice command for controlling the vehicle at such a speed, it can be determined that the sound source position of the voice command is located inside the vehicle.
Optionally, when the driving speed of the vehicle is greater than or equal to a preset speed threshold, it can be determined that the sound source position of the voice command is located inside the vehicle. The speed threshold may be 60 to 100 km/h, for example 70 km/h or 80 km/h. The manufacturer may set the speed threshold appropriately, according to the pickup capability of the microphone, before the vehicle leaves the factory.
As shown in part (c) of FIG. 5, a person 50 inside the vehicle (for example, the driver) utters the voice command "set the cabin temperature to 22 degrees Celsius". After the microphone 51 inside the vehicle picks up the voice command, the speed data obtained by the vehicle speed sensor shows that the current speed is 110 km/h, which is greater than the preset speed threshold of 75 km/h. At this speed the microphone 51 cannot pick up speech from outside the vehicle, so it can be determined that the sound source position of the voice command is located inside the vehicle.
Optionally, when the driving speed of the vehicle is less than the speed threshold, other information (for example, any one or more of the kinds of data listed above) may additionally be used to determine whether the sound source position is located inside the vehicle.
For example, suppose the driving speed of the vehicle is less than the speed threshold but greater than a preset lower limit (for example, 25 km/h), and all windows of the vehicle are closed. Given the relatively high speed together with the sound-blocking effect of the glass, the in-vehicle microphone is unlikely to pick up a voice command from outside the vehicle, so it can be determined that the sound source position of the voice command is located inside the vehicle.
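The speed-based rules above can be written as a short decision function. This is a sketch under stated assumptions: the function and constant names are hypothetical, and the values 75 km/h and 25 km/h are simply the example figures mentioned in the text, not recommended settings.

```python
from typing import Optional

SPEED_THRESHOLD_KMH = 75.0    # example value; the text gives 60 to 100 km/h
LOWER_SPEED_LIMIT_KMH = 25.0  # example lower limit from the text


def locate_by_speed(speed_kmh: float,
                    all_windows_closed: bool) -> Optional[bool]:
    """Return True if the speaker must be inside, else None (inconclusive)."""
    if speed_kmh >= SPEED_THRESHOLD_KMH:
        return True  # outside speech cannot be picked up at this speed
    if speed_kmh > LOWER_SPEED_LIMIT_KMH and all_windows_closed:
        return True  # moderate speed plus closed windows blocks outside speech
    return None      # fall back on other data
```

Note that these rules can only confirm an interior source. They never conclude that the source is outside, which matches the text.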
In some cases, whether the sound source position is located inside the vehicle may be determined from the open/closed state of the vehicle windows. Relevant data may be obtained through window position sensors to determine the current state of each window, for example fully open, half open, or fully closed. Window glass provides a degree of sound insulation. When the windows are closed, the in-vehicle microphone may be unable to accurately pick up voice commands from outside the vehicle, whereas when the windows are open it can. Whether the sound source position is located inside the vehicle can therefore be determined according to the open/closed state of the windows.
Optionally, when all windows of the vehicle are fully closed, the microphone cannot receive speech from outside the vehicle, and it can be determined that the sound source position of the voice command is located inside the vehicle. This applies, for example, to some high-end sedans whose sound insulation is particularly good.
In some cases, whether the sound source position is located inside the vehicle may be determined from the audio features of the voice command. After sound passes through an obstacle, some of its audio features (for example, waveform and energy) are affected and change. In other words, if the same person utters the same voice command inside the vehicle and outside the vehicle, the audio features of the two will differ after being picked up by the microphone. This change or difference in the audio features can be used to determine whether the sound source position is located inside the vehicle.
For example, certain feature parameters of the audio features of the voice command may be compared with preset parameter thresholds, and whether the sound source position is located inside the vehicle is determined according to the comparison result.
As another example, whether the sound source position is located inside the vehicle may be determined based on the acquired audio features in combination with a built-in algorithm.
Optionally, the audio feature data of the voice command may be input into a neural network model, which analyzes the feature data and determines whether the sound source position is located inside the vehicle. The neural network model is obtained by training on historical data (training data) with a deep learning algorithm. The neural network model may be, for example, a CNN model, and the deep learning algorithm may be, for example, a machine learning algorithm or a meta-learning algorithm.
As a specific example, FIG. 6 is a schematic diagram of the propagation of a control command. As shown in FIG. 6, a microphone 61 is provided inside the vehicle, and the window 62 of the vehicle is fully closed. A person 60 outside the vehicle utters the voice command "open the door". The voice command propagates through the air as sound waves and must pass through the vehicle body, including the window glass, to enter the vehicle and finally reach the microphone 61. Because the vehicle body absorbs and refracts the sound waves, the audio features of the voice command change. That is, the characteristic parameters of the sound wave 63 before passing through and the sound wave 64 after passing through in FIG. 6 may differ slightly. Based on the audio features of the voice command picked up by the microphone 61, in combination with a preset algorithm (for example, a neural network model), it can be determined that the sound source position of the voice command is located outside the vehicle, that is, the person uttered the voice command from outside the vehicle.
Optionally, the audio features here may include any feature parameters such as spectral structure, energy, amplitude, frequency, and period, but are not limited to these.
Optionally, the audio features may include a spectral structure, and whether the sound source position is located inside the vehicle may be determined according to the acquired spectral structure of the voice command. After sound passes through an obstacle such as window glass, part of the energy in the high-frequency band of the spectrum is absorbed, so the high-frequency part is noticeably attenuated. By contrast, sound uttered inside the vehicle that has not passed through an obstacle retains its high-frequency part relatively intact. Under otherwise equal conditions, a voice command uttered outside the vehicle and one uttered inside the vehicle therefore differ in spectral structure, and this difference can be used to determine whether the sound source position is located inside the vehicle. The advantage of using the spectral structure is that the method is simple, efficient, and easy to implement. It does not rely on other data and can reduce how often the other sensors of the vehicle are used.
For example, the spectral structure may be input into a pre-trained neural network model (for example, a CNN model). The neural network model analyzes the spectral structure and outputs a determination of whether the sound source position is located inside the vehicle.
In some cases, whether the sound source position of the voice command is located inside the vehicle may be determined according to the characteristics of the high-frequency part of the spectral structure of the voice command. When the completeness of the high-frequency part satisfies a preset condition, it is determined that the sound source position is located inside the vehicle.
After sound passes through an obstacle such as window glass, part of the energy in the high-frequency band of the spectrum is absorbed and the high-frequency part is noticeably attenuated, whereas sound uttered inside the vehicle that has not passed through an obstacle retains its high-frequency part relatively intact. Under otherwise equal conditions, a voice command uttered outside the vehicle and one uttered inside the vehicle therefore differ markedly in the completeness of the high-frequency part. The characteristics of the high-frequency part can thus be used to determine whether the sound source position of the voice command is located inside the vehicle: when the completeness of the high-frequency part satisfies the preset condition, it is determined that the sound source position is located inside the vehicle. The advantage of using the high-frequency part of the spectral structure is that the method is simple, efficient, and easy to implement. It does not rely on other data and can reduce how often the other sensors of the vehicle are used.
For example, the high-frequency part of the spectral structure may be input into a pre-trained neural network model (for example, a CNN model). The neural network model analyzes the spectral structure and outputs a determination of whether the sound source position is located inside the vehicle.
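To make the idea of high-frequency completeness concrete, the sketch below replaces the neural network model with a much simpler stand-in: it computes the share of signal energy above a cutoff frequency and compares it with a threshold. This is not the method of the application. The function names, the 2000 Hz cutoff, and the 0.2 ratio are hypothetical values chosen only for illustration, and a plain discrete Fourier transform is used so that the example needs no third-party library.

```python
import cmath
import math
from typing import Sequence


def high_band_ratio(samples: Sequence[float], sample_rate: float,
                    cutoff_hz: float) -> float:
    """Fraction of spectral energy at or above cutoff_hz (DC bin excluded)."""
    n = len(samples)
    total = 0.0
    high = 0.0
    for k in range(1, n // 2 + 1):
        # Naive DFT coefficient for frequency bin k.
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(samples))
        power = abs(coeff) ** 2
        total += power
        if k * sample_rate / n >= cutoff_hz:
            high += power
    return high / total if total > 0 else 0.0


def is_inside_by_spectrum(samples: Sequence[float], sample_rate: float,
                          cutoff_hz: float = 2000.0,
                          min_ratio: float = 0.2) -> bool:
    """Hypothetical preset condition: enough energy survives above the cutoff."""
    return high_band_ratio(samples, sample_rate, cutoff_hz) >= min_ratio
```

In a real system the cutoff and ratio would have to be calibrated for the specific vehicle and microphone, and a trained model as described in the text would likely be more robust than a fixed threshold.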
Step 430: When it is determined that the sound source position is located inside the vehicle, control the vehicle to perform the operation corresponding to the voice command.
When it is determined in step 420 that the sound source position is located inside the vehicle, that is, the voice command was uttered by an occupant of the vehicle, the vehicle can be controlled to perform the corresponding operation according to the voice command. In some cases, the voice command may be a wake-up command for waking up the voice assistant of the vehicle, such as "Xiaoyi Xiaoyi", in which case the voice assistant of the vehicle is woken up. In some cases, the voice command may be a control command for vehicle status control, media control, air-conditioning control, or seat control, in which case the corresponding controlled object is controlled to perform the corresponding operation.
Controlling the vehicle to perform the operation corresponding to the voice command specifically includes: performing semantic recognition on the acquired voice command to obtain a control command, and controlling the vehicle accordingly based on the control command.
That is, semantic parsing is performed on the voice command to generate the corresponding control command only when it is determined that the sound source position is located inside the vehicle. When it is determined that the sound source position is not located inside the vehicle (that is, it is located outside the vehicle), semantic parsing of the voice command may be skipped, which helps save processor computing power.
Step 440: When it is determined that the sound source position is not located inside the vehicle, control the vehicle not to execute the voice command.
Specifically, when it is determined in step 420 that the sound source position is not located inside the vehicle, that is, it is located outside the vehicle, the voice command may have been uttered by a person outside the vehicle (for example, a stranger, a passerby, or a criminal) and happened to be picked up by the microphone. In this case, semantic recognition need not be performed on the voice command and the voice command is not executed, that is, the vehicle does not respond to it. This improves the security of vehicle control and protects the personal and property safety of the occupants.
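The flow of steps 420 to 440 can be summarized in a few lines. This is a minimal sketch under stated assumptions: the function name and the three callables passed in (the location check, the semantic parser, and the executor) are hypothetical placeholders for components the application does not define at code level.

```python
from typing import Callable, Optional


def handle_voice_command(audio: bytes,
                         is_inside: Callable[[bytes], bool],
                         parse: Callable[[bytes], str],
                         execute: Callable[[str], None]) -> Optional[str]:
    """Gate a voice command on the sound source position.

    Returns the executed control command, or None if the command was ignored.
    """
    # Step 420: decide whether the sound source is inside the vehicle.
    if not is_inside(audio):
        # Step 440: do not respond; semantic parsing is skipped entirely,
        # which also saves computation.
        return None
    # Step 430: parse the command only now, then carry it out.
    command = parse(audio)
    execute(command)
    return command
```

Placing the location check before the semantic parser reflects the ordering described in the text, where parsing is performed only for commands from inside the vehicle.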
According to the vehicle control method 400 provided by the embodiments of this application, after a voice command for controlling the vehicle is received, the command is not executed directly. Instead, it is first determined whether the sound source position of the voice command is located inside the vehicle, and the control operation corresponding to the voice command is executed only when the sound source position is determined to be inside the vehicle. This makes vehicle control more intelligent, improves the safety of vehicle use, and protects the personal and property safety of the occupants. This application helps users inside the vehicle avoid control of and interference with the vehicle by strangers outside it, prevents unintended operation of the vehicle, and further ensures the safety of the occupants, especially when they are resting.
Taking the scenario shown in FIG. 1 as an example, the occupants of the vehicle 11 have parked at the roadside and are resting in the vehicle when a person 13 outside the vehicle calls out the voice command "open the door". Because the vehicle 11 applies the control method 400 provided by the embodiments of this application, the vehicle 11 first determines, according to the preset determination logic, that the sound source position of the voice command is located outside the vehicle. The vehicle 11 therefore does not execute the voice command, that is, it does not open the door, which protects the personal and property safety of the occupants.
FIG. 7 is a schematic flowchart of a vehicle control method 700 provided by an embodiment of this application. The control method 700 may include the following steps 710 to 750. For steps 710, 730, and 750, refer to the descriptions of steps 410, 430, and 440 in the foregoing embodiment, respectively. The following focuses on the differences.
Step 710: Acquire a voice command for controlling the vehicle.
Step 720: Determine whether the sound source position is located inside the vehicle according to the pickup intensity of the voice command at the microphone.
Specifically, the closer a microphone is to the sound source of the voice command, the higher its pickup intensity for that command, that is, the louder the picked-up voice command. The pickup intensity of the microphone therefore reflects the distance to the sound source, and whether the sound source position is located inside the vehicle can be determined according to the pickup intensity. For example, the specific location or direction of the sound source may first be determined according to the pickup intensity of the voice command at the microphone, and whether the sound source position is located inside the vehicle may then be determined from that.
In some cases, the vehicle may be equipped with only one microphone, and whether the sound source position is located inside the vehicle can be determined from the pickup intensity of the voice command at that single microphone. FIG. 8 is a schematic diagram of application scenarios of the control method shown in FIG. 7. As shown in part (a) of FIG. 8, the vehicle is equipped with an in-vehicle microphone 81, which is located near the driver's seat and is the only microphone of the vehicle. The in-vehicle microphone 81 can accurately pick up sound only within a small surrounding range. When the in-vehicle microphone 81 receives the voice command "turn on seat heating" and the pickup intensity of the voice command is determined to exceed a threshold, it is determined that the voice command was uttered by the person 80 in the driver's seat, close to the in-vehicle microphone 81, that is, the sound source position is inside the vehicle. The vehicle can then execute the voice command, that is, turn on the seat heating function to heat the seat.
In some cases, the vehicle may be equipped with multiple microphones. Whether the sound source position is located inside the vehicle can then be determined from the pickup intensity of the voice command at each of the microphones, in combination with a preset algorithm (for example, a trained neural network model). The microphones may all be arranged inside the vehicle. For example, they may be grouped as a microphone array near the center console, or distributed at different seats. The microphones may also include at least one exterior microphone for picking up speech outside the vehicle, so that the user can interact with the vehicle by voice from outside as well, for a better user experience.
Optionally, as shown in part (b) of FIG. 8, the vehicle is equipped with at least one in-vehicle microphone 81 and at least one exterior microphone 82. When the person 80 located outside the vehicle speaks the voice command "open the trunk", the in-vehicle microphone 81 and the exterior microphone 82 pick up the command at the same time. In this case, the pickup intensity at the exterior microphone 82 is significantly greater than that at the in-vehicle microphone 81, so it can be determined that the sound source position (that is, the position of the person 80) is outside the vehicle, and the command may be left unexecuted.
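The comparison between the in-vehicle microphone 81 and the exterior microphone 82 might be expressed as below. The text only states that a "significantly greater" exterior pickup indicates an outside source; the 6 dB margin, the symmetric "inside" rule, the "uncertain" result, and the name `classify_by_mic_pair` are assumptions added for illustration.

```python
def classify_by_mic_pair(interior_db, exterior_db, margin_db=6.0):
    """Classify the source from the level difference between two microphones.

    margin_db is a hypothetical stand-in for "significantly greater".
    """
    if exterior_db - interior_db >= margin_db:
        return "outside"   # exterior microphone clearly dominates
    if interior_db - exterior_db >= margin_db:
        return "inside"    # assumed symmetric rule, not stated in the text
    return "uncertain"     # levels too close to decide from intensity alone
```

Returning a separate "uncertain" result leaves room for the other signals the description mentions (audio features, occupancy, window state) to settle borderline cases.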
Optionally, the vehicle is equipped with multiple in-vehicle microphones 81. In this case, whether the sound source position is located inside the vehicle can be determined directly from the pickup intensity of the voice command at each of the in-vehicle microphones 81, in combination with a preset algorithm. Alternatively, information such as the specific location of the sound source (that is, its coordinates), the direction of the sound source position, and its distance from each microphone can be determined first, and whether the sound source position is located inside the vehicle can then be determined indirectly from that information.
For example, as shown in part (c) of FIG. 8, the vehicle is equipped with three in-vehicle microphones 81 distributed at different seats in the vehicle, and the lines connecting the three in-vehicle microphones 81 roughly form an isosceles triangle. The person 80 in the rear middle seat speaks the voice command "move the rear seats backward". The three in-vehicle microphones 81 pick up the voice command at roughly the same intensity, so it can be determined that the sound source position is near the center of the triangle, that is, that the sound source position (the position of the person 80) is inside the vehicle.
Step 730: If it is determined that the sound source position is located inside the vehicle, control the vehicle to perform the operation corresponding to the control instruction.
Step 740: If it is determined that the sound source position is located outside the vehicle, determine whether a preset trigger condition is satisfied. If the trigger condition is satisfied, proceed to step 730, that is, control the vehicle to perform the operation corresponding to the control instruction.
Step 750: If the trigger condition is not satisfied, do not execute the voice command.
In other words, if it is determined that the sound source position is located outside the vehicle, a further determination can be made, and whether to perform the operation corresponding to the control instruction is decided according to its result, rather than simply not executing the voice command as in the foregoing embodiment. Specifically, if the trigger condition is satisfied, the vehicle is controlled to perform the operation corresponding to the control instruction; if the trigger condition is not satisfied, the voice command is not executed. With this arrangement, control flexibility can be increased while the security of vehicle voice control is still improved, which helps improve the user experience.
Here, the trigger condition may be preset by the manufacturer or set by the user. For example, the user may set the condition according to the scenarios in which the vehicle is likely to be voice-controlled from outside.
Optionally, the trigger condition may be that the electronic key of the vehicle is located near the vehicle, that the positioning function of a mobile terminal such as a mobile phone indicates that the owner is near the vehicle, that the current time falls within a special time period (for example, commuting hours), that the vehicle has entered a specific mode (for example, a camping mode), or the like. This application does not limit this.
Taking as an example the trigger condition that the electronic key of the vehicle is located near the vehicle: if the vehicle can establish a Bluetooth connection with the electronic key, it can be determined that the electronic key is near the vehicle, that is, that the owner of the vehicle may also be near the vehicle. The trigger condition is then satisfied, and the vehicle can be controlled to perform the operation corresponding to the control instruction.
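Steps 730 to 750 can be summarized in a few lines. The sketch below assumes that trigger conditions are supplied as zero-argument callables and that any one satisfied condition is sufficient; both are illustrative choices, and `should_execute` is a hypothetical name rather than anything defined in this application.

```python
from typing import Callable, Iterable


def should_execute(source_inside: bool,
                   trigger_conditions: Iterable[Callable[[], bool]]) -> bool:
    """Decide whether the vehicle performs the operation for a voice command."""
    if source_inside:
        return True  # step 730: source inside the vehicle, execute
    # Step 740: source outside, check the preset trigger condition(s).
    if any(condition() for condition in trigger_conditions):
        return True  # condition satisfied, proceed as in step 730
    return False     # step 750: do not execute the voice command
```

For instance, a condition such as `lambda: key_connected_over_bluetooth` (a hypothetical flag) would let the owner standing next to the vehicle open the trunk by voice, while the same command from a stranger would be ignored.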
Referring to FIG. 9, FIG. 9 is a schematic flowchart of a vehicle control method 900 provided by an embodiment of this application. The control method 900 may include the following steps 910 to 960. For step 910, step 940, and step 950, refer to the descriptions of step 410, step 420, and step 430 in the foregoing embodiment, respectively; the differences are the focus here.
Step 910: Obtain a voice command for controlling the vehicle.
Step 920: Determine the direction of the sound source position from the pickup intensity of the voice command at each of multiple microphones.
Step 930: Determine, from the direction of the sound source position, whether the sound source position is located inside the vehicle.
Specifically, in some cases, for example when there are too few microphones or the relative position of the sound source and the microphones is unfavorable, the specific location of the sound source may not be accurately determinable from the pickup intensity, or determining it may require substantial processor computing power. In such cases, the direction of the sound source position can be determined from the pickup intensity of the voice command at multiple microphones, and a preset algorithm or logic can then use that direction to determine whether the sound source position is located inside the vehicle.
Optionally, after the direction of the sound source position is determined, it is further determined that the vehicle interior has no seat in that direction, in which case it can be determined that the sound source position is outside the vehicle. For example, the microphone is mounted on the center console inside the vehicle and the direction points ahead of the front of the vehicle. The interior space in that direction has no seat and cannot accommodate a passenger, so it can be determined that the sound source position is outside the vehicle.
Optionally, after the direction of the sound source position is determined, it is further determined whether the window in that direction is closed. For example, if the window provides strong sound insulation and is currently fully closed, it can be determined that the sound source position is inside the vehicle.
Optionally, after the direction of the sound source position is determined, the distribution of persons inside and outside the vehicle in that direction is further determined, and whether the sound source position is located inside the vehicle is determined in combination with that distribution. For example, if no person is present inside the vehicle in that direction, it can be determined that the sound source position is outside the vehicle; or, if no person is present outside the vehicle in that direction, it can be determined that the sound source position is inside the vehicle; or, if persons are present both inside and outside the vehicle in that direction, other sensor data can be used to determine whether the sound source position is located inside the vehicle.
FIG. 10 is a schematic diagram of application scenarios of the control method 900 shown in FIG. 9. With reference to FIG. 9 and FIG. 10, in this embodiment of this application, step 930 specifically includes:
Step 931: Determine whether a person is present inside the vehicle in that direction. If not, determine that the sound source position is outside the vehicle and proceed to step 960 to send query information asking whether to execute the voice command; if so, proceed to step 932.
As shown in part (a) of FIG. 10, the in-vehicle microphone 81 is arranged near the center console of the vehicle. The direction of the sound source position, indicated by the dashed arrow in the figure, can first be determined from the pickup intensity of the voice command at the in-vehicle microphone 81. If data collected by the in-vehicle camera 83, the seat pressure sensor 84, and the like then shows that no person is present inside the vehicle in that direction, it can be determined that the sound source position is outside the vehicle; for example, the voice command may have been spoken by the person 80 located outside the vehicle in that direction.
Step 932: Determine whether a person is present outside the vehicle in that direction. If not, determine that the sound source position is inside the vehicle and proceed to step 950 to control the vehicle to perform the control operation corresponding to the voice command; if so, persons are present both inside and outside the vehicle in that direction, a further determination based on other data is needed, and the method may proceed to step 940.
As shown in part (b) of FIG. 10, the in-vehicle microphone 81 is arranged near the center console of the vehicle. The direction of the sound source position, indicated by the dashed arrow in the figure, can first be determined from the pickup intensity of the voice command at the in-vehicle microphone 81. If data collected by the exterior camera 85, a radar, a laser rangefinder, or the like then shows that no person is present outside the vehicle in that direction, it can be determined that the sound source position is inside the vehicle; for example, the voice command may have been spoken by the person 80 in the driver's seat.
As shown in part (c) of FIG. 10, if data collected by the in-vehicle camera 83, the seat pressure sensor 84, and the like shows that a person is present inside the vehicle in that direction, and data collected by the exterior camera 85, a radar, a laser rangefinder, or the like shows that a person is also present outside the vehicle in that direction, it cannot yet be determined whether the sound source position is located inside the vehicle. A further determination using other methods (data) is therefore needed. This application does not specifically limit what those other methods are; for example, the method may proceed to step 940.
Step 940: Determine, from the audio features, whether the sound source position is located inside the vehicle. If so, proceed to step 950 to control the vehicle to perform the control operation corresponding to the voice command; if not, proceed to step 960 to send query information asking whether to execute the voice command. For how step 940 determines from the audio features whether the sound source position is located inside the vehicle, refer to the foregoing description of step 420; details are not repeated here.
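The branching of steps 931, 932, and 940 can be sketched as a small decision function. The occupancy flags are assumed to come from sensors such as the cameras and seat pressure sensor mentioned above, and the audio-feature check is passed in as a callable so that it runs only when needed; the names `Action` and `decide_method_900` are hypothetical.

```python
from enum import Enum
from typing import Callable


class Action(Enum):
    EXECUTE = "execute"  # step 950: perform the control operation
    ASK = "ask"          # step 960: send query information


def decide_method_900(person_inside_in_direction: bool,
                      person_outside_in_direction: bool,
                      audio_indicates_inside: Callable[[], bool]) -> Action:
    """Follow steps 931 -> 932 -> 940 for an already estimated direction."""
    # Step 931: nobody inside the vehicle in that direction means an outside source.
    if not person_inside_in_direction:
        return Action.ASK
    # Step 932: nobody outside in that direction means an inside source.
    if not person_outside_in_direction:
        return Action.EXECUTE
    # Step 940: persons on both sides, fall back to the audio features.
    return Action.EXECUTE if audio_indicates_inside() else Action.ASK
```

Ordering the checks this way keeps the comparatively expensive audio analysis as a last resort, which matches the motivation given above of limiting the required processor computing power.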
In this embodiment of this application, when it is determined that the sound source position of the voice command is outside the vehicle, the method proceeds to step 960 and sends query information asking whether to execute the voice command, rather than simply not executing the voice command as in the foregoing embodiment. The control method provided by this embodiment can therefore not only improve the security of vehicle voice control but also increase control flexibility, which helps improve the user experience.
Optionally, the query information here may be sound information, image information, text information, or the like.
For example, the query information may be a spoken query, issued through the speaker of the vehicle, asking whether to execute the voice command, or an image query displayed on the central control screen, the instrument screen, or a head-up display (HUD) device inside the vehicle. This application does not limit this.
For example, when a person outside the vehicle speaks the voice command "open the window", the vehicle determines through its built-in decision logic that the voice command comes from outside the vehicle. The in-vehicle speaker can then announce the query "Do you need to open the window?". If an affirmative reply is obtained from a person inside the vehicle through the touchscreen or the voice assistant, for example the driver says "Yes, open the window", the "open the window" operation can be performed. If the driver says "Do not open the window", or no further reply is received, the "open the window" operation may be left unperformed.
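The query-and-confirm behavior of step 960 could look like the sketch below. It assumes the reply has already been reduced to `True` (affirmative), `False` (negative), or `None` (no reply received); how a spoken or touchscreen reply is interpreted is outside the scope of this example, and `confirm_and_run` with its `ask` and `execute` callbacks is a hypothetical interface.

```python
from typing import Callable, Optional


def confirm_and_run(command: str,
                    ask: Callable[[str], Optional[bool]],
                    execute: Callable[[str], None]) -> bool:
    """Ask the occupants about an outside command; execute only on a clear yes."""
    reply = ask(f"Do you need to {command}?")
    if reply is True:
        execute(command)
        return True
    # A negative reply and a missing reply are treated the same way.
    return False
```

Treating silence like a refusal keeps the default safe: a command from outside the vehicle takes effect only when someone inside explicitly approves it.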
The vehicle control method provided by the embodiments of this application has been described in detail above with reference to FIG. 1 to FIG. 10. The apparatus of the embodiments of this application is described in detail below with reference to FIG. 11 and FIG. 12. It should be understood that the apparatuses shown in FIG. 11 and FIG. 12 can implement one or more steps of the method flows shown in FIG. 4, FIG. 7, and FIG. 9. To avoid repetition, details are not repeated here.
FIG. 11 is a schematic block diagram of a vehicle control apparatus 1100 provided by an embodiment of this application. As shown in FIG. 11, the control apparatus 1100 includes an obtaining unit 1110, a determining unit 1120, and a control unit 1130.
The obtaining unit 1110 is configured to obtain a voice command for controlling the vehicle;
the determining unit 1120 is configured to determine whether the sound source position of the voice command is located inside the vehicle; and
the control unit 1130 is configured to, when the sound source position is located inside the vehicle, control the vehicle to perform the operation corresponding to the voice command.
Optionally, the determining unit 1120 is specifically configured to:
determine, from the audio features of the voice command, whether the sound source position is located inside the vehicle.
Optionally, the audio features include a spectral structure.
Optionally, the determining unit 1120 is specifically configured to: determine, from the characteristics of the high-frequency part of the spectral structure of the voice command, whether the sound source position of the voice command is located inside the vehicle; and when the completeness of the high-frequency part satisfies a preset condition, determine that the sound source position is located inside the vehicle.
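One way to picture a "completeness of the high-frequency part" check is to measure how much of the signal energy lies above a cutoff frequency. The sketch below is an assumption-laden illustration rather than the method of this application: the energy-ratio criterion, the 4000 Hz cutoff, the 0.1 minimum ratio, and the function names are all hypothetical, and the naive DFT is used only to keep the example free of third-party dependencies.

```python
import cmath
import math


def high_band_energy_ratio(samples, sample_rate, cutoff_hz):
    """Fraction of non-DC spectral energy at or above cutoff_hz (naive O(n^2) DFT)."""
    n = len(samples)
    total = 0.0
    high = 0.0
    for k in range(1, n // 2 + 1):  # positive-frequency bins, DC excluded
        coeff = sum(
            samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)
        )
        power = abs(coeff) ** 2
        total += power
        if k * sample_rate / n >= cutoff_hz:
            high += power
    return high / total if total > 0.0 else 0.0


def high_band_intact(samples, sample_rate, cutoff_hz=4000.0, min_ratio=0.1):
    """Assumed preset condition: enough energy survives above the cutoff."""
    return high_band_energy_ratio(samples, sample_rate, cutoff_hz) >= min_ratio
```

The intuition behind such a check is that a closed vehicle body tends to attenuate high frequencies more than low ones, so speech arriving from outside would show a weakened upper band; a production system would more likely use an FFT library and calibrated thresholds.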
Optionally, the determining unit 1120 is specifically configured to:
determine, from the pickup intensity of the voice command at a microphone of the vehicle, whether the sound source position is located inside the vehicle.
Optionally, the determining unit 1120 is specifically configured to:
determine the direction of the sound source position from the pickup intensity of the voice command at multiple microphones; and
determine, from the direction, whether the sound source position is located inside or outside the vehicle.
Optionally, the determining unit 1120 determines whether the sound source position is located inside the vehicle by using at least one of the following items of information:
the audio features of the voice command;
the pickup intensity of the voice command at a microphone of the vehicle;
the distribution of persons inside the vehicle;
the distribution of persons within a preset range outside the vehicle;
the driving speed of the vehicle;
the open or closed state of the windows of the vehicle.
Optionally, the voice command includes a wake-up command, and the control unit 1130 is specifically configured to:
wake up the voice assistant of the vehicle.
Optionally, the control unit 1130 is specifically configured to:
perform semantic parsing on the voice command to obtain a control instruction; and
control the vehicle according to the control instruction.
Optionally, when the sound source position is located outside the vehicle, the control unit 1130 is further configured to:
control the vehicle not to execute the voice command.
Optionally, when the sound source position is located outside the vehicle and it is determined that a preset trigger condition is satisfied, the control unit 1130 is further configured to:
control the vehicle to perform the operation corresponding to the voice command.
Optionally, the control apparatus further includes:
a sending unit 1140, configured to, when the sound source position is located outside the vehicle, send query information asking whether to execute the voice command.
Specifically, the control apparatus 1100 may correspond to the vehicle, or to a device that controls the vehicle (for example, a cloud server), in method 400, method 700, and method 900 according to the embodiments of this application, or to a chip configured in the vehicle or in the vehicle control device. The control apparatus 1100 may include units for performing the methods performed by the vehicle in FIG. 4, FIG. 7, and FIG. 9. In addition, the units in the control apparatus 1100 and the other operations and/or functions described above are intended to implement the corresponding flows of method 400, method 700, and method 900, respectively. The specific process by which each unit performs the corresponding steps has been described in detail in method 400, method 700, and method 900 and, for brevity, is not repeated here.
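The division into units can be pictured as a small composition of callables. This is a structural sketch only: the class name `ControlApparatus`, the `run_once` method, and the returned status strings are hypothetical, and each unit is reduced to a plain function supplied by the caller.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ControlApparatus:
    acquire: Callable[[], str]            # obtaining unit 1110
    is_inside: Callable[[str], bool]      # determining unit 1120
    execute: Callable[[str], None]        # control unit 1130
    send_query: Optional[Callable[[str], None]] = None  # optional sending unit 1140

    def run_once(self) -> str:
        """Process one voice command and report what was done with it."""
        command = self.acquire()
        if self.is_inside(command):
            self.execute(command)
            return "executed"
        if self.send_query is not None:
            self.send_query(command)  # ask whether to execute the command
            return "queried"
        return "ignored"              # source outside, command not executed
```

Because the units are injected, the same skeleton could run in the vehicle, in a chip, or on a cloud server, which mirrors the deployment options listed above.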
FIG. 12 is a schematic structural diagram of a vehicle control apparatus 1200 provided by an embodiment of this application. The control apparatus 1200 may be a vehicle or a control device of a vehicle, or may be a chip or circuit arranged in a vehicle or in a control device of a vehicle. As shown in FIG. 12, the control apparatus 1200 includes a processor 1210, a memory 1220, and a communication interface 1230. The memory 1220 stores instructions, and the processor 1210 is configured to execute the instructions in the memory 1220. When the instructions are executed, the processor 1210 performs the method provided by the foregoing method embodiments. The processor 1210 is further configured to control the communication interface 1230 to communicate with the outside.
Further, the processor 1210, the memory 1220, and the communication interface 1230 may communicate with one another through an internal connection path to transfer control and/or data signals.
Further, the memory 1220 may be integrated into the processor 1210 or arranged separately from the processor 1210.
Specifically, the control apparatus 1200 may be configured to perform the steps of method 400, method 700, and method 900 in FIG. 4, FIG. 7, and FIG. 9. The control apparatus 1200 may further include modules for performing the methods performed by the vehicle in method 400, method 700, and method 900 in FIG. 4, FIG. 7, and FIG. 9. In addition, the modules in the control apparatus 1200 and the other operations and/or functions described above are intended to implement the corresponding flows of method 400, method 700, and method 900 in FIG. 4, FIG. 7, and FIG. 9, respectively. The specific process by which each module performs the corresponding steps has been described in detail in method 400, method 700, and method 900 and, for brevity, is not repeated here.
An embodiment of this application further provides a computer-readable storage medium. The computer-readable storage medium includes a computer program that, when run on a computer, causes the computer to perform the method provided by the foregoing method embodiments.
An embodiment of this application further provides a computer program product containing instructions. When the computer program product is run on a computer, it causes the computer to perform the method provided by the foregoing method embodiments.
An embodiment of this application further provides a chip system including a memory and a processor. The memory is configured to store a computer program, and the processor is configured to call the computer program from the memory and run it, so that a network device in which the chip system is installed performs the method provided by the foregoing method embodiments.
The chip system may include an input circuit or interface for sending information or data, and an output circuit or interface for receiving information or data.
It should be understood that, in the embodiments of this application, the processor may be a central processing unit (CPU), or may be another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field programmable gate array (FPGA) or another programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, or the like. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor or the like.
It should also be understood that the memory in the embodiments of this application may be a volatile memory or a non-volatile memory, or may include both volatile and non-volatile memory. The non-volatile memory may be a read-only memory (ROM), a programmable read-only memory (programmable ROM, PROM), an erasable programmable read-only memory (erasable PROM, EPROM), an electrically erasable programmable read-only memory (electrically EPROM, EEPROM), or a flash memory. The volatile memory may be a random access memory (RAM), which is used as an external cache. By way of example and not limitation, many forms of RAM are available, such as static random access memory (static RAM, SRAM), dynamic random access memory (DRAM), synchronous dynamic random access memory (synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (double data rate SDRAM, DDR SDRAM), enhanced synchronous dynamic random access memory (enhanced SDRAM, ESDRAM), synchlink dynamic random access memory (synchlink DRAM, SLDRAM), and direct rambus random access memory (direct rambus RAM, DR RAM).
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Those of ordinary skill in the art will appreciate that the units and algorithm steps of each example described in conjunction with the embodiments disclosed herein can be implemented with electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may implement the described functionality using different methods for each specific application, but such implementations should not be considered beyond the scope of this application.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that for the convenience and simplicity of description, the specific working processes of the systems, devices and units described above can be referred to the corresponding processes in the foregoing method embodiments, and will not be described again here.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed systems, devices and methods can be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or can be integrated into another system, or some features can be ignored, or not implemented. On the other hand, the coupling or direct coupling or communication connection between each other shown or discussed may be through some interfaces, and the indirect coupling or communication connection of the devices or units may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place, or they may be distributed to multiple network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。In addition, each functional unit in each embodiment of the present application can be integrated into one processing unit, each unit can exist physically alone, or two or more units can be integrated into one unit.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动 硬盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。If the functions are implemented in the form of software functional units and sold or used as independent products, they can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application is essentially or the part that contributes to the existing technology or the part of the technical solution can be embodied in the form of a software product. The computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in various embodiments of this application. The aforementioned storage media include: U disk, mobile Various media that can store program code include hard disk, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disk or optical disk.
The above are merely specific implementations of this application, but the protection scope of this application is not limited thereto. Any variation or replacement that a person skilled in the art can readily conceive of within the technical scope disclosed in this application shall fall within the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (16)

  1. A vehicle control method, characterized by comprising:
    obtaining a voice command for controlling a vehicle;
    determining whether a sound source position of the voice command is located inside the vehicle; and
    when the sound source position is located inside the vehicle, controlling the vehicle to perform an operation corresponding to the voice command.
  2. The control method according to claim 1, characterized in that the determining whether the sound source position of the voice command is located inside the vehicle comprises:
    determining, based on an audio feature of the voice command, whether the sound source position is located inside the vehicle.
  3. The control method according to claim 2, characterized in that the audio feature comprises a spectral structure.
  4. The control method according to claim 3, characterized in that the determining whether the sound source position of the voice command is located inside the vehicle comprises:
    determining, based on a feature of a high-frequency part of the spectral structure of the voice command, whether the sound source position of the voice command is located inside the vehicle; and
    when the completeness of the high-frequency part meets a preset condition, determining that the sound source position is located inside the vehicle.
  5. The control method according to any one of claims 1 to 4, characterized in that the determining whether the sound source position of the voice command is located inside the vehicle comprises:
    determining, based on a pickup intensity of the voice command at a microphone of the vehicle, whether the sound source position is located inside the vehicle.
  6. The control method according to claim 5, characterized in that the determining, based on the pickup intensity of the voice command at the microphone of the vehicle, whether the sound source position is located inside the vehicle comprises:
    determining an orientation of the sound source position based on pickup intensities of the voice command at a plurality of microphones; and
    determining, based on the orientation, whether the sound source position is located inside the vehicle.
  7. The control method according to claim 1, characterized in that whether the sound source position is located inside the vehicle is determined based on at least one of the following information:
    an audio feature of the voice command;
    a pickup intensity of the voice command at a microphone of the vehicle;
    a distribution of persons inside the vehicle;
    a distribution of persons within a preset range outside the vehicle;
    a driving speed of the vehicle; and
    an open or closed state of windows of the vehicle.
  8. The control method according to any one of claims 1 to 7, characterized in that the voice command comprises a wake-up command, and the controlling the vehicle to perform the operation corresponding to the voice command comprises:
    waking up a voice assistant of the vehicle.
  9. The control method according to any one of claims 1 to 8, characterized in that the controlling the vehicle to perform the operation corresponding to the voice command comprises:
    performing semantic recognition on the voice command to obtain a control instruction; and
    controlling the vehicle according to the control instruction.
  10. The control method according to any one of claims 1 to 9, characterized in that the method further comprises:
    when the sound source position is located outside the vehicle, controlling the vehicle not to execute the voice command.
  11. The control method according to any one of claims 1 to 9, characterized in that the method further comprises:
    when the sound source position is located outside the vehicle, if it is determined that a preset trigger condition is met, controlling the vehicle to perform the operation corresponding to the voice command.
  12. The control method according to any one of claims 1 to 9, characterized in that the method further comprises:
    when the sound source position is located outside the vehicle, sending query information asking whether to execute the voice command.
  13. A vehicle control apparatus, characterized by comprising at least one processor, wherein the at least one processor is configured to be coupled to a memory, and to read and execute instructions in the memory to implement the control method according to any one of claims 1 to 12.
  14. A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, and when the computer program is run on a computer, the computer is caused to perform the control method according to any one of claims 1 to 12.
  15. A chip system, characterized by comprising a processor configured to call a computer program from a memory and run the computer program, so that a computer on which the chip system is installed performs the control method according to any one of claims 1 to 12.
  16. A vehicle, characterized by comprising the vehicle control apparatus according to claim 13.
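
As an illustration only, the gating logic recited in claims 1, 4, 5 and 10 to 12 might be sketched as follows. This is a minimal sketch under stated assumptions, not the applicant's implementation: the names `VoiceObservation`, `source_is_inside` and `decide`, the two threshold values, and the choice to require both cues at once are all hypothetical, and the claims leave the "preset condition" and "preset trigger condition" unspecified.

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    EXECUTE = "execute"  # claim 1, or claim 11 when the trigger condition holds
    IGNORE = "ignore"    # claim 10
    ASK = "ask"          # claim 12


@dataclass
class VoiceObservation:
    # Hypothetical inputs; the claims do not define how they are measured.
    high_band_ratio: float  # share of spectral energy in the high-frequency part
    cabin_mic_level: float  # pickup intensity at a vehicle microphone, scaled 0..1


def source_is_inside(obs: VoiceObservation,
                     min_high_band_ratio: float = 0.15,
                     min_mic_level: float = 0.5) -> bool:
    """Guess whether the speaker is inside the vehicle.

    Assumption: the high-frequency part counts as "complete" when its energy
    share reaches a threshold (claim 4), and the pickup intensity must also
    reach a threshold (claim 5). Requiring both is a choice of this sketch.
    """
    high_band_complete = obs.high_band_ratio >= min_high_band_ratio
    picked_up_strongly = obs.cabin_mic_level >= min_mic_level
    return high_band_complete and picked_up_strongly


def decide(obs: VoiceObservation,
           trigger_met: bool = False,
           ask_when_outside: bool = False) -> Action:
    """Map an observation to one of the outcomes in claims 1 and 10 to 12."""
    if source_is_inside(obs):
        return Action.EXECUTE   # claim 1: source inside, execute the command
    if trigger_met:
        return Action.EXECUTE   # claim 11: outside, but trigger condition met
    if ask_when_outside:
        return Action.ASK       # claim 12: outside, send a query instead
    return Action.IGNORE        # claim 10: outside, do not execute
```

Claims 10 to 12 are alternatives that each depend on claims 1 to 9, so a real system would presumably pick one policy; the two flags here only make the alternatives visible side by side.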
PCT/CN2023/116496 2022-09-05 2023-09-01 Vehicle control method and control apparatus WO2024051592A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211098498.0A CN117711394A (en) 2022-09-05 2022-09-05 Vehicle control method and control device
CN202211098498.0 2022-09-05

Publications (1)

Publication Number Publication Date
WO2024051592A1 (en) 2024-03-14

Family

ID=90157603

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/116496 WO2024051592A1 (en) 2022-09-05 2023-09-01 Vehicle control method and control apparatus

Country Status (2)

Country Link
CN (1) CN117711394A (en)
WO (1) WO2024051592A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010266399A (en) * 2009-05-18 2010-11-25 Ono Sokki Co Ltd Sound source identification apparatus and sound source identifying program
JP2016185798A (en) * 2015-03-27 2016-10-27 パイオニア株式会社 Vehicle window closure forgetting prevention device, vehicle window closure forgetting prevention system, vehicle window closure forgetting prevention method and vehicle window closure forgetting prevention program
CN108986806A (en) * 2018-06-30 2018-12-11 上海爱优威软件开发有限公司 Sound control method and system based on sound source direction

Also Published As

Publication number Publication date
CN117711394A (en) 2024-03-15

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application
Ref document number: 23862286; Country of ref document: EP; Kind code of ref document: A1