WO2023065481A1 - 一种基于位置的语音交互方法及系统 - Google Patents

一种基于位置的语音交互方法及系统 Download PDF

Info

Publication number
WO2023065481A1
WO2023065481A1 PCT/CN2021/136137 CN2021136137W WO2023065481A1 WO 2023065481 A1 WO2023065481 A1 WO 2023065481A1 CN 2021136137 W CN2021136137 W CN 2021136137W WO 2023065481 A1 WO2023065481 A1 WO 2023065481A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
electronic device
voice
vehicle
target vehicle
Prior art date
Application number
PCT/CN2021/136137
Other languages
English (en)
French (fr)
Inventor
张瑜
Original Assignee
博泰车联网科技(上海)股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 博泰车联网科技(上海)股份有限公司 filed Critical 博泰车联网科技(上海)股份有限公司
Priority to EP21961219.9A priority Critical patent/EP4420936A1/en
Publication of WO2023065481A1 publication Critical patent/WO2023065481A1/zh
Priority to US18/642,285 priority patent/US20240276149A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/02Services making use of location information
    • H04W4/021Services related to particular areas, e.g. point of interest [POI] services, venue services or geofences
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/04817Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance using icons
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/04842Selection of displayed objects or displayed text elements
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/72409User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories
    • H04M1/724098Interfacing with an on-board device of a vehicle
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/72409User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories
    • H04M1/72412User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories using two-way short-range wireless interfaces
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72433User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/72457User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to geographic location
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/12Circuits for transducers, loudspeakers or microphones for distributing signals to two or more loudspeakers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W12/00Security arrangements; Authentication; Protecting privacy or anonymity
    • H04W12/60Context-dependent security
    • H04W12/63Location-dependent; Proximity-dependent
    • H04W12/64Location-dependent; Proximity-dependent using geofenced areas
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/30Services specially adapted for particular environments, situations or purposes
    • H04W4/40Services specially adapted for particular environments, situations or purposes for vehicles, e.g. vehicle-to-pedestrians [V2P]
    • H04W4/48Services specially adapted for particular environments, situations or purposes for vehicles, e.g. vehicle-to-pedestrians [V2P] for in-vehicle communication
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/60Substation equipment, e.g. for use by subscribers including speech amplifiers
    • H04M1/6033Substation equipment, e.g. for use by subscribers including speech amplifiers for providing handsfree use or a loudspeaker mode in telephone sets
    • H04M1/6041Portable telephones adapted for handsfree use
    • H04M1/6075Portable telephones adapted for handsfree use adapted for handsfree use in a vehicle
    • H04M1/6083Portable telephones adapted for handsfree use adapted for handsfree use in a vehicle by interfacing with the vehicle audio system
    • H04M1/6091Portable telephones adapted for handsfree use adapted for handsfree use in a vehicle by interfacing with the vehicle audio system including a wireless interface
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/72454User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to context-related or environment-related conditions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2250/00Details of telephonic subscriber devices
    • H04M2250/74Details of telephonic subscriber devices with voice recognition means
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2499/00Aspects covered by H04R or H04S not otherwise provided for in their subgroups
    • H04R2499/10General applications
    • H04R2499/13Acoustic transducers and sound field adaptation in vehicles
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W12/00Security arrangements; Authentication; Protecting privacy or anonymity
    • H04W12/12Detection or prevention of fraud
    • H04W12/121Wireless intrusion detection systems [WIDS]; Wireless intrusion prevention systems [WIPS]
    • H04W12/122Counter-measures against attacks; Protection against rogue devices

Definitions

  • the present invention relates to the technical field of vehicle voice interaction, and more specifically, to a location-based voice interaction method and system.
  • Car human-computer interaction in simple terms, is the process of information exchange between people and cars through a certain interaction method, which can directly affect the user's car experience.
  • the interaction methods are divided into two categories: in-vehicle cockpit interaction and out-of-vehicle interaction.
  • the interaction between the car and the user's cockpit is usually that the user interacts with the car through voice or the central control touch screen in the cockpit of the car, and the speakers in the car output and play related audio.
  • the out-of-vehicle interaction between the car and the user is usually that the user sends information to the vehicle from the remote end through a mobile phone, etc., to control the needs of the corresponding user of the vehicle.
  • the purpose of the present invention is to provide a location-based voice interaction method and system, which has the advantage that within the preset distance range of the vehicle, according to user operations, electronic equipment or its associated components are used to collect user voice and output and play it through the vehicle's external speakers , so as to realize the sound reinforcement system based on the vehicle, so that the vehicle has functions such as external voice playback.
  • Another object of the present invention is to provide a location-based voice interaction method and system, which has the advantage that within the preset distance of the vehicle, when it is determined that the user has a voice interaction demand, the user's voice is collected and the vehicle is woken up for audio output playback , so as to realize the vehicle-based sound reinforcement system and reduce the power consumption of the vehicle.
  • Another object of the present invention is to provide a location-based voice interaction method and system, which has the advantage that within the preset distance range of the vehicle, according to the relative position of the user and the vehicle, the corresponding external speaker is selected to output the user's voice information, So as to realize the precise control of the sound reinforcement direction.
  • Another object of the present invention is to provide a location-based voice interaction method and system, which has the advantage of providing the user with a voice interaction interface on the electronic device within the preset distance range of the vehicle to obtain the user's voice interaction needs, Realize accurate voice collection and voice playback, and improve the convenience of use.
  • Another object of the present invention is to provide a location-based voice interaction method and system, which has the advantage that within the preset distance range of the vehicle, the voice interaction interface of the electronic device prompts the user whether the voice interaction is in progress, thereby improving user experience.
  • Another object of the present invention is to provide a location-based voice interaction method and system, which has the advantage of automatically starting or stopping voice interaction according to user operations or the relative positional relationship between the user and the vehicle, improving user experience and reducing vehicle power consumption Condition.
  • the present invention provides a location-based voice interaction method, comprising the following steps:
  • the action detection instruction indicates to detect the user's action
  • the target vehicle is provided with at least one external speaker
  • the collected sound signal is output via the at least one external speaker.
  • a wake-up instruction is generated to be sent to the target vehicle, the wake-up instruction instructs to wake up a vehicle-mounted controller corresponding to the target vehicle.
  • the outputting the collected sound signal through the at least one external speaker further includes the following steps:
  • the collected sound signal is output via the target speaker.
  • the voice icon is displayed as a first icon on the interactive interface
  • the voice icon is displayed as a second icon on the interactive interface.
  • the displaying the interactive interface including the voice icon, and generating the action detection instruction further includes the following steps:
  • the interactive interface is hidden and/or the detection of the user's action is stopped.
  • the relative positional relationship between the electronic device and the target vehicle is determined based on an ultra-wideband method.
  • the present invention provides a location-based voice interaction method, comprising the following steps:
  • the voice collection instruction indicates to collect a user's voice signal
  • the target vehicle is provided with at least one external speaker
  • the sending instruction instructs to send the collected sound signal to the vehicle-mounted controller corresponding to the target vehicle; as well as
  • the collected sound signal is output via the at least one external speaker.
  • a wake-up instruction is generated to be sent to the target vehicle, the wake-up instruction instructs to wake up a vehicle-mounted controller corresponding to the target vehicle.
  • the outputting the collected sound signal through the at least one external speaker further includes the following steps:
  • the collected sound signal is output via the target speaker.
  • the voice icon is displayed as the first icon on the interactive interface
  • the voice icon is displayed as a second icon on the interactive interface.
  • the following steps are further included:
  • the displaying the interactive interface containing the voice icon, and generating the voice collection instruction further includes the following steps:
  • the relative positional relationship between the electronic device and the target vehicle is determined based on an ultra-wideband method.
  • the present invention provides a position-based voice interaction system, which includes electronic equipment, a vehicle-mounted controller and at least one external speaker arranged on the target vehicle, wherein,
  • the electronic device is communicatively connected to the on-board controller, which is configured to:
  • the action detection instruction indicates detection of a user's action
  • the on-board controller is communicatively connected to the at least one external speaker, and is configured to: receive the collected sound signal and send it to the at least one external speaker;
  • the at least one external speaker is configured to output the collected sound signal.
  • the electronic device further includes a first ultra-wideband communication module
  • the vehicle controller further includes a second ultra-wideband communication module, wherein,
  • the first UWB communication module and the second UWB communication module can establish a UWB-based communication connection, which is configured to measure the relative positional relationship between the electronic device and the target vehicle.
  • the present invention provides a position-based voice interaction system, including electronic equipment, an on-board controller and at least one external speaker arranged on the target vehicle:
  • the electronic device is communicatively connected to the on-board controller, which is configured to:
  • the voice collection instruction indicates to collect a user's voice signal
  • the target vehicle is provided with at least one external speaker
  • the sending instruction instructs to send the collected sound signal to the vehicle-mounted controller corresponding to the target vehicle;
  • the on-board controller is communicatively connected to the at least one external speaker, and is configured to: receive the collected sound signal and send it to the at least one external speaker;
  • the at least one external speaker is configured to output the collected sound signal.
  • the electronic device further includes a first ultra-wideband communication module
  • the vehicle controller further includes a second ultra-wideband communication module, wherein,
  • the first UWB communication module and the second UWB communication module can establish a UWB-based communication connection, which is configured to measure the relative positional relationship between the electronic device and the target vehicle.
  • FIG. 1 discloses a flowchart of a location-based voice interaction method according to an embodiment of the present invention
  • Fig. 2 discloses a schematic diagram of a first icon according to an embodiment of the present invention
  • Fig. 3 discloses a schematic diagram of a second icon according to an embodiment of the present invention.
  • FIG. 4 discloses a flow chart of a location-based voice interaction method according to another embodiment of the present invention.
  • FIG. 5 discloses a functional block diagram of a location-based voice interaction system according to an embodiment of the present invention
  • Fig. 6 discloses a functional block diagram of a location-based voice interaction system according to another embodiment of the present invention.
  • ultra-wideband (Ultra Wide Band, UWB) technology is a wireless carrier communication technology, which does not use sinusoidal carrier, but uses nanosecond-level non-sinusoidal narrow pulses to transmit data.
  • UWB technology solves the major problems in propagation that have plagued traditional wireless communication technology for many years. It has the advantages of insensitivity to channel fading, low power spectral density of transmitted signals, low interception rate, low system complexity, and positioning accuracy of several centimeters. .
  • Figure 1 discloses a flow chart of a location-based voice interaction method according to an embodiment of the present invention.
  • the location-based voice interaction method proposed by the present invention starts the sound collection process by detecting user actions, specifically including the following step:
  • step S11 and step S12 may be an electronic device.
  • the electronic device may be a wearable smart device such as a mobile phone, a tablet, smart glasses, a smart helmet, or a smart watch.
  • step S13 may be an electronic device or an associated component of the electronic device.
  • the associated component of the electronic device may be an external voice collection component such as an earphone, a headset, a microphone, etc. that establishes a wired or wireless connection with the electronic device.
  • an external voice collection component such as an earphone, a headset, a microphone, etc. that establishes a wired or wireless connection with the electronic device.
  • the subject of execution of the step S14 may be the target vehicle. Therefore, within the preset distance range of the vehicle, the electronic device or its associated components can be used to collect the user's voice according to the user's operation, and output and play it through the external speaker on the vehicle, so as to realize the sound reinforcement system based on the vehicle, and make the vehicle equipped with external voice playback, etc. Function.
  • the relative positional relationship between the mobile phone and the target vehicle may be measured based on near-field communication methods such as Bluetooth positioning and ultra-wideband UWB.
  • near-field communication methods such as Bluetooth positioning and ultra-wideband UWB.
  • the relative positional relationship between the mobile phone and the target vehicle is determined based on an ultra-wideband UWB method.
  • UWB technology Compared with RFID and Bluetooth technology, the accuracy of UWB distance measurement is higher, which can reach centimeter level.
  • UWB technology has extremely high time resolution.
  • the anti-interference performance of UWB is very strong.
  • the frequency band used by Bluetooth and WiFi is 2.4GHz, which is susceptible to external interference (for example, microwave ovens are also 2.4GHz).
  • the commonly used frequency bands for UWB are 6.5G and 8G, which are less susceptible to external interference.
  • the time stamp of UWB is extremely accurate and can resist multipath interference.
  • UWB has the ability to protect against relay attacks, which is mainly due to its ability to accurately calculate the actual physical distance.
  • the display screen of the mobile phone displays an interactive interface including the first icon, and generates an action detection instruction, the action detection instruction instructs to detect the user's action.
  • the relative positional relationship refers to the relative distance relationship and relative angle relationship between the electronic device (such as a mobile phone) and the target vehicle.
  • the above relative positional relationship needs to meet certain preset conditions before the interactive interface is displayed and further voice interaction process starts.
  • the preset condition is that the electronic device is outside the target vehicle.
  • the preset condition may be that the mobile phone is outside the car and 20 meters away from the car. In other embodiments, the preset condition may be that the mobile phone is outside the vehicle and within 180° from the left side of the target vehicle body.
  • the user's voice interaction requirement can be determined according to the relative positional relationship between the electronic device and the target vehicle.
  • FIG. 2 discloses a schematic diagram of a first icon according to an embodiment of the present invention.
  • the first icon shown in FIG. 2 is a small icon of a microphone.
  • the mobile phone and the target vehicle need to be authenticated and matched, which can also be identified through ultra-wideband UWB.
  • remote devices such as mobile phones are equivalent to digital keys.
  • the on-board controller of the target vehicle can be in an unawakened state, thereby saving the energy consumption of the target vehicle to a certain extent.
  • the vehicle-mounted controller of the target vehicle can be woken up.
  • the mobile phone generates a wake-up instruction to send to the target vehicle, and the wake-up instruction indicates to wake up the vehicle-mounted controller corresponding to the target vehicle.
  • the mobile phone when the mobile phone detects that the relative positional relationship between the mobile phone and the target vehicle does not meet the preset conditions, hide the interactive interface and/or stop detecting the user's actions, and consider that the mobile phone is far away from the target vehicle. Within a certain distance of the vehicle, voice interaction is stopped, which reduces the power consumption of the vehicle on the one hand, and improves the user experience on the other hand.
  • the mobile phone detects whether the user's action satisfies the preset activation action, and if the user's action satisfies the preset activation action, generates a sound collection instruction to collect the user's voice signal.
  • the user's preset activation action includes any of the following:
  • the user performs a second preset operation on the electronic device.
  • the first preset operation includes, but is not limited to: operations such as single click, double click, long press, heavy press, circle selection, slide or drag.
  • the second preset operation includes, but is not limited to: operations such as shaking and shaking.
  • the sound collection instruction instructs the mobile phone or associated components of electronic equipment such as earphones, headsets, and microphones to collect the user's voice signal.
  • the user's voice interaction needs can be determined according to the user's actions; Clear voice input improves user experience.
  • Associated components of electronic equipment such as mobile phones or earphone microphones execute sound collection instructions, collect user voice signals and send them to the target vehicle via electronic equipment (such as mobile phones). Further, when the associated components of electronic devices such as mobile phones or earphone microphones execute sound collection instructions, the user's voice signals can be collected and sent to the vehicle-mounted controller of the target vehicle via electronic devices (such as mobile phones) in real time.
  • the collected sound signal can be sent to the target vehicle through wireless communication.
  • the collected sound signal when the collected sound signal is sent to the target vehicle by wireless communication, it is not limited to the UWB channel, and other wireless data transmission methods such as Bluetooth, WIFI, and 4G/5G mobile communication networks can also be used.
  • the voice icon is displayed as the second icon on the interactive interface.
  • FIG. 3 discloses a schematic diagram of a second icon according to an embodiment of the present invention.
  • the second icon shown in FIG. 3 is a large icon of a microphone.
  • the mobile phone detects whether the user's action satisfies the preset disconnection action, and if the user's action meets the preset disconnection action, stop collecting the user's said sound signal, so that the user can start or stop it through the action operation Acquire sound signals.
  • the user's preset disconnection action also includes any of the following:
  • the user performs a second preset operation on the electronic device.
  • the first preset operation includes, but is not limited to: operations such as single click, double click, long press, heavy press, circle selection, slide or drag.
  • the second preset operation includes, but is not limited to: operations such as shaking and shaking.
  • the preset start action and the preset disconnection action can be set as corresponding operations, for example, click once to start collecting, and click again to stop collecting.
  • the preset start action and preset disconnection action can also be set as uncorresponding operations, such as one click to start collection, double click to stop collection or one click to start collection, and shake the phone to stop collection.
  • a plurality of external speakers provided in the vehicle output the collected user's voice signal.
  • multiple external speakers installed in the vehicle can output the collected user's voice signal in real time, or it can be delayed (for example, 10 minutes), timing (for example, set to 18:00) or triggered based on preset conditions (for example, detect that the car door is tapped or opened) and output the collected user's voice signal.
  • the present invention uses UWB technology to accurately obtain the relative positional relationship, when the vehicle is provided with multiple external speakers, the specified target speaker can be selected for sound playback based on the precise relative positional relationship, thereby satisfying the requirements of the vehicle. Stereo surround playback effect.
  • the target speakers when the vehicle is symmetrically distributed with 4 external speakers, the 2 target speakers that are closer to the user can be selected for playback. Another 2 target speakers for playback.
  • the corresponding external speaker within the angular range of the user can be selected as the target speaker to play. speaker.
  • the target speaker may also be determined according to the orientation of the user. For example, if the orientation of the user is facing south, an external speaker facing south is selected as the target speaker. Specifically, the orientation of the user may be determined according to the orientation of the movement track of the electronic device.
  • the corresponding external speaker can be selected to output the user's voice information, thereby realizing precise control of the sound reinforcement direction.
  • the flow of the location-based voice interaction method in the embodiment shown in FIG. 1 will be described below with reference to FIG. 2 and FIG. 3 , taking the electronic device as a mobile phone as an example.
  • a small microphone icon as shown in Figure 2 is displayed on the mobile phone interface for the user to select;
  • the mobile phone When the user clicks the small microphone icon on the mobile phone interface, the mobile phone detects the user's click action, generates a sound collection command, and the mobile phone microphone or earphone starts to collect the user's voice in real time, and the large microphone icon shown in Figure 3 is displayed on the mobile phone interface. Used to indicate that voice collection input is in progress;
  • the mobile phone sends the collected user voice signal to the target vehicle, and the external speaker of the target vehicle outputs the collected user voice signal.
  • the mobile phone When the relative positional relationship between the mobile phone and the target vehicle no longer satisfies the preset conditions, for example, the mobile phone enters the car or leaves a certain distance, hide the interactive interface and/or stop detecting the user's actions, the microphone icon on the mobile phone interface The hiding disappears, which on the one hand reduces the power consumption of mobile phones/vehicles, and on the other hand improves the user experience.
  • Figure 4 discloses a flow chart of a location-based voice interaction method according to another embodiment of the present invention.
  • the location-based voice interaction method proposed by the present invention judges the distance between the user's face and the mobile phone by detecting the user's voice , when the user's face is close to the electronic device, the collected sound is sent to the target vehicle, and the voice interaction with the target vehicle is started, which specifically includes the following steps:
  • steps S21 to S23 are executed by the electronic device and/or associated components of the electronic device.
  • the electronic device may be a wearable smart device such as a mobile phone, a tablet, smart glasses, a smart helmet, or a smart watch.
  • the associated component of the electronic device may be an external voice collection component such as an earphone, a headset, a microphone, etc. that establishes a wired or wireless connection with the electronic device.
  • an external voice collection component such as an earphone, a headset, a microphone, etc. that establishes a wired or wireless connection with the electronic device.
  • the execution subject of the step S24 is the target vehicle.
  • the relative positional relationship between the mobile phone and the target vehicle is measured based on the ultra-wideband UWB method.
  • the display screen of the mobile phone displays an interactive interface containing the first icon, and starts to collect the user's voice signal in real time.
  • a mobile phone or an earphone connected to the mobile phone may be used to collect the user's voice signal.
  • the collected user's voice signal is used as an audio input signal on the one hand, and as a judgment signal for sound field positioning of the relative position and distance between the user's face and the mobile phone on the other hand.
  • the microphone of the mobile phone collects the user's voice signal to perform sound field positioning on the relative position and distance between the user's face and the mobile phone.
  • the first location condition may be that the mobile phone is outside the car and 20 meters away from the car.
  • the first icon may be a small microphone icon as shown in FIG. 2 .
  • the mobile phone and the target vehicle need to be authenticated and matched, which can also be identified through ultra-wideband UWB.
  • remote devices such as mobile phones are equivalent to digital keys.
  • the on-board controller of the target vehicle can be in an unawakened state, thereby saving the energy consumption of the target vehicle to a certain extent.
  • the vehicle-mounted controller of the target vehicle can be woken up.
  • the mobile phone generates a wake-up instruction to send to the target vehicle, and the wake-up instruction indicates to wake up the vehicle-mounted controller corresponding to the target vehicle.
  • the mobile phone detects that the relative positional relationship between the mobile phone and the target vehicle does not satisfy the first position condition, hide the interactive interface and/or stop collecting the user's sound signal, it is considered that the mobile phone has moved away from the target vehicle.
  • the target vehicle is within a certain distance, and the voice interaction is stopped.
  • the mobile phone calculates the relative positional distance between the user's face and the mobile phone based on the collected sound signals.
  • the relative positional distance between the user's face and the mobile phone meets the second position condition, it is considered that the user needs to turn on the wireless microphone function to perform voice interaction with the car through the mobile phone.
  • the second position condition may be that the relative position distance between the face and the mobile phone is 5 centimeters.
  • the second location condition can be preset.
  • a sending instruction is generated, and the collected user voice signal is sent to the target vehicle.
  • the collected sound signal can be sent to the target vehicle through wireless communication.
  • the collected sound signal when the collected sound signal is sent to the target vehicle by wireless communication, it is not limited to the UWB channel, and other wireless data transmission methods such as Bluetooth, WIFI, and 4G/5G mobile communication networks can also be used.
  • the voice icon is displayed as a second icon on the interactive interface.
  • the second icon may be a large microphone icon as shown in FIG. 3 .
  • the mobile phone detects that the relative distance between the user's face and the mobile phone does not satisfy the second position condition, it stops sending the collected sound signal.
  • the technical solution of this embodiment actively collects the user's voice and sends it through the relative position distance between the user's face and the electronic device. Compared with the embodiment shown in Figure 1, it reduces the user's operation on the interactive interface and promotes Non-inductive voice interaction improves user experience and reduces energy consumption of electronic devices.
  • Multiple external speakers provided in the vehicle output the collected user's voice signals in real time.
  • the present invention uses UWB technology to accurately obtain the relative positional relationship, when the vehicle is provided with multiple external speakers, the specified target speaker can be selected for sound playback based on the precise relative positional relationship, thereby satisfying the requirements of the vehicle. Stereo surround playback effect.
  • the 2 target speakers that are closer to the user can be selected for playback.
  • Another 2 target speakers for playback can be selected for playback.
  • Fig. 5 discloses a functional block diagram of a location-based voice interaction system according to an embodiment of the present invention.
  • an on-board controller 520 and at least one external speaker 530 on the vehicle wherein,
  • the electronic device 510 is communicably connected to the vehicle controller 520, which is configured to:
  • the action detection instruction indicates to detect the user's action
  • the vehicle controller 520 is communicably connected to the at least one external speaker 530, and is configured to: receive the collected sound signal and send it to the at least one external speaker 530;
  • the at least one external speaker 530 is configured to output the collected sound signal.
  • the electronic device 510 also includes a first UWB communication module 511
  • the vehicle controller 520 also includes a second UWB communication module 521, wherein,
  • the first UWB communication module 511 and the second UWB communication module 521 can establish a UWB-based communication connection, which is configured to measure the relative positional relationship between the electronic device 510 and the target vehicle.
  • the electronic device 510 further includes an external voice collection component 512, and the external voice collection component 512 is configured to collect the user's voice signal according to the voice collection instruction.
  • the present invention uses UWB technology to accurately obtain the relative positional relationship, when the vehicle is provided with multiple external speakers 530, the specified target speaker can be selected for sound playback based on the precise relative positional relationship, thereby satisfying the requirements of the vehicle. stereo surround playback effect.
  • the 2 target speakers that are closer to the user can be selected for playback.
  • Another 2 target speakers for playback can be selected for playback.
  • FIG. 6 discloses a functional block diagram of a location-based voice interaction system according to another embodiment of the present invention.
  • the location-based voice interaction system proposed by the present invention includes an electronic device 610 and on-board controller 620 and at least one external speaker 630 on the target vehicle, wherein,
  • the electronic device 610 is communicably connected to the vehicle controller 620, which is configured to:
  • the voice collection instruction indicates to collect the user's voice signal
  • the target vehicle is provided with at least one external speaker 630;
  • a sending instruction is generated, and the sending instruction instructs to send the collected sound signal to the vehicle-mounted controller corresponding to the target vehicle 620;
  • the vehicle controller 620 is communicatively connected to the at least one external speaker 630, and is configured to: receive the collected sound signal and send it to the at least one external speaker 630;
  • the at least one external speaker 630 is configured to output the collected sound signal.
  • the electronic device 610 also includes a first UWB communication module 611
  • the vehicle controller 620 also includes a second UWB communication module 621, wherein,
  • the first UWB communication module 611 and the second UWB communication module 621 can establish a UWB-based communication connection, which is configured to measure the relative positional relationship between the electronic device 610 and the target vehicle.
  • the present invention uses UWB technology to accurately obtain the relative positional relationship, when the vehicle is provided with multiple external speakers 630, the specified target speaker can be selected for sound playback based on the precise relative positional relationship, thereby satisfying the requirements of the vehicle. stereo surround playback effect.
  • the 2 target speakers that are closer to the user can be selected for playback.
  • Another 2 target speakers for playback can be selected for playback.
  • a position-based voice interaction method and system proposed by the present invention replaces the traditional microphone with the mobile phone, activates the microphone of the mobile phone within the range of the precise position and distance, realizes the sound reproduction in the external speaker of the car, and realizes the mobile audio mode of the car.
  • Personnel outside the vehicle use the microphone of the mobile phone and the external sound playback function of the vehicle to enable the vehicle to have application functions such as karaoke outside the vehicle and real-time speeches around the outside of the vehicle.
  • DSP digital signal processor
  • ASIC application-specific integrated circuit
  • FPGA field-programmable gate array
  • a general-purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine.
  • a processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, multiple microprocessors, one or more microprocessors in cooperation with a DSP core, or any other such configuration.
  • a software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art.
  • An exemplary storage medium is coupled to the processor such that the processor can read information from, and write information to, the storage medium.
  • the storage medium may be integrated into the processor.
  • the processor and storage medium can reside in an ASIC.
  • the ASIC may reside in a user terminal.
  • the processor and storage medium may reside as discrete components in the user terminal.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Computer Security & Cryptography (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Environmental & Geological Engineering (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
  • Telephone Function (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

一种基于位置的语音交互方法及系统,涉及汽车语音交互技术领域。上述方法包括以下步骤:响应于检测到电子设备与目标车辆之间的相对位置关系满足预设条件,显示包含有语音图标的交互界面,并生成动作检测指令(S11),所述目标车辆设置至少一个外部扬声器;响应于检测到所述用户的预设开启动作,生成声音采集指令(S12);执行所述声音采集指令,以采集所述声音信号并发送至所述目标车辆相对应的车载控制器(S13);以及经由所述至少一个外部扬声器输出所采集的声音信号(S14)。上述方法通过在车辆设置外部扬声器,用户在车外时可以在精确距离范围内与汽车进行直接语音交互,实现汽车的移动音响模式。

Description

一种基于位置的语音交互方法及系统 技术领域
本发明涉及汽车语音交互技术领域,更具体地说,涉及一种基于位置的语音交互方法及系统。
背景技术
汽车人机交互,简单来说就是人与汽车通过一定的交互方式完成的信息交换过程,它能够直接影响用户的用车体验。一般来说,交互方式分为车内座舱交互以及车外交互两大类。
汽车与用户的车内座舱交互方式,通常是用户在车内座舱,通过语音或者中控触屏等方式与汽车进行交互,车内扬声器进行相关音频的输出播放。
汽车与用户的车外交互方式,通常是用户通过手机等方式从远端发送信息给车辆,控制车辆相应用户的需求。
当用户需要使用汽车扬声器作为音频输出设备时,只能采用车内座舱交互方式,目前并没有用户在车外时与汽车进行直接语音交互方式。
发明内容
本发明的目的是提供一种基于位置的语音交互方法及系统,其优势在于,在车辆预设距离范围内,根据用户操作利用电子设备或其关联部件采集用户语音并通过车载外部扬声器进行输出播放,从而实现基于车辆的扩声系统,使车辆具备对外语音播放等功能。
本发明的另一个目的是提供一种基于位置的语音交互方法及系统,其优势在于,在车辆预设距离范围内,在确定用户存在语音交互需求时采集用户声音并唤醒车辆以进行音频输出播放,从而实现基于车辆的扩声系统,并减少车辆的耗电情况。
本发明的另一个目的是提供一种基于位置的语音交互方法及系统,其优势在于,在车辆预设距离范围内,根据用户与车辆的相对位置,选择相对应的外部扬声器输出用户声音信息,从而实现扩声方向的精准控制。
本发明的另一个目的是提供一种基于位置的语音交互方法及系统,其优势在于,在车辆预设距离范围内,在电子设备上为用户提供语音交互界面,以获取用户的语音交互需求,实现精准的语音采集和语音播放,提升使用的便捷性。
本发明的另一个目的是提供一种基于位置的语音交互方法及系统,其优势在于,在车辆预设距离范围内,通过电子设备的语音交互界面提示用户是否正在进行语音交互,提高用户体验。
本发明的另一个目的是提供一种基于位置的语音交互方法及系统,其优势在于,根据用户操作或者用户与车辆的相对位置关系自动开始或者停止语音交互,提升用户体验,减少车辆的耗电情况。
为了实现上述目的,本发明提供了一种基于位置的语音交互方法,包括以下步骤:
响应于检测到所述电子设备与目标车辆之间的相对位置关系满足预设条件,显示包含有语音图标的交互界面,并生成动作检测指令,所述动作检测指令指示检测用户的动作,所述目标车辆设置至少一个外部扬声器;
响应于检测到所述用户的预设开启动作,生成声音采集指令,所述声音采集指令指示采集所述用户的声音信号;
执行所述声音采集指令,以采集所述声音信号并发送至所述目标车辆相对应的车载控制器;以及
经由所述至少一个外部扬声器输出所采集的声音信号。
在一实施例中,所述生成声音采集指令之前,还包括以下步骤:
生成唤醒指令以发送至所述目标车辆,所述唤醒指令指示唤醒所述目标车辆相对应的车载控制器。
在一实施例中,所述经由所述至少一个外部扬声器输出所采集的声音信号,进一步还包括以下步骤:
基于所述相对位置关系,在所述至少一个外部扬声器中选择目标扬声器;以及
经由所述目标扬声器输出所采集的声音信号。
在一实施例中,在所述声音采集指令未被执行时,所述语音图标在所述交互界面上被显示为第一图标;以及
在所述声音采集指令被执行时,所述语音图标在所述交互界面上被显示为第二图标。
在一实施例中,所述执行所述声音采集指令之后,进一步包括以下步骤:
响应于检测到所述用户的预设断开动作,停止采集所述用户的所述声音信号。
在一实施例中,所述显示包含有语音图标的交互界面,并生成动作检测指令之后,进一步包括以下步骤:
响应于检测到所述电子设备与目标车辆之间的相对位置关系不满足预设条件,隐藏所述交互界面和/或停止检测所述用户的动作。
在一实施例中,所述电子设备与目标车辆之间的相对位置关系是基于超宽带方式而测定的。
为了实现上述目的,本发明提供了一种基于位置的语音交互方法,包括以下步骤:
响应于检测到所述电子设备与目标车辆之间的相对位置关系满足第一位置条 件,显示包含有语音图标的交互界面,并生成声音采集指令,所述声音采集指令指示采集用户的声音信号,所述目标车辆设置至少一个外部扬声器;
基于所采集的声音信号,确定所述用户的面部与所述电子设备的相对位置距离;
响应于确定所述用户的面部与所述电子设备的相对位置距离满足第二位置条件,生成发送指令,所述发送指令指示发送所采集的声音信号至所述目标车辆相对应的车载控制器;以及
经由所述至少一个外部扬声器输出所采集的声音信号。
在一实施例中,所述生成发送指令之前,还包括以下步骤:
生成唤醒指令以发送至所述目标车辆,所述唤醒指令指示唤醒所述目标车辆相对应的车载控制器。
在一实施例中,所述经由所述至少一个外部扬声器输出所采集的声音信号,进一步还包括以下步骤:
基于所述相对位置关系,在所述至少一个外部扬声器中选择目标扬声器;以及
经由所述目标扬声器输出所采集的声音信号。
在一实施例中,在所述用户的面部与所述电子设备的相对位置距离未满足第二位置条件时,所述语音图标在所述交互界面上显示为第一图标;以及
在所述用户的面部与所述电子设备的相对位置距离满足第二位置条件时,所述语音图标在所述交互界面上显示为第二图标。
在一实施例中,所述确定所述用户的面部与所述电子设备的相对位置距离之后,还包括以下步骤:
响应于确定所述用户的面部与所述电子设备的相对位置距离不满足第二位置条件,停止发送所采集的声音信号。
在一实施例中,所述显示包含有语音图标的交互界面,并生成声音采集指令之后,进一步包括以下步骤:
响应于检测到所述电子设备与目标车辆之间的相对位置关系不满足第一位置条件,隐藏所述交互界面和/或停止采集所述用户的声音信号。
在一实施例中,所述电子设备与目标车辆之间的相对位置关系是基于超宽带方式而测定的。
为了实现上述目的,本发明提供了一种基于位置的语音交互系统,包括电子设备以及设置于目标车辆上的车载控制器和至少一个外部扬声器,其中,
所述电子设备可通信地连接于所述车载控制器,其被配置为:
响应于检测到所述电子设备与目标车辆之间的相对位置关系满足预设条件,显示包含有语音图标的交互界面,并生成动作检测指令,所述动作检测指令指示检测用户的动作;
响应于检测到所述用户的预设开启动作,生成声音采集指令,所述声音采集指令指示采集所述用户的声音信号;以及
执行所述声音采集指令,以采集所述声音信号并发送至所述目标车辆相对应的车载控制器;
所述车载控制器可通信地连接于所述至少一个外部扬声器,其被配置为:接收所采集的声音信号,并发送至所述至少一个外部扬声器;
所述至少一个外部扬声器被配置为输出所采集的声音信号。
在一实施例中,所述电子设备还包括第一超宽带通信模块,所述车载控制器还包括第二超宽带通信模块,其中,
所述第一超宽带通信模块与所述第二超宽带通信模块可建立基于超宽带方式的通信连接,其被配置为测定所述电子设备与目标车辆之间的相对位置关系。
为了实现上述目的,本发明提供了一种基于位置的语音交互系统,包括电子设备以及设置于目标车辆上的车载控制器和至少一个外部扬声器:
所述电子设备可通信地连接于所述车载控制器,其被配置为:
响应于检测到所述电子设备与目标车辆之间的相对位置关系满足第一位置条件,显示包含有语音图标的交互界面,并生成声音采集指令,所述声音采集指令指示采集用户的声音信号,所述目标车辆设置至少一个外部扬声器;
基于所采集的声音信号,确定所述用户的面部与所述电子设备的相对位置距离;
响应于确定所述用户的面部与所述电子设备的相对位置距离满足第二位置条件,生成发送指令,所述发送指令指示发送所采集的声音信号至所述目标车辆相对应的车载控制器;
所述车载控制器可通信地连接于所述至少一个外部扬声器,其被配置为:接收所采集的声音信号,并发送至所述至少一个外部扬声器;
所述至少一个外部扬声器被配置为输出所采集的声音信号。
在一实施例中,所述电子设备还包括第一超宽带通信模块,所述车载控制器还包括第二超宽带通信模块,其中,
所述第一超宽带通信模块与所述第二超宽带通信模块可建立基于超宽带方式的通信连接,其被配置为测定所述电子设备与目标车辆之间的相对位置关系。
附图说明
本发明上述的以及其他的特征、性质和优势将通过下面结合附图和实施例的描述而变的更加明显,在附图中相同的附图标记始终表示相同的特征,其中:
图1揭示了根据本发明一实施例的基于位置的语音交互方法流程图;
图2揭示了根据本发明一实施例的第一图标示意图;
图3揭示了根据本发明一实施例的第二图标示意图;
图4揭示了根据本发明另一实施例的基于位置的语音交互方法流程图;
图5揭示了根据本发明一实施例的基于位置的语音交互系统的原理框图;
图6揭示了根据本发明另一实施例的基于位置的语音交互系统的原理框图。
具体实施方式
为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释发明,并不用于限定发明。
如前所述,当用户需要使用汽车扬声器作为音频输出设备时,只能采用车内座舱交互方式,目前并没有用户在车外时与汽车进行直接语音交互方式。所谓的车外直接语音交互方式,是指用户对车辆进行语音输入,车辆通过外置扬声器(包括但不限于振子发声面板)等方式对外播放声音,从而将车辆作为一个移动的音响,使得车辆满足如K歌、实时演讲等使用场景。
但是,一方面,由于目前车辆对外播放也只能通过车内扬声器进行相关音频的输出播放,用户在车外很难听清输出的音频信息;
另一方面,现有技术手机与汽车之间的连接多为蓝牙、RFID(射频识别)和WiFi等技术,这样用户在车外时与汽车进行直接语音交互时又存在以下问题:
1)定位精度问题,现有技术的连接方式的测距精度普遍较低,难以进行精确定位以区分用户是在车内还是车外,也就难以进行语音交互的准确设计;
2)安全性问题,现有技术的连接方式容易受到中继攻击的问题,影响汽车的使用安全。
目前,超宽带(Ultra Wide Band,UWB)技术是一种无线载波通信技术,它不采用正弦载波,而是利用纳秒级的非正弦波窄脉冲传输数据。UWB技术解决了困扰传统无线通信技术多年的有关传播方面的重大难题,具有对信道衰落不敏感、发射信号功率谱密度低、截获率低、系统复杂度低、能提供数厘米的定位精度等优点。
图1揭示了根据本发明一实施例的基于位置的语音交互方法流程图,如图1所示,本发明提出的基于位置的语音交互方法,通过检测用户的动作开启声音采集流程,具体包括以下步骤:
S11、响应于检测到电子设备与目标车辆之间的相对位置关系满足预设条件,显示包含有语音图标的交互界面,并生成动作检测指令,所述动作检测指令指示检测用户的动作,所述目标车辆设置至少一个外部扬声器;
S12、响应于检测到所述用户的预设开启动作,生成声音采集指令,所述声音采集指令指示采集所述用户的声音信号;
S13、执行所述声音采集指令,以采集所述声音信号并发送至所述目标车辆相对应的车载控制器;以及
S14、经由所述至少一个外部扬声器输出所采集的声音信号。
在图1所示的实施例中,步骤S11和步骤S12的执行主体可以为电子设备。
所述电子设备可以是手机、平板、智能眼镜、智能头盔或者智能手表等可穿戴智能设备。
在图1所述的实施例中,步骤S13的执行主体可以为电子设备或电子设备的关联部件。
所述电子设备的关联部件可以是耳机、耳麦、话筒等与所述电子设备建立有线或无线连接的外接语音采集部件。
如图1所述的实施例中,所述步骤S14的执行主体可以为目标车辆。由此,在车辆预设距离范围内,能够根据用户操作利用电子设备或其关联部件采集用户语音并通过车载外部扬声器进行输出播放,从而实现基于车辆的扩声系统,使车辆具备对外语音播放等功能。
下面以电子设备是手机为例,详细说明如图1所示的实施例中,本发明提出的基于位置的语音交互方法的具体步骤。
S11、响应于检测到电子设备与目标车辆之间的相对位置关系满足预设条件,显示包含有语音图标的交互界面,并生成动作检测指令,所述动作检测指令指示检测用户的动作,所述目标车辆设置至少一个外部扬声器。
本实施例中,手机与目标车辆之间的相对位置关系可以是基于蓝牙定位、超宽带UWB等近场通信方式而测定的。优选地,手机与目标车辆之间的相对位置关系是基于超宽带UWB方式而测定的。
相对于RFID和蓝牙技术,UWB测量距离的精确度更高,可以达到厘米级。此外,UWB技术具有极高的时间分辨率。UWB的抗干扰性能很强。蓝牙与WiFi使用频段为2.4GHz,易受外界干扰(例如,微波炉也是2.4GHz)。UWB常用频段为是6.5G和8G,不易受外界干扰。而且UWB的时间戳极为精确,可以做到抗多径干扰。安全性方面,UWB具备中继攻击保护能力,这主要得益于其精确计算实际物理距离的特性,如果判断飞行时间过长,直接判定基于UWB技术的手机端不在有效范围内,即便放大信号干扰,也无法与汽车进行通信与语音交互,攻击者无法对其进行欺骗。
当检测到手机与目标车辆之间的相对位置关系满足预设条件,手机的显示屏显示包含有第一图标的交互界面,并生成动作检测指令,所述动作检测指令指示检测用户的动作。
所述相对位置关系,是指电子设备(例如手机)与目标车辆之间的相对距离关系和相对角度关系。上述相对位置关系需要满足一定的预设条件,才会显示交互界面,开始进一步的语音交互流程。优选地,所述预设条件为电子设备处于目标车辆之外。
在一些实施例中,预设条件可以为手机在车外且离车20米。在另一些实施例 中,预设条件可以为手机在车外且在目标车辆车身左侧180°范围内。由此可以根据电子设备与目标车辆之间的相对位置关系来确定用户的语音交互需求。
图2揭示了根据本发明一实施例的第一图标示意图,如图2所示的第一图标为麦克风的小图标。
在进行相对位置关系的检测之前,手机与目标车辆需要进行身份鉴权匹配,同样可以通过超宽带UWB方式进行鉴别,此时手机等远端设备相当于数字钥匙。
手机与目标车辆设备在进行UWB鉴别时,目标车辆的车载控制器可以处于未唤醒状态,从而一定程度上节约了目标车辆的能源消耗。
当检测到手机与目标车辆之间的相对位置关系满足预设条件时,可以对目标车辆的车载控制器进行唤醒。
手机生成唤醒指令以发送至所述目标车辆,所述唤醒指令指示唤醒所述目标车辆相对应的车载控制器。
更进一步的,当手机检测到所述手机与目标车辆之间的相对位置关系不满足预设条件时,隐藏所述交互界面和/或停止检测所述用户的动作,此时认为手机已经远离目标车辆一定距离范围,停止语音交互,从而一方面减少了车辆的耗电情况,另一方面提升了用户的使用体验。
S12、响应于检测到所述用户的预设开启动作,生成声音采集指令,所述声音采集指令指示采集所述用户的声音信号。
手机检测所述用户的动作是否满足预设开启动作,如果用户的动作满足预设开启动作,则生成声音采集指令,采集用户的声音信号。
可选的,用户的预设开启动作,包括以下的任意一种:
所述用户针对所述语音图标的第一预设操作;以及
所述用户针对所述电子设备的第二预设操作。
可选的,第一预设操作包括但不限于:单击、双击、长按、重压、圈选、滑动或拖动等操作。
可选的,第二预设操作包括但不限于:摇一摇、晃一晃等操作。
所述声音采集指令指示手机或耳机、耳麦、话筒等电子设备的关联部件采集所述用户的声音信号。由此,一方面可以根据用户的动作来确定用户的语音交互需求,另一方面可以无需准备专用的麦克风,并且进一步解放了用户的双手,使得用户可以在查阅电子设备上呈现的内容的同时进行清晰的语音输入,提高了用户体验。
S13、执行所述声音采集指令,以采集所述声音信号并发送至所述目标车辆相对应的车载控制器。
手机或耳机话筒等电子设备的关联部件执行声音采集指令,采集用户的声音信号并经由电子设备(例如手机)发送至目标车辆。进一步地,手机或耳机话筒等电子设备的关联部件执行声音采集指令时,可以采集用户的声音信号并实时经 由电子设备(例如手机)发送至目标车辆的车载控制器。
所采集的声音信号,可以通过无线通讯方式发送至目标车辆。
本实施例中,采用无线通讯方式将采集的声音信号发送至目标车辆时,并不仅限于UWB通道,也可以通过蓝牙、WIFI、4G/5G移动通信网络等其他无线数据传输方式。
电子设备或电子设备的关联部件执行声音采集指令时,语音图标在所述交互界面上被显示为第二图标。
图3揭示了根据本发明一实施例的第二图标示意图,如图3所示的第二图标为麦克风的大图标。
更进一步的,手机检测所述用户的动作是否满足预设断开动作,如果用户的动作满足预设断开动作,停止采集所述用户的所述声音信号,从而用户可以通过动作操作开始或停止采集声音信号。
可选的,用户的预设断开动作,同样包括以下的任意一种:
所述用户针对所述语音图标的第一预设操作;以及
所述用户针对所述电子设备的第二预设操作。
可选的,第一预设操作包括但不限于:单击、双击、长按、重压、圈选、滑动或拖动等操作。
可选的,第二预设操作包括但不限于:摇一摇、晃一晃等操作。
显然,预设开始动作和预设断开动作可以设置为对应的操作,例如单击一次开始采集,再单击一次停止采集。
预设开始动作和预设断开动作也可以设置为不对应的操作,例如单击一次开始采集,双击一次停止采集或者单击一次开始采集,摇一摇手机停止采集。
需要进一步说明的,当用户的动作满足预设断开动作,停止采集所述用户的所述声音信号之后,如果再次满足预设开启动作,同样可以再次开始采集声音信号,语音交互流程并没有停止。
S14、经由所述至少一个外部扬声器输出所采集的声音信号。
车辆设置的多个外部扬声器输出所采集的用户的声音信号。
进一步地,车辆设置的多个外部扬声器可以实时地输出所采集的用户的声音信号,也可以是延时(例如延迟10min)、定时(例如设定为18:00)或基于预设条件的触发(例如检测到车门被轻敲或被打开)而输出所采集的用户的声音信号。
更进一步的,由于本发明采用UWB技术可以精确获得相对位置关系,因此,当车辆设置有多个外部扬声器时,可以基于精确的相对位置关系,选择指定的目标扬声器进行声音播放,从而满足车辆的立体声环绕播放效果。
例如,当车辆对称分布设置有4个外部扬声器时,可以选择距离用户较近的2个目标扬声器进行播放,当用户改变位置时绕到车辆另一侧时,此时可以选择 离用户更近的另外2个目标扬声器进行播放。又例如,当车辆对称分布设置有4个外部扬声器时,可以选择用户所处角度范围内对应的外部扬声器为目标扬声器进行播放,例如用户处于车身左前90°范围内,则确定左前外部扬声器为目标扬声器。进一步地,还可以根据用户的朝向来确定目标扬声器,例如用户的朝向为朝南,则选择朝南的外部扬声器为目标扬声器。具体地,可以根据电子设备的移动轨迹朝向来确定用户的朝向。
由此,可以根据用户与车辆的相对位置关系,选择相对应的外部扬声器输出用户声音信息,从而实现扩声方向的精准控制。
下面结合图2和图3,以电子设备是手机为例,说明如图1所示的实施例的基于位置的语音交互方法流程。
当手机与目标车辆之间的相对位置关系满足预设条件时,手机界面上显示如图2所示的麦克风小图标,供用户进行选择;
当用户点击手机界面上的麦克风小图标时,手机检测到用户的点击动作,生成声音采集指令,手机话筒或者耳机开始实时采集用户的声音,手机界面上显示如图3所述的麦克风大图标,用于指示正在进行语音采集输入;
手机将采集的用户声音信号发送至目标车辆,目标车辆的外部扬声器输出所采集的用户声音信号。
当用户再次点击手机界面的麦克风大图标时,停止采集用户的声音,手机界面显示为如图2所示的麦克风小图标,并关闭无线麦克风功能,但是仍可接收用户再次点击手机界面上的麦克风小图标,从而再次打开无线麦克风功能,开始采集用户的声音。由此,实现精准地语音采集和语音播放。
当手机与目标车辆之间的相对位置关系不再满足预设条件时,例如手机进入车内或者离开一定距离,隐藏所述交互界面和/或停止检测所述用户的动作,手机界面的麦克风图标隐藏消失,从而一方面减少了手机/车辆的耗电情况,另一方面提升了用户的使用体验。
图4揭示了根据本发明另一实施例的基于位置的语音交互方法流程图,如图4所示,本发明提出的基于位置的语音交互方法,通过检测用户声音,判断用户面部与手机的距离,在用户面部贴近电子设备时将所采集的声音发送至目标车辆,开启与目标车辆的语音交互,具体包括以下步骤:
S21、响应于检测到所述电子设备与目标车辆之间的相对位置关系满足第一位置条件,显示包含有语音图标的交互界面,并生成声音采集指令,所述声音采集指令指示采集用户的声音信号,所述目标车辆设置至少一个外部扬声器;
S22、基于所采集的声音信号,确定所述用户的面部与所述电子设备的相对位置距离;
S23、响应于确定所述用户的面部与所述电子设备的相对位置距离满足第二位置条件,生成发送指令,所述发送指令指示发送所采集的声音信号至所述目标 车辆相对应的车载控制器;以及
S24、经由所述至少一个外部扬声器输出所采集的声音信号。
在图4所示的实施例中,步骤S21-步骤S23的执行主体为电子设备和/或电子设备的关联部件。
所述电子设备可以是手机、平板、智能眼镜、智能头盔或者智能手表等可穿戴智能设备。
所述电子设备的关联部件可以是耳机、耳麦、话筒等与所述电子设备建立有线或无线连接的外接语音采集部件。
在图4所述的实施例中,所述步骤S24的执行主体为目标车辆。
下面以电子设备是手机为例,详细说明如图4所示的实施例中,本发明提出的基于位置的语音交互方法的具体步骤。
S21、响应于检测到所述电子设备与目标车辆之间的相对位置关系满足第一位置条件,显示包含有语音图标的交互界面,并生成声音采集指令,所述声音采集指令指示采集用户的声音信号,所述目标车辆设置至少一个外部扬声器;
本实施例中,手机与目标车辆之间的相对位置关系是基于超宽带UWB方式而测定的。
当检测到手机与目标车辆之间的相对位置关系满足第一位置条件,手机的显示屏显示包含有第一图标的交互界面,并开始实时采集用户的声音信号。
本实施例中,可以采用手机或者与手机连接的耳机采集用户的声音信号。
所采集的用户声音信号,一方面作为音频输入信号,另一个方面,作为对用户面部与手机的相对位置距离进行声场定位的判断信号。
当与手机连接的耳机所采集的用户声音信号作为音频输入时,手机麦克风采集用户声音信号对用户的面部与手机的相对位置距离进行声场定位。
在一些实施例中,第一位置条件可以为手机在车外且离车20米。
第一图标可以是如图2所示的麦克风的小图标。
在进行相对位置关系的检测之前,手机与目标车辆需要进行身份鉴权匹配,同样可以通过超宽带UWB方式进行鉴别,此时手机等远端设备相当于数字钥匙。
手机与目标车辆设备在进行UWB鉴别时,目标车辆的车载控制器可以处于未唤醒状态,从而一定程度上节约了目标车辆的能源消耗。
当检测到手机与目标车辆之间的相对位置关系满足第一位置条件,可以对目标车辆的车载控制器进行唤醒。
手机生成唤醒指令以发送至所述目标车辆,所述唤醒指令指示唤醒所述目标车辆相对应的车载控制器。
更进一步的,当手机检测到所述手机与目标车辆之间的相对位置关系不满足第一位置条件,隐藏所述交互界面和/或停止采集所述用户的声音信号,此时认为手机已经远离目标车辆一定距离范围,停止语音交互。
S22、基于所采集的声音信号,确定所述用户的面部与所述电子设备的相对位置距离。
手机根据所采集的声音信号,计算用户面部与手机的相对位置距离,当用户面部与手机的相对位置距离满足第二位置条件时,认为用户需要开启无线麦克风功能,通过手机与汽车进行语音交互。
在一些实施例中,第二位置条件可以为面部与手机的相对位置距离为5厘米。
显然,第二位置条件是可以预先设置的。
S23、响应于确定所述用户的面部与所述电子设备的相对位置距离满足第二位置条件,生成发送指令,所述发送指令指示发送所采集的声音信号至所述目标车辆相对应的车载控制器。
当手机检测到所述用户的面部与手机的相对位置距离满足第二位置条件,生成发送指令,将采集的用户声音信号并发送至目标车辆。
所采集的声音信号,可以通过无线通讯方式发送至目标车辆。
本实施例中,采用无线通讯方式将采集的声音信号发送至目标车辆时,并不仅限于UWB通道,也可以通过蓝牙、WIFI、4G/5G移动通信网络等其他无线数据传输方式。
在所述用户的面部与所述电子设备的相对位置距离满足第二位置条件时,所述语音图标在所述交互界面上显示为第二图标。
第二图标可以是如图3所示的麦克风的大图标。
更进一步的,当手机检测到所述用户的面部与所述手机的相对位置距离不满足第二位置条件,停止发送所采集的声音信号。
更进一步的,当手机在停止发送所采集的声音信号之后,如果再次检测到所述用户的面部与所述手机的相对位置距离满足第二位置条件,则再次生成发送指令,发送所采集的声音信号。
显然,本实施例的技术方案,主动采集用户声音并通过用户面部与电子设备的相对位置距离进行发送,相比于如图1所示的实施例,减少了用户对交互界面的操作,促进了无感语音交互,提升了用户体验,降低了对电子设备的能耗。
S24、经由所述至少一个外部扬声器输出所采集的声音信号。
车辆设置的多个外部扬声器实时输出所采集的用户的声音信号。
更进一步的,由于本发明采用UWB技术可以精确获得相对位置关系,因此,当车辆设置有多个外部扬声器时,可以基于精确的相对位置关系,选择指定的目标扬声器进行声音播放,从而满足车辆的立体声环绕播放效果。
例如,当车辆对称分布设置有4个外部扬声器时,可以选择距离用户较近的2个目标扬声器进行播放,当用户改变位置时绕到车辆另一侧时,此时可以选择离用户更近的另外2个目标扬声器进行播放。
尽管为使解释简单化将上述方法图示并描述为一系列动作,但是应理解并领 会,这些方法不受动作的次序所限,因为根据一个或多个实施例,一些动作可按不同次序发生和/或与来自本文中图示和描述或本文中未图示和描述但本领域技术人员可以理解的其他动作并发地发生。
图5揭示了根据本发明一实施例的基于位置的语音交互系统的原理框图,如图5所示的实施例中,本发明提出的基于位置的语音交互系统,包括电子设备510以及设置于目标车辆上的车载控制器520和至少一个外部扬声器530,其中,
所述电子设备510可通信地连接于所述车载控制器520,其被配置为:
响应于检测到所述电子设备510与目标车辆之间的相对位置关系满足预设条件,显示包含有语音图标的交互界面,并生成动作检测指令,所述动作检测指令指示检测用户的动作;
响应于检测到所述用户的预设开启动作,生成声音采集指令,所述声音采集指令指示采集所述用户的声音信号;以及
执行所述声音采集指令,以采集所述声音信号并发送至所述目标车辆相对应的车载控制器520;
所述车载控制器520可通信地连接于所述至少一个外部扬声器530,其被配置为:接收所采集的声音信号,并发送至所述至少一个外部扬声器530;
所述至少一个外部扬声器530被配置为输出所采集的声音信号。
更进一步的,所述电子设备510还包括第一超宽带通信模块511,所述车载控制器520还包括第二超宽带通信模块521,其中,
所述第一超宽带通信模块511与所述第二超宽带通信模块521可建立基于超宽带方式的通信连接,其被配置为测定所述电子设备510与目标车辆之间的相对位置关系。
更进一步的,所述电子设备510还包括外接语音采集部件512,所述外接语音采集部件512被配置为根据所述声音采集指令采集所述用户的声音信号。
更进一步的,由于本发明采用UWB技术可以精确获得相对位置关系,因此,当车辆设置有多个外部扬声器530时,可以基于精确的相对位置关系,选择指定的目标扬声器进行声音播放,从而满足车辆的立体声环绕播放效果。
例如,当车辆对称分布设置有4个外部扬声器时,可以选择距离用户较近的2个目标扬声器进行播放,当用户改变位置时绕到车辆另一侧时,此时可以选择离用户更近的另外2个目标扬声器进行播放。
图6揭示了根据本发明另一实施例的基于位置的语音交互系统的原理框图,如图6所示的实施例中,本发明提出的基于位置的语音交互系统,包括电子设备610以及设置于目标车辆上的车载控制器620和至少一个外部扬声器630,其中,
所述电子设备610可通信地连接于所述车载控制器620,其被配置为:
响应于检测到所述电子设备610与目标车辆之间的相对位置关系满足第一位置条件,显示包含有语音图标的交互界面,并生成声音采集指令,所述声音采集 指令指示采集用户的声音信号,所述目标车辆设置至少一个外部扬声器630;
基于所采集的声音信号,确定所述用户的面部与所述电子设备610的相对位置距离;
响应于确定所述用户的面部与所述电子设备610的相对位置距离满足第二位置条件,生成发送指令,所述发送指令指示发送所采集的声音信号至所述目标车辆相对应的车载控制器620;
所述车载控制器620可通信地连接于所述至少一个外部扬声器630,其被配置为:接收所采集的声音信号,并发送至所述至少一个外部扬声器630;
所述至少一个外部扬声器630被配置为输出所采集的声音信号。
更进一步的,所述电子设备610还包括第一超宽带通信模块611,所述车载控制器620还包括第二超宽带通信模块621,其中,
所述第一超宽带通信模块611与所述第二超宽带通信模块621可建立基于超宽带方式的通信连接,其被配置为测定所述电子设备610与目标车辆之间的相对位置关系。
更进一步的,由于本发明采用UWB技术可以精确获得相对位置关系,因此,当车辆设置有多个外部扬声器630时,可以基于精确的相对位置关系,选择指定的目标扬声器进行声音播放,从而满足车辆的立体声环绕播放效果。
例如,当车辆对称分布设置有4个外部扬声器时,可以选择距离用户较近的2个目标扬声器进行播放,当用户改变位置时绕到车辆另一侧时,此时可以选择离用户更近的另外2个目标扬声器进行播放。
本发明提出的一种基于位置的语音交互方法及系统,将手机替代传统麦克风,在精确位置距离范围内激活手机的麦克风,实现在汽车外部扬声器中的声音重现,实现汽车的移动音响模式,车外人员利用手机麦克风以及车辆的对外播放声音功能,让车辆具备如车外卡拉OK,围绕车外进行实时演讲等应用功能。
本发明提供的一种基于位置的语音交互方法及系统,具体具有以下有益效果:
1)在车辆设置外部扬声器,用户在车外时可以与汽车进行直接语音交互,利用手机麦克风以及车辆外部扬声器的对外播放声音功能,让车辆具备对外语音播放等功能;
2)基于UWB方式测定相对位置关系,精确度高,安全性高。
如本申请和权利要求书中所示,除非上下文明确提示例外情形,“一”、“一个”、“一种”和/或“该”等词并非特指单数,也可包括复数。一般说来,术语“包括”与“包含”仅提示包括已明确标识的步骤和元素,而这些步骤和元素不构成一个排它性的罗列,方法或者设备也可能包含其他的步骤或元素。
本领域技术人员将进一步领会,结合本文中所公开的实施例来描述的各种解 说性逻辑板块、模块、电路、和算法步骤可实现为电子硬件、计算机软件、或这两者的组合。为清楚地解说硬件与软件的这一可互换性,各种解说性组件、框、模块、电路、和步骤在上面是以其功能性的形式作一般化描述的。此类功能性是被实现为硬件还是软件取决于具体应用和施加于整体系统的设计约束。技术人员对于每种特定应用可用不同的方式来实现所描述的功能性,但这样的实现决策不应被解读成导致脱离了本发明的范围。
结合本文所公开的实施例描述的各种解说性逻辑模块、和电路可用通用处理器、数字信号处理器(DSP)、专用集成电路(ASIC)、现场可编程门阵列(FPGA)或其它可编程逻辑器件、分立的门或晶体管逻辑、分立的硬件组件、或其设计成执行本文所描述功能的任何组合来实现或执行。通用处理器可以是微处理器,但在替换方案中,该处理器可以是任何常规的处理器、控制器、微控制器、或状态机。处理器还可以被实现为计算设备的组合,例如DSP与微处理器的组合、多个微处理器、与DSP核心协作的一个或多个微处理器、或任何其他此类配置。
结合本文中公开的实施例描述的方法或算法的步骤可直接在硬件中、在由处理器执行的软件模块中、或在这两者的组合中体现。软件模块可驻留在RAM存储器、闪存、ROM存储器、EPROM存储器、EEPROM存储器、寄存器、硬盘、可移动盘、CD-ROM、或本领域中所知的任何其他形式的存储介质中。示例性存储介质耦合到处理器以使得该处理器能从/向该存储介质读取和写入信息。在替换方案中,存储介质可以被整合到处理器。处理器和存储介质可驻留在ASIC中。ASIC可驻留在用户终端中。在替换方案中,处理器和存储介质可作为分立组件驻留在用户终端中。
在本发明中,除非另有明确的规定和限定,术语“安装”、“相连”、“连接”、“固定”等术语应做广义理解,例如,可以是固定连接,也可以是可拆卸连接,或一体地连接;可以是机械连接,也可以是电连接;可以是直接相连,也可以通过中间媒介间接相连,可以是两个元件内部的连通。对于本领域的普通技术人员而言,可以根据具体情况理解上述术语在本发明中的具体含义。
上述实施例是提供给熟悉本领域内的人员来实现或使用本发明的,熟悉本领域的人员可在不脱离本发明的发明思想的情况下,对上述实施例做出种种修改或变化,因而本发明的保护范围并不被上述实施例所限,而应该是符合权利要求书提到的创新性特征的最大范围。

Claims (20)

  1. 一种基于位置的语音交互方法,其特征在于,包括以下步骤:
    响应于检测到电子设备与目标车辆之间的相对位置关系满足预设条件,显示包含有语音图标的交互界面,并生成动作检测指令,所述动作检测指令指示检测用户的动作,所述目标车辆设置至少一个外部扬声器;
    响应于检测到所述用户的预设开启动作,生成声音采集指令,所述声音采集指令指示采集所述用户的声音信号;
    执行所述声音采集指令,以采集所述声音信号并发送至所述目标车辆相对应的车载控制器;以及
    经由所述至少一个外部扬声器输出所采集的声音信号。
  2. 根据权利要求1所述的基于位置的语音交互方法,所述生成声音采集指令之前,还包括以下步骤:
    生成唤醒指令以发送至所述目标车辆,所述唤醒指令指示唤醒所述目标车辆相对应的车载控制器。
  3. 根据权利要求1所述的基于位置的语音交互方法,所述经由所述至少一个外部扬声器输出所采集的声音信号,进一步还包括以下步骤:
    基于所述相对位置关系,在所述至少一个外部扬声器中选择目标扬声器;以及
    经由所述目标扬声器输出所采集的声音信号。
  4. 根据权利要求1所述的基于位置的语音交互方法,所述用户的预设开启动作,包括以下的任意一种:
    所述用户针对所述语音图标的第一预设操作;以及
    所述用户针对所述电子设备的第二预设操作。
  5. 根据权利要求1所述的基于位置的语音交互方法,其中,在所述声音采集指令未被执行时,所述语音图标在所述交互界面上被显示为第一图标;以及
    在所述声音采集指令被执行时,所述语音图标在所述交互界面上被显示为第二图标。
  6. 根据权利要求1所述的基于位置的语音交互方法,所述执行所述声音采集指令之后,进一步包括以下步骤:
    响应于检测到所述用户的预设断开动作,停止采集所述用户的所述声音信号。
  7. 根据权利要求1所述的基于位置的语音交互方法,所述显示包含有语音图标的交互界面,并生成动作检测指令之后,进一步包括以下步骤:
    响应于检测到所述电子设备与目标车辆之间的相对位置关系不满足预设条件,隐藏所述交互界面和/或停止检测所述用户的动作。
  8. 根据权利要求1所述的基于位置的语音交互方法,所述电子设备与目标车辆之间的相对位置关系是基于超宽带方式而测定的。
  9. 一种基于位置的语音交互方法,其特征在于,包括以下步骤:
    响应于检测到电子设备与目标车辆之间的相对位置关系满足第一位置条件,显示包含有语音图标的交互界面,并生成声音采集指令,所述声音采集指令指示采集用户的声音信号,所述目标车辆设置至少一个外部扬声器;
    基于所采集的声音信号,确定所述用户的面部与所述电子设备的相对位置距离;
    响应于确定所述用户的面部与所述电子设备的相对位置距离满足第二位置条件,生成发送指令,所述发送指令指示发送所采集的声音信号至所述目标车辆相对应的车载控制器;以及
    经由所述至少一个外部扬声器输出所采集的声音信号。
  10. 根据权利要求9所述的基于位置的语音交互方法,所述生成发送指令之前,还包括以下步骤:
    生成唤醒指令以发送至所述目标车辆,所述唤醒指令指示唤醒所述目标车辆相对应的车载控制器。
  11. 根据权利要求9所述的基于位置的语音交互方法,所述经由所述至少一个外部扬声器输出所采集的声音信号,进一步还包括以下步骤:
    基于所述相对位置关系,在所述至少一个外部扬声器中选择目标扬声器;以及
    经由所述目标扬声器输出所采集的声音信号。
  12. 根据权利要求9所述的基于位置的语音交互方法,其中,在所述用户的面部与所述电子设备的相对位置距离未满足第二位置条件时,所述语音图标在所述交互界面上显示为第一图标;以及
    在所述用户的面部与所述电子设备的相对位置距离满足第二位置条件时,所述语音图标在所述交互界面上显示为第二图标。
  13. 根据权利要求9所述的基于位置的语音交互方法,所述确定所述用户的面部与所述电子设备的相对位置距离之后,还包括以下步骤:
    响应于确定所述用户的面部与所述电子设备的相对位置距离不满足第二位置条件,停止发送所采集的声音信号。
  14. 根据权利要求9所述的基于位置的语音交互方法,所述显示包含有语音图标的交互界面,并生成声音采集指令之后,进一步包括以下步骤:
    响应于检测到所述电子设备与目标车辆之间的相对位置关系不满足第一位置条件,隐藏所述交互界面和/或停止采集所述用户的声音信号。
  15. 根据权利要求10所述的基于位置的语音交互方法,所述电子设备与目标车辆之间的相对位置关系是基于超宽带方式而测定的。
  16. 一种基于位置的语音交互系统,其特征在于,包括电子设备以及设置于目标车辆上的车载控制器和至少一个外部扬声器,其中,
    所述电子设备可通信地连接于所述车载控制器,其被配置为:
    响应于检测到所述电子设备与目标车辆之间的相对位置关系满足预设条件,显示包含有语音图标的交互界面,并生成动作检测指令,所述动作检测指令指示检测用户的动作;
    响应于检测到所述用户的预设开启动作,生成声音采集指令,所述声音采集指令指示采集所述用户的声音信号;以及
    执行所述声音采集指令,以采集所述声音信号并发送至所述目标车辆相对应的车载控制器;
    所述车载控制器可通信地连接于所述至少一个外部扬声器,其被配置为:接收所采集的声音信号,并发送至所述至少一个外部扬声器;
    所述至少一个外部扬声器被配置为输出所采集的声音信号。
  17. 根据权利要求16所述的基于位置的语音交互系统,所述电子设备还包括第一超宽带通信模块,所述车载控制器还包括第二超宽带通信模块,其中,
    所述第一超宽带通信模块与所述第二超宽带通信模块可建立基于超宽带方式的通信连接,其被配置为测定所述电子设备与目标车辆之间的相对位置关系。
  18. 根据权利要求16所述的基于位置的语音交互系统,所述电子设备还包括外接语音采集部件,所述外接语音采集部件被配置为根据所述声音采集指令采集所述用户的声音信号。
  19. 一种基于位置的语音交互系统,其特征在于,包括电子设备以及设置于目标车辆上的车载控制器和至少一个外部扬声器:
    所述电子设备可通信地连接于所述车载控制器,其被配置为:
    响应于检测到所述电子设备与目标车辆之间的相对位置关系满足第一位置条件,显示包含有语音图标的交互界面,并生成声音采集指令,所述声音采集指令指示采集用户的声音信号,所述目标车辆设置至少一个外部扬声器;
    基于所采集的声音信号,确定所述用户的面部与所述电子设备的相对位置距离;
    响应于确定所述用户的面部与所述电子设备的相对位置距离满足第二位置条件,生成发送指令,所述发送指令指示发送所采集的声音信号至所述目标车辆相对应的车载控制器;
    所述车载控制器可通信地连接于所述至少一个外部扬声器,其被配置为:接收所采集的声音信号,并发送至所述至少一个外部扬声器;
    所述至少一个外部扬声器被配置为输出所采集的声音信号。
  20. 根据权利要求19所述的基于位置的语音交互系统,所述电子设备还包括第一超宽带通信模块,所述车载控制器还包括第二超宽带通信模块,其中,
    所述第一超宽带通信模块与所述第二超宽带通信模块可建立基于超宽带方式的通信连接,其被配置为测定所述电子设备与目标车辆之间的相对位置关系。
PCT/CN2021/136137 2021-10-22 2021-12-07 一种基于位置的语音交互方法及系统 WO2023065481A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP21961219.9A EP4420936A1 (en) 2021-10-22 2021-12-07 Position-based voice interaction method and system
US18/642,285 US20240276149A1 (en) 2021-10-22 2024-04-22 Location-based voice interaction method and system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111233280.7A CN115118816B (zh) 2021-10-22 2021-10-22 一种基于位置的语音交互方法及系统
CN202111233280.7 2021-10-22

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/642,285 Continuation US20240276149A1 (en) 2021-10-22 2024-04-22 Location-based voice interaction method and system

Publications (1)

Publication Number Publication Date
WO2023065481A1 true WO2023065481A1 (zh) 2023-04-27

Family

ID=83325427

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/136137 WO2023065481A1 (zh) 2021-10-22 2021-12-07 一种基于位置的语音交互方法及系统

Country Status (4)

Country Link
US (1) US20240276149A1 (zh)
EP (1) EP4420936A1 (zh)
CN (1) CN115118816B (zh)
WO (1) WO2023065481A1 (zh)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016154777A1 (en) * 2015-03-27 2016-10-06 Bayerische Motoren Werke Aktiengesellschaft Intelligent voice assistant system, apparatus, and method for vehicle
CN107107846A (zh) * 2015-03-25 2017-08-29 宝马股份公司 用于经由车辆提供信息的系统、装置、方法和计算机程序产品
CN112937432A (zh) * 2021-02-19 2021-06-11 恒大新能源汽车投资控股集团有限公司 车辆发声装置的控制方法、装置、设备及存储介质
CN113345433A (zh) * 2021-05-30 2021-09-03 重庆长安汽车股份有限公司 一种车外语音交互系统

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8868254B2 (en) * 2012-06-08 2014-10-21 Apple Inc. Accessory control with geo-fencing
CN206272762U (zh) * 2016-12-08 2017-06-20 浙江警察学院 一种车载式车辆自动追踪系统
US10834248B2 (en) * 2016-12-26 2020-11-10 Huawei Technologies Co., Ltd. Device, method, and graphical user interface for starting application
CN108556986A (zh) * 2018-03-28 2018-09-21 上海乐愚智能科技有限公司 一种中控系统及车辆
CN109545219A (zh) * 2019-01-09 2019-03-29 北京新能源汽车股份有限公司 车载语音交互方法、系统、设备及计算机可读存储介质
CN111683325B (zh) * 2019-03-11 2022-02-08 深圳市冠旭电子股份有限公司 音效控制方法、装置、音箱、可穿戴设备及可读存储介质
CN110737422B (zh) * 2019-10-11 2023-04-28 北京地平线机器人技术研发有限公司 一种声音信号采集方法及装置
CN112752238B (zh) * 2019-10-30 2022-11-29 博泰车联网科技(上海)股份有限公司 基于使用场景提供信息服务的方法、设备和计算机存储介质
CN110544478A (zh) * 2019-11-04 2019-12-06 南京创维信息技术研究院有限公司 驾驶舱智能远场语音交互的系统及方法
CN113179202A (zh) * 2020-01-09 2021-07-27 上海博泰悦臻电子设备制造有限公司 用于分享数据的方法、电子设备和计算机存储介质
CN112667980A (zh) * 2021-01-05 2021-04-16 上海博泰悦臻网络技术服务有限公司 车辆、车机系统、车辆交互方法及装置
CN113438367A (zh) * 2021-06-23 2021-09-24 中国第一汽车股份有限公司 基于车辆系统的通话控制方法、装置、介质及电子设备

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107107846A (zh) * 2015-03-25 2017-08-29 宝马股份公司 用于经由车辆提供信息的系统、装置、方法和计算机程序产品
WO2016154777A1 (en) * 2015-03-27 2016-10-06 Bayerische Motoren Werke Aktiengesellschaft Intelligent voice assistant system, apparatus, and method for vehicle
CN112937432A (zh) * 2021-02-19 2021-06-11 恒大新能源汽车投资控股集团有限公司 车辆发声装置的控制方法、装置、设备及存储介质
CN113345433A (zh) * 2021-05-30 2021-09-03 重庆长安汽车股份有限公司 一种车外语音交互系统

Also Published As

Publication number Publication date
EP4420936A1 (en) 2024-08-28
US20240276149A1 (en) 2024-08-15
CN115118816A (zh) 2022-09-27
CN115118816B (zh) 2023-11-17

Similar Documents

Publication Publication Date Title
US10547736B2 (en) Detecting the location of a phone using RF wireless and ultrasonic signals
EP3547712B1 (en) Method for processing signals, terminal device, and non-transitory readable storage medium
EP3547659B1 (en) Method for processing audio signal and related products
CN110574103B (zh) 一种语音控制方法、可穿戴设备及终端
CN107026934B (zh) 一种声源定位方法和装置
KR102192361B1 (ko) 머리 움직임을 이용한 사용자 인터페이스 방법 및 장치
US20160266235A1 (en) Driver side location detection
WO2019018823A1 (en) DETECTION AND LOCATION OF A MOBILE DEVICE USING A SOUND
CN108600885B (zh) 声音信号处理方法及相关产品
JP2015008494A (ja) 距離ベースのセキュリティ
US20190333498A1 (en) Processing audio signals
EP4354900A1 (en) Audio information processing method, electronic device, system, product, and medium
CN110140343A (zh) 一种基于车联的提示方法及装置
KR102133004B1 (ko) 상황에 따라 볼륨을 자동으로 조절하는 장치 및 그 제어방법
WO2023065481A1 (zh) 一种基于位置的语音交互方法及系统
JP2016038202A (ja) 車両通信システム
US20230381025A1 (en) Situational awareness, communication, and safety in hearing protection and communication systems
CN110753157B (zh) 终端预警方法、系统、终端及可读存储介质
WO2023093412A1 (zh) 主动降噪的方法及电子设备
CN108099843B (zh) 车辆控制方法及装置
CN115835079A (zh) 透传模式的切换方法和切换装置
KR20210073476A (ko) 음파 통신을 이용한 차량 개폐 시스템에서 운전자가 차량의 내부 또는 외부에 위치함을 판별하는 방법 및 시스템
KR102106288B1 (ko) 운전석 주변에 모듈 형태로 장착되는 음파 통신을 이용한 차량 개폐 시스템
CN107578559A (zh) 共享汽车的控制方法、装置及计算机可读存储介质
KR20190026100A (ko) 블루투스 통신과 가청 또는 비가청주파수의 음파를 사용하여 스마트폰의 위치를 파악하는 방법 및 장치

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21961219

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2021961219

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2021961219

Country of ref document: EP

Effective date: 20240522