WO2016172899A1 - 导航信息处理、获取方法及语音导航系统 - Google Patents

导航信息处理、获取方法及语音导航系统 Download PDF

Info

Publication number
WO2016172899A1
WO2016172899A1 PCT/CN2015/077914 CN2015077914W WO2016172899A1 WO 2016172899 A1 WO2016172899 A1 WO 2016172899A1 CN 2015077914 W CN2015077914 W CN 2015077914W WO 2016172899 A1 WO2016172899 A1 WO 2016172899A1
Authority
WO
WIPO (PCT)
Prior art keywords
navigation
voice
request
information
navigation device
Prior art date
Application number
PCT/CN2015/077914
Other languages
English (en)
French (fr)
Inventor
李仁涛
Original Assignee
李仁涛
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 李仁涛 filed Critical 李仁涛
Priority to PCT/CN2015/077914 priority Critical patent/WO2016172899A1/zh
Publication of WO2016172899A1 publication Critical patent/WO2016172899A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01CMEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/26Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
    • G01C21/34Route searching; Route guidance

Definitions

  • the embodiments of the present invention relate to the field of wireless communications technologies, and in particular, to a navigation information processing, acquiring method, and a voice navigation system.
  • the Global Positioning System can realize navigation, positioning, timing and other functions to guide aircraft, ships, vehicles and individuals to safely and accurately follow the selected route and arrive at the destination on time.
  • the vehicle intelligent navigation system is an auxiliary device that uses GPS to assist users in accurate positioning.
  • the vehicle intelligent navigation system utilizes the position, speed and time information provided by GPS, and cooperates with the route planning ability of high-precision navigation electronic map to provide users with navigation functions, helping users to accurately and real-time plan driving routes on the electronic map, and guide users at the same time. Drive on the planned route and arrive at your destination.
  • a method of setting a destination by using a voice input method is to set a voice recognition system inside the in-vehicle intelligent navigation system.
  • the user can send a voice command to the in-vehicle smart navigation system.
  • the voice recognition system can recognize the destination information input by the user according to the voice command.
  • the in-vehicle intelligent navigation system can set the destination that the user needs to reach according to the identification of the acquired destination information, thereby completing the navigation function.
  • the prior art has at least the following disadvantages: since the voice recognition system needs to be set in the in-vehicle intelligent navigation system, the cost of the in-vehicle intelligent navigation system is high, and since the recognition rate of the speech recognition system is low, the in-vehicle intelligent navigation is caused. The system's ability to handle voice commands is very limited.
  • the embodiment of the invention provides a navigation information processing, obtaining method and a voice navigation system, so as to solve the problem that the cost of the vehicle-mounted intelligent navigation system in the prior art is high and the recognition accuracy is low, and the cost of the vehicle-mounted intelligent navigation system is reduced. Expand the recognition range of speech and improve the recognition accuracy.
  • An embodiment of the present invention provides a navigation information processing method, including: identifying a voice navigation request sent by a navigation device, and acquiring corresponding navigation request text information, where the voice navigation request is at least Include one of a first voice band request and a second voice band request sent by the user; and according to the logic control application document, send navigation setting information corresponding to the navigation request text information to the navigation device, so that the navigation device The navigation setting process is performed based on the navigation setting information.
  • An embodiment of the present invention provides a method for acquiring navigation information, including: sending a voice navigation request to the voice gateway according to the voice guidance information sent by the voice gateway, so that the voice gateway recognizes the voice navigation request, and controls the application document according to the logic. Receiving, by the feedback, the navigation request text information corresponding to the voice navigation request, where the voice navigation request includes at least one of a first voice frequency band request and a second voice frequency band request sent by the user; and receiving the Navigate the navigation setting information corresponding to the request text information.
  • the embodiment of the present invention further provides a voice navigation system, including: a voice gateway, configured to identify a voice navigation request sent by the navigation device, obtain corresponding navigation request text information, and control the application document according to the logic, and the navigation request text
  • the navigation setting information corresponding to the information is sent to the navigation device, so that the navigation device performs navigation setting processing according to the navigation setting information;
  • the voice navigation request includes at least a first voice frequency band request and a second voice sent by the user.
  • One of the frequency band requests a document server for transmitting the logical control application document to the voice gateway.
  • the navigation information processing and obtaining method and device provided by the embodiment of the present invention use the voice gateway to identify and process the voice information sent by the user by using the navigation device, and feed back the corresponding navigation setting information to the navigation device, and the navigation device does not need to set the voice itself.
  • the identification system reduces the cost of the navigation device; moreover, the recognition range of the voice information sent by the navigation device is expanded, and the recognition accuracy rate is high.
  • the navigation setting process can be performed according to the first voice band request and the second voice band request, and the function of the navigation system is improved.
  • FIG. 1 is a flowchart of an embodiment of a navigation information processing method according to the present invention.
  • FIG. 2 is a flowchart of another embodiment of a navigation information processing method according to the present invention.
  • FIG. 3 is a flowchart of an embodiment of a method for acquiring navigation information according to the present invention.
  • FIG. 4 is a flowchart of another embodiment of a method for acquiring navigation information according to the present invention.
  • FIG. 5 is a signaling flowchart of still another embodiment of a navigation information acquiring method according to the present invention.
  • FIG. 6 is a schematic structural diagram of an embodiment of a voice gateway according to the present invention.
  • FIG. 7 is a schematic structural diagram of another embodiment of a voice gateway according to the present invention.
  • FIG. 8 is a schematic structural diagram of an embodiment of a navigation device according to the present invention.
  • FIG. 9 is a schematic structural diagram of an embodiment of a voice navigation system according to the present invention.
  • FIG. 1 is a flowchart of an embodiment of a navigation information processing method according to the present invention. As shown in FIG. 1 , the method in this embodiment includes:
  • Step 101 Identify a voice navigation request sent by the navigation device, and obtain corresponding navigation request text information.
  • the voice navigation request includes at least one of a first voice frequency band request and a second voice frequency band request sent by the user.
  • the first voice frequency band request is located in a normal listening threshold, and the second voice frequency band request is beyond a normal listening threshold.
  • the first voice frequency band request is a request for a normal tone sent by a user, such as a phone call or a play.
  • the second voice band request is a scream of the user (anyone in the car).
  • the navigation device may be a Global Positioning System (GPS) navigator.
  • GPS Global Positioning System
  • the voice gateway can be used to identify the voice navigation request sent by the navigation device.
  • the voice gateway can be a voice extensible markup language (VXML) gateway to identify the navigation device.
  • VXML voice extensible markup language
  • the VXML gateway is a standardized voice response system platform.
  • the VXML gateway is mainly composed of Automatic Speech Recognition (ASR) components, VXML interpreter components, and Text-To-Speech (TTS) components. composition.
  • ASR Automatic Speech Recognition
  • VXML interpreter components VXML interpreter components
  • TTS Text-To-Speech
  • the ASR component can recognize the voice waveform signal into text information, so that the VXML interpreter component can interpret and process the text information, and the ASR component can realize the recognition of the natural language with high accuracy.
  • the voice navigation request sent by the navigation device may be identified and processed by using the ASR component in the VXML gateway, so as to obtain corresponding navigation request text information, which is to identify the voice navigation request in the form of a voice waveform into a text form. Navigate request text information.
  • Step 102 Send navigation setting information corresponding to the navigation request text information to the navigation device according to the logic control application document, and the navigation device performs navigation setting processing according to the navigation setting information.
  • the logic control application document After the navigation request text information is obtained by the voice gateway, and when the voice navigation request is the first voice frequency band request, for example, the text information of the navigation request is “Beijing”, the logic control application document obtains the corresponding “Beijing” content. And querying the corresponding navigation setting information; and sending the query to the navigation setting information to the navigation device.
  • the VXML gateway can obtain navigation setting information requested by the navigation device, and the VXML gateway can send the navigation setting information to the navigation device.
  • the navigation device After receiving the navigation setting information, the navigation device can set the navigation setting information according to the navigation setting information, so that the user obtains the positioning information through the navigation device.
  • the navigation setting information may be location data or command information.
  • the location data may not be stored.
  • the location data in the navigation setting information may be navigation information related to “Beijing”, such as latitude and longitude information.
  • the navigation setting information may be command information, and the navigation device may call the location data stored by the navigation device itself according to the command information.
  • VXML gateway is used as an example for description in this embodiment. In an actual implementation process, any type of voice gateway may be used.
  • the voice waveform information can be recognized as the navigation request text information in the form of text, thereby implementing the conversion of the voice signal to the machine language; then, the VXML gateway can control the application document in the logic.
  • the navigation device sends corresponding navigation setting information to the navigation device, so that the navigation device can obtain the required positioning information according to the navigation setting information.
  • the ASR component of the VXML gateway can use the ASR component of the VXML gateway to identify the voice navigation request sent by the navigation device, and the ASR component can recognize the natural language, and the recognition rate is high. Therefore, the correct rate of the voice navigation request is recognized in the embodiment and the recognition range is high.
  • the navigation device alerts the user to drive by setting its own safety alert device, such as a horn.
  • FIG. 2 is a flowchart of another embodiment of a navigation information processing method according to the present invention. As shown in FIG. 2, the method in this embodiment includes:
  • Step 201 Establish a voice interaction channel with the navigation device according to the call access request forwarded by the navigation device through the third-party service center.
  • the voice gateway in this embodiment also takes a VXML gateway as an example, and the navigation device can take an in-vehicle GPS navigator as an example.
  • the car GPS navigator can request a voice interaction channel with the VXML gateway by sending a call access request to the third-party service center, and then forwarding the call access request to the VXML gateway through the third-party service center.
  • a voice interaction channel can be established with the navigation device, and the voice interaction channel can facilitate voice interaction between the navigation device and the VXML gateway.
  • the third-party service center can call the center, or For other third-party service centers such as contact centers.
  • Step 202 Obtain a logical control application document corresponding to the call type identifier carried in the call access request from the document server.
  • the logic control application document stored on the document server may include a VXML application document, a text application document, and a binary application document. This embodiment is only described by taking a VXML application document as an example.
  • VXML application documents can be stored on the document server, and each VXML application document can control the interaction between the navigation device and the VXML gateway.
  • the VXML gateway can obtain the corresponding VXML application document from the document server according to the call type identifier carried in the call access request.
  • the document server can receive the VXML application document request packet sent by the VXML gateway.
  • the data packet may include a call type identifier carried in the call access request.
  • the call type identifier may be set to a document universal resource identifier (Uniform Resource Identifier, hereinafter referred to as URI), and the document server according to the data.
  • URI Uniform Resource Identifier
  • the URI carried in the package can extract or generate the corresponding VXML application document
  • the VXML application document can include a VXML script file automatically generated by the document server, or a pre-recorded audio file. Both the VXML script file and the audio file are used by the VXML gateway to guide the car GPS navigator in voice for the next step.
  • the document server sends the VXML application document to the VXML gateway, and the VXML interpreter component in the VXML gateway can interpret the VXML application document to obtain a script for interacting control commands between the car GPS navigator and the VXML gateway. file.
  • the voice setting request sent by the VXML gateway to the navigation device first prompts the user to set the departure place by using the navigation device. After inputting the departure place, the user is prompted to set the destination by using the navigation device.
  • Step 203 Send voice guidance information to the navigation device according to the logic control application document, and receive a voice navigation request sent by the navigation device according to the voice guidance information.
  • Step 203 may be specifically: acquiring the guidance text information according to the logic control application document; performing voice synthesis processing on the guidance text information to generate voice guidance information; transmitting the voice guidance information to the navigation device through the voice interaction channel; and receiving the navigation device through the voice interaction A voice navigation request sent by the channel corresponding to the voice guidance information.
  • the VXML gateway may send voice guidance information to the car GPS navigator according to the VXML application document.
  • the VXML gateway may extract the guidance text information from the VXML application document. Please tell me the destination you want to set, then the TTS component of the VXML gateway can convert the boot text information into voice guidance information, and then the VXML gateway sends the voice guidance information to the car GPS navigator through the established voice interaction channel.
  • the car GPS navigator can send a voice navigation request "Beijing" to the VXML gateway according to the voice guidance information "Please tell me the destination you want to set.”
  • Step 204 Identify a voice navigation request sent by the navigation device, and obtain corresponding navigation request text information.
  • the ASR component in the VXML gateway identifies and processes the "Beijing" in the form of a voice waveform sent by the car GPS navigator, thereby acquiring corresponding navigation request text information, which is to recognize the speech waveform representing "Beijing" into text.
  • Formal navigation requests textual information to facilitate VXML parser components for VXML parsing.
  • Step 205 The navigation setting information corresponding to the navigation request text information is sent to the navigation device according to the logic control application document, and the navigation device performs navigation setting processing according to the navigation setting information.
  • the step 205 may be specifically: sending the navigation setting information to the third-party service center by using the command issuing device, and the third-party service center sends the navigation setting information to the navigation device through the short message channel or the voice interaction channel. .
  • the VXML gateway may obtain navigation setting information requested by the navigation device, and the navigation setting information may be location data, that is, navigation information related to “Beijing”, such as latitude and longitude information. Then, the VXML gateway can send the navigation setting information to the third-party service center through the command issuing device, and then the third-party service center sends the navigation setting information to the navigation device through the short message channel in the form of a short message or through a voice interaction channel. Navigation device After receiving the navigation setting information, the navigation setting information may be set according to the navigation setting information, so that the user obtains the positioning information by using the navigation device. In the embodiment of the present invention, the third-party service center may call the center or the contact center. Other third-party service centers.
  • the ASR component of the voice gateway can be used to identify the voice navigation request sent by the navigation device, and the ASR component can recognize the natural language, and the recognition rate is high. Therefore, the correct rate of the voice navigation request is high in this embodiment, and the recognition range is high.
  • the driver can be easily interactively set with the voice gateway through the navigation device to obtain navigation setting information, and can also ensure the driver's personal safety in the driving state;
  • the navigation device itself does not need to integrate the voice recognition system on the hardware, but the voice recognition processing is handed over to the voice gateway for processing, thereby reducing the cost of the navigation device; in addition, the voice gateway sets the navigation through the short message channel or the voice interactive channel.
  • the information is sent to the navigation device, so that the navigation device can access the navigation setting information in a flexible manner.
  • the foregoing embodiment describes the operation corresponding to the voice gateway when the navigation device obtains the navigation setting information.
  • the following describes the implementation process of the navigation device using the navigation information acquiring method of the present invention.
  • FIG. 3 is a flowchart of an embodiment of a method for acquiring navigation information according to the present invention. As shown in FIG. 3, the method in this embodiment includes:
  • Step 301 Send a voice navigation request to the voice gateway according to the voice guidance information sent by the voice gateway.
  • the car GPS navigator can receive the voice guidance information sent by the VXML gateway, such as "please tell me the destination you want to set” in the form of voice.
  • the car GPS navigator can send a corresponding voice navigation request to the VXML gateway according to the voice guidance information "please tell me the destination you want to set", such as "Beijing" in the form of voice.
  • Step 302 After receiving the voice navigation request, the voice gateway determines the navigation setting information corresponding to the navigation request text information that is fed back according to the logic.
  • the ASR component in the VXML gateway can identify and process the voice navigation request sent by the car GPS navigator to obtain corresponding navigation request text information, and the recognition process is to recognize the “Beijing” in the form of a voice waveform into a text.
  • the form of navigation requests text information can be identified and process the voice navigation request sent by the car GPS navigator to obtain corresponding navigation request text information, and the recognition process is to recognize the “Beijing” in the form of a voice waveform into a text.
  • the form of navigation requests text information is to identify and process the voice navigation request sent by the car GPS navigator to obtain corresponding navigation request text information, and the recognition process is to recognize the “Beijing” in the form of a voice waveform into a text.
  • the VXML gateway can obtain the navigation setting information requested by the car GPS navigator, and then the VXML gateway can send the navigation setting information to the car GPS navigator.
  • the car GPS navigator can receive the navigation setting information fed back by the VXML gateway.
  • the setting information can be set according to the navigation setting information, so that the user obtains the positioning information through the vehicle GPS navigator.
  • the navigation setting information may be location data or command information.
  • the location data in the navigation setting information may be navigation information related to “Beijing”, such as latitude and longitude information.
  • the navigation setting information may be command information, and the navigation device may call the location data stored by the navigation device itself according to the command information.
  • the voice navigation request sent by the navigation device is identified by using the voice gateway. Therefore, the voice quality request of the voice navigation request sent by the navigation device is limited, and the recognition accuracy is high, and the driver's personal body can be guaranteed in the driving state. Security; moreover, the navigation device itself does not need to integrate a speech recognition system on the hardware, thereby reducing the cost of the navigation device.
  • FIG. 4 is a flowchart of another embodiment of a method for acquiring navigation information according to the present invention. As shown in FIG. 4, the method in this embodiment includes:
  • Step 401 Send a call access request to the voice gateway through a third-party service center, and establish a voice interaction channel with the voice gateway.
  • the navigation device may send a call access request to the third-party service center, and then the third-party service center may forward the call access request to the VXML gateway, thereby connecting the VXML gateway to establish a navigation device and the VXML gateway.
  • the third-party service center may call the center, or may be another third-party service center such as a contact center.
  • Step 402 After receiving the logical control application document obtained by the voice gateway according to the call type identifier carried in the call access request, the voice guidance information sent by the voice interaction channel is used to instruct the navigation device to send the voice navigation request.
  • the VXML gateway can obtain a VXML application document corresponding to the service type identifier from the document server, and the VXML interpreter component in the VXML gateway can interpret the VXML application document.
  • the logic control application document may be a program defined by the VXML script file, and the program is implemented to realize the interaction between the vehicle GPS navigation system and the VXML gateway.
  • the VXML gateway can send voice guidance information to the navigation device through the voice interaction channel, and the voice guidance information is used to guide the navigation device to send a voice navigation request to the VXML gateway through the voice interaction channel.
  • Step 403 Send a voice navigation request to the voice gateway according to the voice guidance information sent by the voice gateway.
  • the VXML gateway may send the voice guidance information to the in-vehicle GPS navigator according to the VXML application document.
  • the guidance text information extracted by the VXML gateway from the VXML application document is "Please tell me the destination you want to set”
  • the TTS component of the VXML gateway can convert the boot text information into voice guidance information, and then the VXML gateway sends the voice guidance information to the car GPS navigator through the established voice interaction channel.
  • the car GPS navigator can send a corresponding voice navigation request to the VXML gateway through the voice interaction channel according to the voice guidance information "please tell me the destination you want to set", such as "Beijing" in the form of voice.
  • Step 404 After receiving the voice navigation request and acquiring the corresponding navigation request text information, the navigation setting information corresponding to the navigation request text information fed back by the logic control application document is controlled according to the logic.
  • the step 404 may be specifically: receiving, by the voice gateway, the navigation setting information sent by the sending device to the third-party service center and sent by the third-party service center through the short message channel or the voice interaction channel.
  • the ASR component in the VXML gateway can identify and process the voice navigation request sent by the car GPS navigator to obtain corresponding navigation request text information, and the recognition process is to recognize the “Beijing” in the form of a voice waveform into a text.
  • the form of navigation requests text information can be identified and process the voice navigation request sent by the car GPS navigator to obtain corresponding navigation request text information, and the recognition process is to recognize the “Beijing” in the form of a voice waveform into a text.
  • the form of navigation requests text information is to identify and process the voice navigation request sent by the car GPS navigator to obtain corresponding navigation request text information, and the recognition process is to recognize the “Beijing” in the form of a voice waveform into a text.
  • the VXML gateway can obtain the navigation setting information requested by the car GPS navigator, and the VXML gateway can send the navigation setting information to the car GPS navigator.
  • the car GPS navigator can receive the navigation setting information fed back by the VXML gateway, and can set itself according to the navigation setting information, so that the user obtains the positioning information through the car GPS navigator.
  • the navigation setting information may be location data or command information.
  • the location data may not be stored.
  • the location data in the navigation setting information may be navigation information related to “Beijing”, such as latitude and longitude information.
  • the navigation setting information may be command information, and the navigation device may call the location data stored by the navigation device itself according to the command information.
  • the voice gateway can be used to identify the voice navigation request sent by the navigation device. Therefore, the correct rate of the voice navigation request is high and the recognition range is wide. The entire interaction process is voice interaction, and the driver is not required to manually Operation, therefore, to ensure that the driver is driving The personal safety; since the speech recognition processing and the speech synthesis processing are all handed over to the voice gateway for processing, the navigation device itself does not need to integrate the speech recognition and synthesis system on the hardware, thereby reducing the cost of the navigation device.
  • FIG. 5 is a signaling flowchart of still another embodiment of a method for acquiring navigation information according to the present invention. As shown in FIG. 5, the method in this embodiment includes:
  • Step 501 The navigation device sends a call access request to a third-party service center.
  • the navigation device may send a call access request to the third-party service center, and request the third-party service center to forward the call access request to the VXML gateway.
  • the third-party service center may call the center or the contact center. Other third-party service centers.
  • Step 502 The third-party service center forwards the call access request to the voice gateway.
  • Step 503 Establish a voice interaction channel between the navigation device and the voice gateway.
  • the third-party service center can forward the call access request to the VXML gateway, thereby connecting the VXML gateway, so that a voice interaction channel is established between the navigation device and the VXML gateway.
  • Step 504 The voice gateway acquires, from the document server, a logic control application document corresponding to the call type information carried in the call access request.
  • VXML application documents can be stored on the document server, and each VXML application document can control the interaction between the navigation device and the VXML gateway.
  • the VXML gateway can obtain the corresponding VXML application document from the document server according to the call type identifier carried in the call access request.
  • Step 505 The voice gateway acquires the guidance text information from the logic control application document, and performs voice synthesis processing on the guidance text information to generate voice guidance information.
  • the VXML gateway can extract boot text information from the VXML application document, and then the TTS component of the VXML gateway can convert the boot text information into voice guidance information.
  • Step 506 The voice gateway sends the voice guidance information to the navigation device by using a voice interaction channel.
  • the VXML gateway transmits the voice guidance information to the in-vehicle GPS navigator through the established voice interaction channel.
  • the VXML gateway can extract the boot text message "Please tell me the destination you want to set” from the VXML application document, and then use the TTS component to convert the text form "Please tell me the destination you want to set” into a voice form. "Please tell me where you want to set up.”
  • Step 507 The navigation device sends a voice navigation request corresponding to the voice guidance information to the voice gateway through the voice interaction channel.
  • the car GPS navigator can be based on voice guidance information "please tell me what you want to set up. "Send a voice navigation request "Beijing" to the VXML gateway.
  • Step 508 The voice gateway identifies the voice navigation request, obtains navigation request text information, and acquires navigation setting information corresponding to the navigation request text information according to the logic control application document.
  • the ASR component in the VXML gateway identifies and processes the "Beijing" in the form of a voice waveform sent by the car GPS navigator, thereby acquiring corresponding navigation request text information, which is to recognize the speech waveform representing "Beijing" into text.
  • the form of navigation requests text information, and then the text form "Beijing" as the input parameter of the script file defined by the logic control application document, the corresponding navigation setting information can be obtained.
  • the navigation setting information may be location data or command information.
  • the location data in the navigation setting information may be navigation information related to “Beijing”, such as latitude and longitude information.
  • the navigation setting information may be command information, and the navigation device may call the location data stored by the navigation device itself according to the command information.
  • Step 509 The voice gateway sends the navigation setting information to the command issuing device.
  • Step 510 The command issuing device sends the navigation setting information to a third-party service center.
  • Step 511 The third-party service center sends the navigation setting information to the navigation device through the short message channel or the voice interaction channel.
  • the voice gateway can be used to identify the voice navigation request sent by the navigation device. Therefore, the correct rate of the voice navigation request is high and the recognition range is wide. The entire interaction process is voice interaction, and the driver is not required to manually Operation, therefore, can ensure the personal safety of the driver in the driving state; since the speech recognition processing and the speech synthesis processing are all handed over to the voice gateway for processing, the navigation device itself does not need to integrate the speech recognition and synthesis system on the hardware. Thereby reducing the cost of the navigation device.
  • the voice gateway sends the navigation setting information to the navigation device through the short message channel or the voice interaction channel, so that the navigation device can obtain the navigation setting information in a flexible manner.
  • the voice gateway may also directly send the navigation setting information to the third-party service center through the voice interaction channel, and then send the information to the navigation device by the third-party service center.
  • FIG. 6 is a schematic structural diagram of a voice gateway according to an embodiment of the present invention.
  • the voice gateway of the embodiment includes: a voice recognition module 11 and a transceiver module 12, wherein the voice recognition module 11 is configured to identify a voice device.
  • the voice navigation request acquires the corresponding navigation request text information; the transceiver module 12 is configured to control the application document according to the logic, and the navigation setting information corresponding to the navigation request text information
  • the navigation device is sent to the navigation device for navigation setting processing according to the navigation setting information.
  • the voice recognition module 11 identifies the voice navigation request, and the process of acquiring the navigation request text information is to identify the voice navigation request in the form of a voice waveform into the navigation request text information in the form of text.
  • the process of the navigation module 12 acquiring the navigation setting information according to the logic control application document is: inputting the navigation request text information “Beijing” as the input of the script file defined by the logic control application document, thereby acquiring corresponding navigation setting information.
  • the transceiver module 12 can then send the navigation setting information to the navigation device.
  • the navigation device After receiving the navigation setting information, the navigation device can set the navigation setting information according to the navigation setting information, so that the user obtains the positioning information through the navigation device.
  • the navigation setting information may be location data or command information.
  • the location data in the navigation setting information may be navigation information related to “Beijing”, such as latitude and longitude information.
  • the navigation setting information may be command information, and the navigation device may call the location data stored by the navigation device itself according to the command information.
  • the voice recognition module is used to identify the voice navigation request sent by the navigation device, and the voice waveform information can be recognized as the navigation request text information in the form of text, thereby realizing the conversion of the voice signal to the machine language; the transceiver module can control the application according to the logic.
  • the document sends corresponding navigation setting information to the navigation device, so that the navigation device can obtain the required positioning information according to the navigation setting information.
  • the voice recognition module in this embodiment can use the ASR component to identify the voice navigation request sent by the navigation device, and the ASR component can recognize the natural language, and the recognition rate is high. Therefore, the voice gateway of the embodiment recognizes the voice navigation request.
  • the correct rate is high, the recognition range is wide, and the voice recognition method can ensure the personal safety of the driver in the driving state for the in-vehicle navigation device; when the navigation setting information is obtained by using the voice gateway of the embodiment, the navigation device itself does not
  • the need to integrate a speech recognition system on the hardware also reduces the cost of the navigation device.
  • FIG. 7 is a schematic structural diagram of another embodiment of a voice gateway according to the present invention.
  • the voice gateway of the embodiment includes: a voice recognition module 11 and a transceiver module 12, and further includes: an acquisition module 13 and a voice recognition module 11
  • the navigation request is sent to the navigation device for the navigation device according to the logic control application document, and the navigation setting information corresponding to the navigation request text information is sent to the navigation device according to the logic control application document.
  • the setting information is used for the navigation setting process; the transceiver module 12 is further configured to send the voice guidance to the navigation device according to the logic control application document.
  • the obtaining module 13 is configured to establish a voice interaction channel with the navigation device according to the call access request forwarded by the navigation device through the third-party service center, and obtain the voice interaction channel from the document server.
  • the voice gateway of the embodiment can send the navigation guide information to the navigation device, so that the user can conveniently guide the user to interact with the voice gateway through the navigation device, thereby acquiring the required navigation setting information. Since the entire interaction process does not require any manual operation, it is a voice interaction, so that the driver's personal safety in the driving state can be ensured, and the voice gateway of the present embodiment recognizes the voice information with high correct rate, wide recognition range, and accurate rate draft. When the voice gateway of this embodiment is adopted, the corresponding navigation device does not need to integrate the voice recognition system on the hardware, thereby reducing the cost of the navigation device.
  • FIG. 8 is a schematic structural diagram of an embodiment of a navigation device according to the present invention.
  • the navigation device of the embodiment includes: a voice sending module 15 and a receiving module 16, and the voice sending module 15 is configured to perform voice guidance according to a voice gateway. The information is sent to the voice gateway for the voice navigation request.
  • the receiving module 16 is configured to receive the navigation setting information corresponding to the navigation request text information fed back by the voice control application document after receiving the voice gateway identification voice navigation request and acquiring the corresponding navigation request text information.
  • the navigation device can perform voice interaction with the VXML gateway to implement navigation and positioning by sending a voice navigation request to the VXML gateway and receiving the navigation setting information fed back by the VXML gateway according to the voice navigation request.
  • the VXML gateway can use the ASR component to identify the voice navigation request sent by the navigation device, and the ASR component can recognize the natural language, and the recognition rate is high, the navigation device of the embodiment sends the voice navigation request.
  • the voice quality limit is small, the recognition accuracy rate is high, and the driver's personal safety can be guaranteed in the driving state; the navigation device itself does not need to integrate the voice recognition system on the hardware, but the voice recognition processing is handed over to the VXML gateway for processing. Thus, the cost of the navigation device is reduced.
  • the voice sending module 15 is further configured to send a call access request to the voice gateway through a third-party service center, and establish a voice interaction channel with the voice gateway;
  • the module 16 is further configured to: after receiving the logical control application document obtained by the voice gateway according to the call type identifier carried in the call access request, and sent by the voice interaction channel, the voice guidance information used by the navigation device to send the voice navigation request.
  • the navigation device completes the voice communication request with the VXML gateway by sending a voice navigation request to the VXML gateway and receiving the navigation setting information fed back by the VXML gateway according to the voice navigation request. Interaction, the function of navigation and positioning.
  • the voice quality is limited, the recognition accuracy is high, and the driver's personal safety in the driving state can be ensured; the navigation device does not need to integrate the voice recognition system on the hardware, thereby reducing the cost of the navigation device.
  • FIG. 9 is a schematic structural diagram of an embodiment of a voice navigation system according to the present invention.
  • the system in this embodiment includes: a voice gateway 1 and a document server 2, and the voice gateway 1 is configured to forward through a third-party service center according to the navigation device.
  • a call access request establish a voice interaction channel with the navigation device, and obtain a logical control application document corresponding to the call type identifier carried in the call access request from the document server, and send a voice guide to the navigation device according to the logic control application document.
  • the navigation device And receiving the voice navigation request sent by the navigation device according to the voice guidance information, identifying the voice navigation request, acquiring the corresponding navigation request text information, and transmitting the navigation setting information corresponding to the navigation request text information to the navigation device according to the logic control application document
  • the navigation device performs navigation setting processing according to the navigation setting information; the document server 2 is configured to send the logic control application document to the voice gateway 1.
  • the embodiment may further include a command issuing device, configured to send the navigation setting information to the third-party service center, and enable the third-party service center to send the navigation setting information to the navigation device through the short message channel or the voice interaction channel.
  • a command issuing device configured to send the navigation setting information to the third-party service center, and enable the third-party service center to send the navigation setting information to the navigation device through the short message channel or the voice interaction channel.
  • the navigation device can perform voice interaction with the voice gateway through the voice channel established between the voice gateway, so that the navigation device can obtain the required positioning information according to the navigation setting information.
  • the voice gateway of the embodiment can use the ASR component to identify the voice navigation request sent by the navigation device. Therefore, the voice gateway of the embodiment can recognize the voice navigation request with high correct rate and wide recognition range, and the voice recognition mode is
  • the navigation device can ensure the personal safety of the driver while driving; the navigation device itself does not need to integrate the voice recognition system on the hardware, thereby reducing the cost of the navigation device, and the voice gateway through the short message channel or the voice interaction channel Sending the navigation setting information to the navigation device also enables the navigation device to obtain navigation setting information in a flexible manner.
  • the present invention can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases the former is a better implementation. .
  • the technical solution of the embodiments of the present invention may be embodied in the form of a software product in essence or in the form of a software product, which is stored in a storage medium and includes a plurality of instructions for making
  • a mobile device which may be a cell phone, a personal computer, a media player, etc.
  • Storage referred to here Media such as: ROM / RAM, disk, CD, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Automation & Control Theory (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Navigation (AREA)
  • Telephonic Communication Services (AREA)

Abstract

本发明涉及一种导航信息处理、获取方法及语音导航系统。该导航信息处理方法包括:识别导航设备发送的语音导航请求,获取相应的导航请求文本信息;根据逻辑控制应用文档,将与导航请求文本信息对应的导航设置信息发送给导航设备,供导航设备根据导航设置信息进行导航设置处理。该导航信息获取方法包括:根据语音网关发送的语音引导信息向语音网关发送语音导航请求;接收语音网关识别语音导航请求,获取相应的导航请求文本信息后,根据逻辑控制应用文档反馈的、与导航请求文本信息对应的导航设置信息。本发明实施例通过引入语音网关对导航设备发送的语音信息进行识别并反馈相应的导航设置信息,降低了导航设备的成本,扩大语音的识别范围。

Description

导航信息处理、获取方法及语音导航系统 技术领域
本发明实施例涉及无线通信技术领域,尤其涉及一种导航信息处理、获取方法及语音导航系统。
背景技术
全球定位系统(Global Positioning System,以下简称:GPS)可以实现导航、定位、授时等功能,引导飞机、船舶、车辆以及个人,安全、准确地沿着选定的路线,准时到达目的地。车载智能导航系统就是一种利用GPS协助用户准确定位的辅助设备。车载智能导航系统利用GPS提供的位置、速度及时间等信息,配合高精度导航电子地图的路线规划能力,为用户提供导航功能,帮助用户准确、实时地在电子地图上规划行车路线,同时引导用户按规划的路线行驶,并到达目的地。
为了保证驾驶的安全性,驾驶员在车载智能导航系统中设置目的地时,可以采用语音输入方式。目前,采用语音输入方式设置目的地的方法是在该车载智能导航系统的内部设置语音识别系统。当需要输入目的地时,用户可以向该车载智能导航系统发送语音命令。然后,该语音识别系统即可根据该语音命令识别出用户输入的目的地信息。最后,车载智能导航系统即可根据识别获取的目的地信息设置用户所需到达的目的地,从而完成导航功能。
然而,现有技术至少存在以下缺点:由于车载智能导航系统中需要设置语音识别系统,因此,车载智能导航系统的成本较高,而且,由于语音识别系统的识别正确率较低,致使车载智能导航系统处理语音命令的能力十分有限。
发明内容
本发明实施例提供一种导航信息处理、获取方法及语音导航系统,以解决现有技术中车载智能导航系统的成本较高,识别正确率较低的问题,实现降低车载智能导航系统的成本,扩大语音的识别范围,提高识别正确率的效果。
本发明实施例提供一种导航信息处理方法,包括:识别导航设备发送的语音导航请求,获取相应的导航请求文本信息,其中,所述语音导航请求至少包 括用户发出的第一语音频段请求以及第二语音频段请求其中之一;根据逻辑控制应用文档,将与所述导航请求文本信息对应的导航设置信息发送给所述导航设备,以便所述导航设备根据所述导航设置信息进行导航设置处理。
本发明实施例提供一种导航信息获取方法,包括:根据语音网关发送的语音引导信息,向所述语音网关发送语音导航请求,以便于所述语音网关识别语音导航请求,并根据逻辑控制应用文档反馈获取所述语音导航请求相应的导航请求文本信息,其中,所述语音导航请求至少包括用户发出的第一语音频段请求以及第二语音频段请求其中之一;接收所述语音网关发送的所述导航请求文本信息对应的导航设置信息。
本发明实施例还提供一种语音导航系统,包括:语音网关,用于识别导航设备发送的语音导航请求,获取相应的导航请求文本信息;并根据逻辑控制应用文档,将与所述导航请求文本信息对应的导航设置信息发送给所述导航设备,以便所述导航设备根据所述导航设置信息进行导航设置处理;其中,所述语音导航请求至少包括用户发出的第一语音频段请求以及第二语音频段请求其中之一;文档服务器,用于向所述语音网关发送所述逻辑控制应用文档。
本发明实施例提供的导航信息处理、获取方法及装置,通过采用语音网关对用户利用导航设备发送的语音信息进行识别处理,并将对应的导航设置信息反馈给导航设备,导航设备自身不用设置语音识别系统,降低了导航设备的成本;而且,扩大了导航设备发送的语音信息的识别范围,且识别正确率较高。此外,还能够根据第一语音频段请求以及第二语音频段请求进行导航设置处理,完善了导航系统的功能。
附图说明
图1为本发明导航信息处理方法一实施例的流程图;
图2为本发明导航信息处理方法另一实施例的流程图;
图3为本发明导航信息获取方法一实施例的流程图;
图4为本发明导航信息获取方法另一实施例的流程图;
图5为本发明导航信息获取方法再一实施例的信令流程图;
图6为本发明语音网关一实施例的结构示意图;
图7为本发明语音网关另一实施例的结构示意图;
图8为本发明导航设备一实施例的结构示意图;
图9为本发明语音导航系统一实施例的结构示意图。
具体实施方式
下面结合附图和具体实施例对本发明的技术方案作进一步更详细的描述。显然,所描述的实施例仅仅是本发明的一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动的前提下所获得的所有其他实施例,都应属于本发明保护的范围。
下面通过附图和实施例,对本发明的技术方案做进一步的详细描述。
图1为本发明导航信息处理方法一实施例的流程图,如图1所示,本实施例的方法包括:
步骤101、识别导航设备发送的语音导航请求,获取相应的导航请求文本信息;其中,所述语音导航请求至少包括用户发出的第一语音频段请求以及第二语音频段请求其中之一。本实施例中,所述第一语音频段请求位于正常听阈内,所述第二语音频段请求超出正常听阈,例如,所述第一语音频段请求是用户发出的正常语调的请求,如电话或播放音乐之类,所述第二语音频段请求则是用户(车内任何人)发出的惊叫声等。
举例来说,该导航设备可以为车载全球定位系统(Global PositioningSystem,以下简称:GPS)导航仪。在该车载GPS导航仪发送语音导航请求后,可以通过语音网关识别导航设备发送的语音导航请求,该语音网关可以为语音可扩展标记语言(Voice Extensible Markup Language,以下简称:VXML)网关识别导航设备发送的语音导航请求。VXML网关是一种标准化的语音应答系统平台,VXML网关主要由自动语音识别(Automatic SpeechRecognition,以下简称:ASR)组件、VXML解释器组件以及语音合成(Text-To-Speech,以下简称:TTS)组件组成。其中,ASR组件可以将语音波形信号识别成文本信息,从而可以方便VXML解释器组件对该文本信息进行解释处理,ASR组件可实现对自然语言的识别,且准确率较高。
本实施例可以采用VXML网关中的ASR组件对导航设备发送的语音导航请求进行识别处理,从而获取相应的导航请求文本信息,该识别过程即为将语音波形形式的语音导航请求识别成文本形式的导航请求文本信息。
步骤102、根据逻辑控制应用文档,将与导航请求文本信息对应的导航设置信息发送给导航设备,供导航设备根据该导航设置信息进行导航设置处理。
语音网关获取到的导航请求文本信息后,且当语音导航请求为第一语音频段请求时,比如该导航请求的文本信息为“北京”,则逻辑控制应用文档获取到相应的“北京”的内容,进行查询相应的导航设置信息;并且把查询到导航设置信息发送给导航设备。
在该逻辑控制应用文档的控制下,VXML网关可以获取导航设备请求的导航设置信息,VXML网关可以将该导航设置信息发送给导航设备。导航设备在接收到该导航设置信息后,即可根据该导航设置信息进行自身设置,从而使用户通过该导航设备获取定位信息。该导航设置信息既可以为地点数据也可以为命令信息,对于导航设备本身不存储任何地点数据的情况,该导航设置信息中的地点数据可以为与“北京”相关的导航信息,如经纬度信息等;对于导航设备自身存储有地点数据的情况,该导航设置信息可以为命令信息,导航设备可以根据该命令信息调用该导航设备自身存储的地点数据自行设置。
需要说明的是,本实施例仅以VXML网关为例进行说明,在实际实现过程中,可以采用任意类型的语音网关。
本实施例通过对导航设备发送的语音导航请求进行识别,可以将语音波形信息识别为文本形式的导航请求文本信息,从而实现语音信号到机器语言的转换;然后,VXML网关可以在逻辑控制应用文档的控制下向导航设备发送对应的导航设置信息,从而方便导航设备根据该导航设置信息获取所需的定位信息。由于本实施例可以采用VXML网关的ASR组件对导航设备发送的语音导航请求进行识别,且ASR组件可以识别自然语言,识别率高,因此,本实施例识别语音导航请求的正确率高且识别范围广,且语音识别的方式对于车载导航设备来说,可以在行驶状态下保证驾驶员的人身安全;导航设备本身并不需要在硬件上集成语音识别系统,而是将语音识别处理交给VXML网关进行处理,从而降低了导航设备的成本。
此外,当语音导航请求为第二语音频段请求时,导航设备通过设置自身的安全警示装置,例如喇叭,警示用户注意驾驶。
图2为本发明导航信息处理方法另一实施例的流程图,如图2所示,本实施例的方法包括:
步骤201、根据导航设备通过第三方服务中心转发的呼叫接入请求,与导航设备建立语音交互通道;
本实施例中的语音网关也以VXML网关为例,导航设备可以以车载GPS导航仪为例。
举例来说,车载GPS导航仪可以通过向第三方服务中心发送呼叫接入请求,然后通过该第三方服务中心向VXML网关转发该呼叫接入请求,来请求与VXML网关建立语音交互通道,VXML网关根据该呼叫接入请求即可与该导航设备建立语音交互通道,该语音交互通道能够方便导航设备与VXML网关之间进行语音交互,本发明实施例中,第三方服务中心可以呼叫中心,也可以为联络中心等其他第三方服务中心。
步骤202、从文档服务器上获取与所述呼叫接入请求中携带的呼叫类型标识对应的逻辑控制应用文档。
存储在文档服务器上的逻辑控制应用文档可以包括VXML应用文档、文本应用文档以及二进制应用文档,本实施例仅以VXML应用文档为例进行说明。
举例来说,文档服务器上可以存储各种VXML应用文档,每种VXML应用文档都可对导航设备与VXML网关之间的交互进行控制。VXML网关可以根据呼叫接入请求中携带的呼叫类型标识从文档服务器上获取对应的VXML应用文档。
文档服务器可以接收VXML网关发送的VXML应用文档请求数据包。该数据包中可以包括呼叫接入请求中携带的呼叫类型标识,在本实施例中可以将该呼叫类型标识设为一个文档通用资源标志符(Uniform ResourceIdentifier,以下简称:URI),文档服务器根据数据包中携带的URI,即可提取或生成相应的VXML应用文档,该VXML应用文档中既可以包括文档服务器自动生成的VXML脚本文件,也可以包括预先录制的音频文件。VXML脚本文件和音频文件都用于VXML网关通过语音形式引导车载GPS导航仪进行下一步操作。
文档服务器将该VXML应用文档发送给VXML网关,该VXML网关中的VXML解释器组件即可对该VXML应用文档进行解释处理,获取车载GPS导航仪与VXML网关之间的进行交互的控制命令的脚本文件。VXML网关发送给导航设备的语音设置请求,首先提示用户利用导航设备设置出发地,当输入出发地后,然后提示用户利用导航设备设置目的地。
步骤203、根据逻辑控制应用文档,向导航设备发送语音引导信息,并接收导航设备根据语音引导信息发送的语音导航请求。
步骤203可以具体为:根据逻辑控制应用文档,获取引导文本信息;对引导文本信息进行语音合成处理,生成语音引导信息;通过语音交互通道将语音引导信息发送给导航设备;接收导航设备通过语音交互通道发送的、与语音引导信息对应的语音导航请求。
举例来说,VXML网关在获取上述VXML应用文档后,可以根据该VXML应用文档向车载GPS导航仪发送语音引导信息,在本实施例中,VXML网关可以从VXML应用文档中提取到引导文本信息“请告诉我你要设置的目的地”,然后VXML网关的TTS组件可以将引导文本信息转换成语音引导信息,然后VXML网关通过已经建立的语音交互通道将该语音引导信息发送给车载GPS导航仪。
车载GPS导航仪可以根据语音引导信息“请告诉我你要设置的目的地”向VXML网关发送语音导航请求“北京”。
步骤204、识别导航设备发送的语音导航请求,获取相应的导航请求文本信息;
VXML网关中的ASR组件对车载GPS导航仪发送的语音波形形式的“北京”进行识别处理,从而获取相应的导航请求文本信息,该识别过程即为将代表“北京”的语音波形的识别成文本形式的导航请求文本信息,以方便VXML解释器组件进行VXML解析处理。
步骤205、根据逻辑控制应用文档,将与导航请求文本信息对应的导航设置信息发送给导航设备,供导航设备根据导航设置信息进行导航设置处理。
步骤205可以具体为:通过命令下发设备将导航设置信息发送给第三方服务中心,第三方服务中心通过短信通道或者语音交互通道,将导航设置信息发送给导航设备。。
具体来说,在逻辑控制应用文档的控制下,VXML网关可以获取导航设备请求的导航设置信息,该导航设置信息可以为地点数据,即为与“北京”相关的导航信息,如经纬度信息等。然后,VXML网关可以通过命令下发设备将该导航设置信息发送给第三方服务中心,然后第三方服务中心再将该导航设置信息通过短信通道以短信形式或者通过语音交互通道发送给导航设备。导航设备 在接收到该导航设置信息后,即可根据该导航设置信息进行自身设置,从而使用户通过该导航设备获取定位信息,本发明实施例中,第三方服务中心可以呼叫中心,也可以为联络中心等其他第三方服务中心。
本实施例可以采用语音网关的ASR组件对导航设备发送的语音导航请求进行识别,且ASR组件可以识别自然语言,且识别率高,因此,本实施例识别语音导航请求的正确率高,识别范围广,且语音识别的方式对于车载导航设备来说,既可以使驾驶员方便地通过导航设备与语音网关进行交互设置,获取导航设置信息,又可以在行驶状态下保证驾驶员的人身安全;而且,导航设备本身并不需要在硬件上集成语音识别系统,而是将语音识别处理交给语音网关进行处理,从而降低了导航设备的成本;此外,语音网关通过短信通道或者语音交互通道将导航设置信息发送给导航设备,从而使得导航设备获取导航设置信息的途径灵活多样。
上述实施例介绍了导航设备获取导航设置信息时,与语音网关对应的操作,下面介绍导航设备采用本发明导航信息获取方法实施例的实现过程。
图3为本发明导航信息获取方法一实施例的流程图,如图3所示,本实施例的方法包括:
步骤301、根据语音网关发送的语音引导信息向语音网关发送语音导航请求;
举例来说,车载GPS导航仪可以接收到VXML网关发送的语音引导信息,如语音形式的“请告诉我你要设置的目的地”。
车载GPS导航仪可以根据语音引导信息“请告诉我你要设置的目的地”向VXML网关发送对应的语音导航请求,例如语音形式的“北京”。
步骤302、接收语音网关识别语音导航请求,获取相应的导航请求文本信息后,根据逻辑控制应用文档反馈的、与导航请求文本信息对应的导航设置信息。
具体来说,VXML网关中的ASR组件可以对车载GPS导航仪发送的语音导航请求进行识别处理,从而获取相应的导航请求文本信息,该识别过程即为将语音波形形式的“北京”识别成文本形式的导航请求文本信息。
在该逻辑控制应用文档的控制下,VXML网关可以获取车载GPS导航仪请求的导航设置信息,然后,VXML网关可以将该导航设置信息发送给车载GPS导航仪。至此,车载GPS导航仪即可接收到VXML网关反馈的导航设置信息, 即可根据该导航设置信息进行自身设置,从而使用户通过该车载GPS导航仪获取定位信息。该导航设置信息既可以为地点数据也可以为命令信息,对于导航设备本身不存储任何地点数据的情况,该导航设置信息中的地点数据可以为与“北京”相关的导航信息,如经纬度信息等;对于导航设备自身存储有地点数据的情况,该导航设置信息可以为命令信息,导航设备可以根据该命令信息调用该导航设备自身存储的地点数据自行设置。
在本实施例由于采用语音网关对导航设备发送的语音导航请求进行识别,因此,对导航设备发送的语音导航请求的语音质量限制小,识别正确率高,可以在行驶状态下保证驾驶员的人身安全;而且,导航设备本身并不需要在硬件上集成语音识别系统,从而降低了导航设备的成本。
下面详细介绍导航设备采用本发明导航信息获取方法另一实施例获取导航设置信息的过程。
图4为本发明导航信息获取方法另一实施例的流程图,如图4所示,本实施例的方法包括:
步骤401、通过第三方服务中心向语音网关发送呼叫接入请求,与语音网关建立语音交互通道;
举例来说,导航设备可以向第三方服务中心发送呼叫接入请求,然后第三方服务中心可以将该呼叫接入请求转发给VXML网关,从而接通VXML网关,使得导航设备与VXML网关之间建立语音交互通道,本发明实施例中,第三方服务中心可以呼叫中心,也可以为联络中心等其他第三方服务中心。
步骤402、接收语音网关根据呼叫接入请求中携带的呼叫类型标识获取的逻辑控制应用文档后,通过语音交互通道发送的,用于指示导航设备发送语音导航请求的语音引导信息;
举例来说,VXML网关可以从文档服务器中获取与该业务类型标识对应的VXML应用文档,该VXML网关中的VXML解释器组件即可对该VXML应用文档进行解释处理。该逻辑控制应用文档可以是VXML脚本文件定义的一段程序,通过执行该程序实现车载GPS导航系统与VXML网关的交互。VXML网关在获取该逻辑控制应用文档后即可通过语音交互通道向导航设备发送语音引导信息,该语音引导信息用于引导导航设备通过语音交互通道向VXML网关发送语音导航请求。
步骤403、根据语音网关发送的语音引导信息向语音网关发送语音导航请求;
举例来说,VXML网关在获取上述VXML应用文档后,可以根据该VXML应用文档向车载GPS导航仪发送语音引导信息,在本实施例可以假设VXML网关从VXML应用文档中提取到的引导文本信息为“请告诉我你要设置的目的地”,然后VXML网关的TTS组件可以将引导文本信息转换成语音引导信息,然后VXML网关通过已经建立的语音交互通道将该语音引导信息发送给车载GPS导航仪。
车载GPS导航仪可以根据语音引导信息“请告诉我你要设置的目的地”通过语音交互通道向VXML网关发送对应的语音导航请求,例如语音形式的“北京”。
步骤404、接收语音网关识别语音导航请求,获取相应的导航请求文本信息后,根据逻辑控制应用文档反馈的、与导航请求文本信息对应的导航设置信息。
步骤404可具体为:接收语音网关通过命令下发设备发送给第三方服务中心且第三方服务中心通过短信通道或者语音交互通道发送的导航设置信息。
具体来说,VXML网关中的ASR组件可以对车载GPS导航仪发送的语音导航请求进行识别处理,从而获取相应的导航请求文本信息,该识别过程即为将语音波形形式的“北京”识别成文本形式的导航请求文本信息。
最后,在该逻辑控制应用文档的控制下,VXML网关可以获取车载GPS导航仪请求的导航设置信息,VXML网关可以将该导航设置信息发送给车载GPS导航仪。至此,车载GPS导航仪即可接收到VXML网关反馈的导航设置信息,即可根据该导航设置信息进行自身设置,从而使用户通过该车载GPS导航仪获取定位信息。该导航设置信息既可以为地点数据也可以为命令信息,对于导航设备本身不存储任何地点数据的情况,该导航设置信息中的地点数据可以为与“北京”相关的导航信息,如经纬度信息等;对于导航设备自身存储有地点数据的情况,该导航设置信息可以为命令信息,导航设备可以根据该命令信息调用该导航设备自身存储的地点数据自行设置。
本实施例可以采用语音网关对导航设备发送的语音导航请求进行识别,因此,本实施例识别语音导航请求的正确率高、识别范围广;整个交互过程均为语音交互,不需要驾驶员进行手动操作,因此,可以保证驾驶员在行驶状态下 的人身安全;由于将语音识别处理和语音合成处理均交给语音网关进行处理,因此导航设备本身并不需要在硬件上集成语音识别和合成系统,从而降低了导航设备的成本。
图5为本发明导航信息获取方法再一实施例的信令流程图,如图5所示,本实施例的方法包括:
步骤501、导航设备向第三方服务中心发送呼叫接入请求。
导航设备可以向第三方服务中心发送呼叫接入请求,请求第三方服务中心将该呼叫接入请求转发给VXML网关,本发明实施例中,第三方服务中心可以呼叫中心,也可以为联络中心等其他第三方服务中心。
步骤502、第三方服务中心向语音网关转发该呼叫接入请求。
步骤503、导航设备与语音网关之间建立语音交互通道。
第三方服务中心可以将该呼叫接入请求转发给VXML网关,从而接通VXML网关,使得导航设备与VXML网关之间建立语音交互通道。
步骤504、语音网关从文档服务器上获取与该呼叫接入请求中携带的呼叫类型信息对应的逻辑控制应用文档。
文档服务器上可以存储各种VXML应用文档,每种VXML应用文档都可对导航设备与VXML网关之间的交互进行控制。VXML网关可以根据呼叫接入请求中携带的呼叫类型标识从文档服务器上获取对应的VXML应用文档。
步骤505、语音网关从该逻辑控制应用文档中获取引导文本信息,并对该引导文本信息进行语音合成处理,生成语音引导信息。
VXML网关可以从VXML应用文档中提取到引导文本信息,然后VXML网关的TTS组件可以将引导文本信息转换成语音引导信息
步骤506、语音网关通过语音交互通道将该语音引导信息发送给导航设备。
VXML网关通过已经建立的语音交互通道将该语音引导信息发送给车载GPS导航仪。例如,VXML网关可以从VXML应用文档中提取到引导文本信息“请告诉我你要设置的目的地”,然后采用TTS组件将文本形式的“请告诉我你要设置的目的地”转换成语音形式的“请告诉我你要设置的目的地”。
步骤507、导航设备通过语音交互通道向语音网关发送与该语音引导信息对应的语音导航请求。
例如,车载GPS导航仪可以根据语音引导信息“请告诉我你要设置的目的 地”向VXML网关发送语音导航请求“北京”。
步骤508、语音网关识别该语音导航请求,获取导航请求文本信息,并根据逻辑控制应用文档获取与该导航请求文本信息对应的导航设置信息。
VXML网关中的ASR组件对车载GPS导航仪发送的语音波形形式的“北京”进行识别处理,从而获取相应的导航请求文本信息,该识别过程即为将代表“北京”的语音波形的识别成文本形式的导航请求文本信息,然后将该文本形式的“北京”作为逻辑控制应用文档定义的脚本文件的输入参数,即可获得对应的导航设置信息。该导航设置信息既可以为地点数据也可以为命令信息,对于导航设备本身不存储任何地点数据的情况,该导航设置信息中的地点数据可以为与“北京”相关的导航信息,如经纬度信息等;对于导航设备自身存储有地点数据的情况,该导航设置信息可以为命令信息,导航设备可以根据该命令信息调用该导航设备自身存储的地点数据自行设置。
步骤509、语音网关将该导航设置信息发送给命令下发设备。
步骤510、命令下发设备将该导航设置信息发送给第三方服务中心。
步骤511、第三方服务中心通过短信通道或者语音交互通道,将导航设置信息发送给导航设备。
本实施例可以采用语音网关对导航设备发送的语音导航请求进行识别,因此,本实施例识别语音导航请求的正确率高、识别范围广;整个交互过程均为语音交互,不需要驾驶员进行手动操作,因此,可以保证驾驶员在行驶状态下的人身安全;由于将语音识别处理和语音合成处理均交给语音网关进行处理,因此导航设备本身并不需要在硬件上集成语音识别和合成系统,从而降低了导航设备的成本。此外,语音网关通过短信通道或者语音交互通道将导航设置信息发送给导航设备,从而使得导航设备获取导航设置信息的途径灵活多样。
本发明另一实施例中,在上述步骤S509中,语音网关还可以将导航设置信息通过语音交互通道直接发送给第三方服务中心,然后由第三方服务中心发送给导航设备。
图6为本发明语音网关一实施例的结构示意图,如图6所示,本实施例的语音网关包括:语音识别模块11以及收发模块12,其中,语音识别模块11用于识别导航设备发送的语音导航请求,获取相应的导航请求文本信息;收发模块12用于根据逻辑控制应用文档,将与导航请求文本信息对应的导航设置信息 发送给导航设备,供导航设备根据导航设置信息进行导航设置处理。
具体来说,语音识别模块11对语音导航请求进行识别,获取导航请求文本信息的过程即为将语音波形形式的语音导航请求识别成文本形式的导航请求文本信息。
收发模块12根据逻辑控制应用文档获取导航设置信息并发送的过程即为:将导航请求文本信息“北京”作为逻辑控制应用文档所定义的脚本文件的输入,从而获取相应的导航设置信息。然后,收发模块12可以将该导航设置信息发送给导航设备。导航设备在接收到该导航设置信息后,即可根据该导航设置信息进行自身设置,从而使用户通过该导航设备获取定位信息。该导航设置信息既可以为地点数据也可以为命令信息,对于导航设备本身不存储任何地点数据的情况,该导航设置信息中的地点数据可以为与“北京”相关的导航信息,如经纬度信息等;对于导航设备自身存储有地点数据的情况,该导航设置信息可以为命令信息,导航设备可以根据该命令信息调用该导航设备自身存储的地点数据自行设置。
本实施例通过语音识别模块对导航设备发送的语音导航请求进行识别,可以将语音波形信息识别为文本形式的导航请求文本信息,从而实现语音信号到机器语言的转换;收发模块可以根据逻辑控制应用文档向导航设备发送对应的导航设置信息,从而方便导航设备根据该导航设置信息获取所需的定位信息。由于本实施例中的语音识别模块可以采用ASR组件对导航设备发送的语音导航请求进行识别,且ASR组件可以识别自然语言,且识别率高,因此,本实施例的语音网关识别语音导航请求的正确率高、识别范围广,且语音识别的方式对于车载导航设备来说,可以在行驶状态下保证驾驶员的人身安全;使用本实施例的语音网关获取导航设置信息时,导航设备本身并不需要在硬件上集成语音识别系统,因此还降低了导航设备的成本。
图7为本发明语音网关另一实施例的结构示意图,如图7所示,本实施例的语音网关包括:语音识别模块11以及收发模块12,还包括:获取模块13,语音识别模块11用于识别导航设备发送的语音导航请求,获取相应的导航请求文本信息;收发模块12用于根据逻辑控制应用文档,将与导航请求文本信息对应的导航设置信息发送给导航设备,供导航设备根据导航设置信息进行导航设置处理;收发模块12还用于根据逻辑控制应用文档,向导航设备发送语音引导 信息,并接收导航设备根据语音引导信息发送的导航设置请求;获取模块13用于根据导航设备通过第三方服务中心转发的呼叫接入请求,与导航设备建立语音交互通道,并从文档服务器上获取与呼叫接入请求中携带的呼叫类型标识对应的逻辑控制应用文档。
本实施例的语音网关通过向导航设备发送导航引导信息,从而能够方便地引导用户通过导航设备与语音网关交互,从而获取所需的导航设置信息。由于整个交互过程不需要任何手动操作,均为语音交互,因此可以保证驾驶员在行驶状态下的人身安全,而且本实施例的语音网关识别语音信息的正确率高、识别范围广,准确率稿;采用本实施例的语音网关时,相应的导航设备不需要在硬件上集成语音识别系统,从而降低了导航设备的成本。
图8为本发明导航设备一实施例的结构示意图,如图8所示,本实施例的导航设备包括:语音发送模块15和接收模块16,语音发送模块15用于根据语音网关发送的语音引导信息向语音网关发送语音导航请求;接收模块16用于接收语音网关识别语音导航请求,获取相应的导航请求文本信息后,根据逻辑控制应用文档反馈的、与导航请求文本信息对应的导航设置信息。
在本实施例中,导航设备可以通过向VXML网关发送语音导航请求,并接收VXML网关根据该语音导航请求反馈的导航设置信息,来完成与VXML网关进行语音交互,实现导航定位的功能。在该过程中,由于VXML网关可以采用ASR组件对导航设备发送的语音导航请求进行识别,且ASR组件可以识别自然语言,且识别率高,因此,本实施例的导航设备对发送的语音导航请求的语音质量限制小,识别正确率高,且可以在行驶状态下保证驾驶员的人身安全;导航设备本身并不需要在硬件上集成语音识别系统,而是将语音识别处理交给VXML网关进行处理,从而使得导航设备的成本降低。
本发明导航设备另一实施例以本发明导航设备上一实施例为基础,语音发送模块15还用于通过第三方服务中心向语音网关发送呼叫接入请求,与语音网关建立语音交互通道;接收模块16还用于接收语音网关根据所述呼叫接入请求中携带的呼叫类型标识获取的逻辑控制应用文档后,通过语音交互通道发送的,用于指示导航设备发送语音导航请求的语音引导信息。
本实施例中,导航设备通过向VXML网关发送语音导航请求,并接收VXML网关根据该语音导航请求反馈的导航设置信息,来完成与VXML网关进行语音 交互,实现导航定位的功能。本实施例对语音质量限制小,识别正确率高,且可以保证驾驶员在行驶状态下的人身安全;导航设备并不需要在硬件上集成语音识别系统,从而降低了导航设备的成本。
图9为本发明语音导航系统一实施例的结构示意图,如图9所示,本实施例的系统包括:语音网关1和文档服务器2,语音网关1用于根据导航设备通过第三方服务中心转发的呼叫接入请求,与导航设备建立语音交互通道,并从文档服务器上获取与呼叫接入请求中携带的呼叫类型标识对应的逻辑控制应用文档,根据逻辑控制应用文档,向导航设备发送语音引导信息,并接收导航设备根据语音引导信息发送的语音导航请求,识别语音导航请求,获取相应的导航请求文本信息,根据逻辑控制应用文档,将与导航请求文本信息对应的导航设置信息发送给导航设备,供导航设备根据导航设置信息进行导航设置处理;文档服务器2用于向语音网关1发送逻辑控制应用文档。
更进一步地,本实施例还可以包括命令下发设备,用于将导航设置信息发送给第三方服务中心,使第三方服务中心通过短信通道或者语音交互通道,将导航设置信息发送给导航设备。
本实施例的语音导航系统中,导航设备可以通过与语音网关之间建立的语音通道与语音网关进行语音交互,从而方便导航设备根据该导航设置信息获取所需的定位信息。由于本实施例的语音网关可以采用ASR组件对导航设备发送的语音导航请求进行识别,因此,本实施例的语音网关识别语音导航请求的正确率高、识别范围广,且语音识别的方式对于车载导航设备来说,可以保证驾驶员在行驶状态下的人身安全;导航设备本身并不需要在硬件上集成语音识别系统,从而降低了导航设备的成本,而且,语音网关通过短信通道或者语音交互通道将导航设置信息发送给导航设备,还使得导航设备获取导航设置信息的途径灵活多样。
通过以上实施例的描述,本领域的技术人员可以清楚地了解到本发明可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明实施例的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该软件产品存储在一个存储介质中,包括若干指令用以使得移动设备(可以是手机,个人计算机,媒体播放器等)执行本发明各个实施例所述的方法。这里所称的存储 介质,如:ROM/RAM、磁盘、光盘等。
最后应说明的是:以上实施例仅用以说明本发明的技术方案而非对其进行限制,尽管参照较佳实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对本发明的技术方案进行修改或者等同替换,而这些修改或者等同替换亦不能使修改后的技术方案脱离本发明技术方案的精神和范围。

Claims (10)

  1. 一种导航信息处理方法,其特征在于,包括:
    识别导航设备发送的语音导航请求,获取相应的导航请求文本信息,其中,所述语音导航请求至少包括用户发出的第一语音频段请求以及第二语音频段请求其中之一;
    根据逻辑控制应用文档,将与所述导航请求文本信息对应的导航设置信息发送给所述导航设备,以便所述导航设备根据所述导航设置信息进行导航设置处理。
  2. 根据权利要求1所述的导航信息处理方法,其特征在于,所述识别导航设备发送的语音导航请求,获取相应的导航请求文本信息之前,包括:
    根据所述逻辑控制应用文档,向所述导航设备发送语音引导信息,并接收所述导航设备根据所述语音引导信息发送的所述语音导航请求。
  3. 根据权利要求2所述的导航信息处理方法,其特征在于,所述向所述导航设备发送语音引导信息之前,包括:
    根据所述导航设备通过第三方服务中心转发的呼叫接入请求,与所述导航设备建立语音交互通道,并从文档服务器上获取与所述呼叫接入请求中携带的呼叫类型标识对应的逻辑控制应用文档;
    其中,所述第一语音频段请求位于正常听阈内,所述第二语音频段请求超出正常听阈。
  4. 根据权利要求3所述的导航信息处理方法,其特征在于,所述根据所述逻辑控制应用文档,向所述导航设备发送语音引导信息,并接收所述导航设备根据所述语音引导信息发送的所述语音导航请求,包括:
    根据所述逻辑控制应用文档,获取引导文本信息;
    对所述引导文本信息进行语音合成处理,生成所述语音引导信息;
    通过所述语音交互通道将所述语音引导信息发送给所述导航设备;
    接收所述导航设备通过所述语音交互通道发送的、与所述语音引导信息对应的所述语音导航请求。
  5. 一种导航信息获取方法,其特征在于,包括:
    根据语音网关发送的语音引导信息,向所述语音网关发送语音导航请求,以便于所述语音网关识别语音导航请求,并根据逻辑控制应用文档反馈获取所 述语音导航请求相应的导航请求文本信息,其中,所述语音导航请求至少包括用户发出的第一语音频段请求以及第二语音频段请求其中之一;
    接收所述语音网关发送的所述导航请求文本信息对应的导航设置信息。
  6. 根据权利要求5所述的导航信息获取方法,其特征在于,所述向所述语音网关发送语音导航请求之前,还包括:
    通过第三方服务中心向所述语音网关发送呼叫接入请求,并建立与所述语音网关的语音交互通道;
    通过所述语音交互通道,接收所述语音网关发送的语音引导信息,所述语音引导信息用于指示向所述语音网关发送语音导航请求。
  7. 权利要求5所述的导航信息获取方法,其特征在于,所述第一语音频段请求位于正常听阈内,所述第二语音频段请求超出正常听阈。
  8. 一种语音导航系统,其特征在于,包括:
    语音网关,用于识别导航设备发送的语音导航请求,获取相应的导航请求文本信息;并根据逻辑控制应用文档,将与所述导航请求文本信息对应的导航设置信息发送给所述导航设备,以便所述导航设备根据所述导航设置信息进行导航设置处理;其中,所述语音导航请求至少包括用户发出的第一语音频段请求以及第二语音频段请求其中之一;
    文档服务器,用于向所述语音网关发送所述逻辑控制应用文档。
  9. 根据权利要求8所述的语音导航系统,其特征在于,还包括:
    命令下发设备,用于将所述导航设置信息发送给所述第三方服务中心,以便所述第三方服务中心通过短信通道或者语音交互通道,将所述导航设置信息发送给所述导航设备。
  10. 根据权利要求8所述的语音导航系统,其特征在于,所述第一语音频段请求位于正常听阈内,所述第二语音频段请求超出正常听阈。
PCT/CN2015/077914 2015-04-30 2015-04-30 导航信息处理、获取方法及语音导航系统 WO2016172899A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2015/077914 WO2016172899A1 (zh) 2015-04-30 2015-04-30 导航信息处理、获取方法及语音导航系统

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2015/077914 WO2016172899A1 (zh) 2015-04-30 2015-04-30 导航信息处理、获取方法及语音导航系统

Publications (1)

Publication Number Publication Date
WO2016172899A1 true WO2016172899A1 (zh) 2016-11-03

Family

ID=57198193

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/077914 WO2016172899A1 (zh) 2015-04-30 2015-04-30 导航信息处理、获取方法及语音导航系统

Country Status (1)

Country Link
WO (1) WO2016172899A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115015900A (zh) * 2022-05-30 2022-09-06 广州海事科技有限公司 一种船舶定位方法、系统、计算机设备及存储介质

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040068370A1 (en) * 2002-10-08 2004-04-08 Moody Peter A. Use of distributed speech recognition (DSR) for off-board application processing
CN101846525A (zh) * 2009-03-23 2010-09-29 华为软件技术有限公司 导航信息处理、获取方法及装置

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040068370A1 (en) * 2002-10-08 2004-04-08 Moody Peter A. Use of distributed speech recognition (DSR) for off-board application processing
CN101846525A (zh) * 2009-03-23 2010-09-29 华为软件技术有限公司 导航信息处理、获取方法及装置

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115015900A (zh) * 2022-05-30 2022-09-06 广州海事科技有限公司 一种船舶定位方法、系统、计算机设备及存储介质

Similar Documents

Publication Publication Date Title
US9564132B2 (en) Communication system and method between an on-vehicle voice recognition system and an off-vehicle voice recognition system
US10380992B2 (en) Natural language generation based on user speech style
US10679620B2 (en) Speech recognition arbitration logic
US9159322B2 (en) Services identification and initiation for a speech-based interface to a mobile device
US9679562B2 (en) Managing in vehicle speech interfaces to computer-based cloud services due recognized speech, based on context
US9183835B2 (en) Speech-based user interface for a mobile device
US9583100B2 (en) Centralized speech logger analysis
CN109065053B (zh) 用于处理信息的方法和装置
JP6202041B2 (ja) 車両用音声対話システム
CN101846525B (zh) 导航信息处理、获取方法及装置
CN103617795A (zh) 一种车载语音识别控制方法及系统
JP5413321B2 (ja) 通信システム、車載端末、および携帯端末
US20180075842A1 (en) Remote speech recognition at a vehicle
CN111094924A (zh) 用于执行基于语音的人机交互的数据处理装置和方法
JP2017211539A (ja) 音声処理システムおよび音声処理方法
US11386891B2 (en) Driving assistance apparatus, vehicle, driving assistance method, and non-transitory storage medium storing program
WO2016172899A1 (zh) 导航信息处理、获取方法及语音导航系统
JP2014062944A (ja) 情報処理装置
JP2017181667A (ja) 音声認識装置および音声認識方法
WO2015111256A1 (ja) 音声調整システム、サーバ及び車載装置
US20160307562A1 (en) Controlling speech recognition systems based on radio station availability
JP2020123759A (ja) 通信装置、通信システム、通信方法、及び通信制御プログラム
CN109916418A (zh) 调整导航路线的方法与装置
JP2017159850A (ja) 応答システムおよび応答プログラム

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15890279

Country of ref document: EP

Kind code of ref document: A1

WA Withdrawal of international application
NENP Non-entry into the national phase

Ref country code: DE