WO2021232726A1 - Navigation audio playback method, apparatus, device, and computer storage medium - Google Patents

Info

Publication number
WO2021232726A1
WO2021232726A1 PCT/CN2020/131319 CN2020131319W WO2021232726A1 WO 2021232726 A1 WO2021232726 A1 WO 2021232726A1 CN 2020131319 W CN2020131319 W CN 2020131319W WO 2021232726 A1 WO2021232726 A1 WO 2021232726A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio
navigation
navigation audio
user
estimated arrival
Prior art date
Application number
PCT/CN2020/131319
Inventor
黄际洲
张昊
Original Assignee
百度在线网络技术(北京)有限公司
Application filed by 百度在线网络技术(北京)有限公司
Priority to KR1020217027814A (KR20210114537A, ko)
Priority to SG11202107063XA (SG11202107063XA, en)
Priority to US17/419,013 (US20220308826A1, en)
Priority to EP20900748.3A (EP3940341A4, en)
Priority to JP2021538075A (JP7383026B2, ja)
Publication of WO2021232726A1 (zh)

Classifications

    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01C MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/26 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
    • G01C21/34 Route searching; Route guidance
    • G01C21/36 Input/output arrangements for on-board computers
    • G01C21/3697 Output of additional, non-guidance related information, e.g. low fuel level
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01C MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/26 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
    • G01C21/34 Route searching; Route guidance
    • G01C21/36 Input/output arrangements for on-board computers
    • G01C21/3626 Details of the output of route guidance instructions
    • G01C21/3629 Guidance using speech or audio output, e.g. text-to-speech

Definitions

  • This application relates to the field of computer application technology, in particular to the field of big data technology.
  • The present application provides a navigation audio playback method, apparatus, device, and computer storage medium to solve the above technical problems.
  • This application provides a navigation audio playback method, which includes:
  • determining the navigation audio to be broadcast in the navigation route and the corresponding broadcast location points; and
  • broadcasting the corresponding navigation audio at each broadcast location point, and selecting the non-navigation audio to be played according to the gap time between broadcast location points.
  • This application provides a navigation audio playback device, which includes:
  • a navigation determination unit, used to determine the navigation audio to be broadcast in the navigation route and the corresponding broadcast location points; and
  • a broadcast processing unit, configured to broadcast the corresponding navigation audio at each broadcast location point, and to select the non-navigation audio to be played according to the gap time between broadcast location points.
  • This application provides an electronic device, including:
  • at least one processor; and
  • a memory communicatively connected with the at least one processor; wherein
  • the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor so that the at least one processor can execute the method described in any one of the foregoing.
  • The present application provides a non-transitory computer-readable storage medium storing computer instructions, wherein the computer instructions are used to make a computer execute any of the methods described above.
  • FIG. 1 shows an exemplary system architecture to which embodiments of the present invention can be applied.
  • FIG. 2 is a flowchart of the main method provided by an embodiment of the application.
  • FIG. 3 is a flowchart of a method for determining non-navigation audio provided by an embodiment of the application.
  • FIG. 4 is a structural diagram of a navigation audio playback device provided by an embodiment of the application.
  • FIG. 5 is an example diagram of broadcasting in a navigation route provided by an embodiment of the application.
  • FIG. 6 is a block diagram of an electronic device used to implement an embodiment of the present application.
  • FIG. 1 shows an exemplary system architecture to which embodiments of the present invention can be applied.
  • the system architecture may include terminal devices 101 and 102, a network 103, and a server 104.
  • the network 103 is used to provide a medium for communication links between the terminal devices 101 and 102 and the server 104.
  • the network 103 may include various connection types, such as wired, wireless communication links, or fiber optic cables, and so on.
  • the user can use the terminal devices 101 and 102 to interact with the server 104 through the network 103.
  • Various applications may be installed on the terminal devices 101 and 102, such as map applications, voice interactive applications, web browser applications, communication applications, and so on.
  • the terminal devices 101 and 102 may be various electronic devices, including but not limited to smart phones, tablet computers, smart speakers, smart wearable devices, and so on.
  • The navigation audio playback device provided by the present invention can be set up and run in the server 104 mentioned above, or in the terminal device 101 or 102. It can be implemented as multiple pieces of software or software modules (for example, to provide distributed services), or as a single piece of software or software module, which is not specifically limited here.
  • For example, if the navigation audio playback device is set up and running on the server 104, it uses the method provided by the embodiment of the present invention to determine the navigation audio and non-navigation audio to be broadcast in the navigation route, and provides them to the terminal device 101 or 102 for playback.
  • the server 104 may be a single server or a server group composed of multiple servers. It should be understood that the numbers of terminal devices, networks, and servers in FIG. 1 are merely illustrative. There can be any number of terminal devices, networks, and servers according to implementation needs.
  • FIG. 2 is a flowchart of the main method provided by an embodiment of the application.
  • In this application, the playback of navigation audio and the playback of non-navigation audio are no longer controlled by different applications, but are controlled and played in a unified manner by the navigation audio playback device, for example by a map application with a navigation function.
  • the method may include the following steps:
  • The navigation audio to be broadcast in the navigation route and the corresponding broadcast location points are determined.
  • Route planning is performed based on the starting point, the end point, and the travel mode input by the user, and the route planning result is returned to the user.
  • The user can select one of the planned routes for navigation.
  • The method is executed when the user selects a route for navigation, and the route selected by the user is the navigation route. A navigation route has multiple broadcast location points at which navigation audio is played.
  • A broadcast location point is the specific geographic location at which a navigation audio is broadcast, for example a location a certain distance before an intersection at which a turn instruction is broadcast, or the entrance of a road section at which that section's speed limit is broadcast.
  • The navigation audio broadcast along a navigation route is often dense, but not every user needs all of it.
  • If the user is familiar with a navigation route, often only some of the key navigation audio is needed; if the user is not familiar with it, more navigation content is needed. Therefore, as a preferred embodiment, according to the user's familiarity with the navigation route and the importance of each navigation audio, the navigation audio whose importance level matches that familiarity can be selected from the navigation audio of the route as the navigation audio to be broadcast, so that the navigation broadcast matches the user's needs during navigation.
  • the user's familiarity with the navigation route can be determined based on the number of times the user has navigated the route in history. If the number of times the user has navigated the route in history exceeds the preset number threshold, it can be considered that the user is familiar with the navigation route.
  • For example, if user A is familiar with a route, the navigation broadcast content mainly focuses on turning points, each intersection is announced only once, and electronic eye (traffic camera) information is not prompted.
  • If user B is not familiar with the route, the navigation broadcast content needs to include detailed instructions to assist judgment, and electronic eye information is also broadcast.
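  • The familiarity check and importance-based filtering described above could be sketched as follows. This is only an illustration: the `NavPrompt` type, the integer importance scale, and the default threshold values are assumptions, not part of the application.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class NavPrompt:
    text: str
    importance: int  # hypothetical scale: higher means more critical


def is_familiar(history_count: int, count_threshold: int = 5) -> bool:
    # Familiar when the number of past navigations exceeds the preset threshold.
    return history_count > count_threshold


def select_prompts(prompts: List[NavPrompt], history_count: int,
                   count_threshold: int = 5,
                   key_importance: int = 2) -> List[NavPrompt]:
    # Familiar users keep only key prompts; unfamiliar users keep everything.
    if is_familiar(history_count, count_threshold):
        return [p for p in prompts if p.importance >= key_importance]
    return list(prompts)


prompts = [
    NavPrompt("Turn left at the next intersection", 3),
    NavPrompt("Electronic eye ahead", 1),
    NavPrompt("Speed limit 60 km/h", 1),
]
familiar = select_prompts(prompts, history_count=12)
unfamiliar = select_prompts(prompts, history_count=0)
```

  • With these assumed values, the familiar user hears only the turn instruction, while the unfamiliar user hears all three prompts.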
  • The corresponding navigation audio is broadcast at each broadcast location point, and the non-navigation audio to be played is selected according to the gap time between broadcast location points.
  • The navigation device plays non-navigation audio during the idle time between navigation audios.
  • One implementation is to determine in advance the whole sequence of non-navigation audio to be played over the entire navigation route, and then play it in the determined order.
  • This amounts to a predetermined audio sequence. During the journey, however, changes in road conditions, changes in the user's speed, stops, and so on often change the time at which the user reaches a broadcast location point, that is, the gap time changes; the predetermined sequence is then no longer suitable and must be adjusted. Therefore, another preferred embodiment can be used: every time a navigation audio or non-navigation audio is played, the next non-navigation audio to be played is determined in real time.
  • The location of the user when the current navigation audio or non-navigation audio finishes playing is determined.
  • The (i+1)-th audio to be played can be determined, where the audio can be navigation audio or non-navigation audio.
  • The location of the user when the current i-th audio finishes playing can be estimated based on the remaining duration of the i-th audio and the current speed of the user, where i is a positive integer.
  • The position of the user when the current i-th audio finishes playing is the current position of the user.
  • Each audio to be played is determined one by one. For example, if the first item is a navigation audio, it is taken as the current audio to determine whether navigation audio or non-navigation audio is played next and, if non-navigation audio, which one. Once the next audio is determined, it is taken as the current audio and the same determination is repeated, and so on. For each current audio, the location of the user when it finishes playing is determined; this can be estimated based on the average speed of the user's travel mode and the playing time of each audio.
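  • A minimal sketch of the position estimate, assuming the user's position is expressed as a distance in metres travelled along the navigation route (a representation chosen here for illustration) and that speed stays constant while the current audio finishes:

```python
def position_when_audio_ends(current_pos_m: float,
                             remaining_s: float,
                             speed_mps: float) -> float:
    """Estimate the distance along the route reached when the current audio ends.

    current_pos_m: distance already travelled along the route (assumed unit: metres)
    remaining_s:   remaining playback time of the current audio, in seconds
    speed_mps:     current speed of the user, in metres per second
    """
    if remaining_s < 0 or speed_mps < 0:
        raise ValueError("remaining time and speed must be non-negative")
    return current_pos_m + remaining_s * speed_mps


# 30 s of audio left at 15 m/s moves the user 450 m further along the route.
end_pos = position_when_audio_ends(1000.0, 30.0, 15.0)
```

  • When no playback time remains, the estimate reduces to the user's current position.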
  • According to the user's location, the estimated arrival time at the broadcast location point of the next navigation audio is determined.
  • The estimated time of arrival (ETA) at the broadcast location point of the next navigation audio can be estimated based on the distance between the user's location and that broadcast location point, the user's speed, and road conditions.
  • the specific implementation of this part can use any ETA estimation method in the prior art, which will not be described in detail here.
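  • The application leaves the ETA method open. Purely as an illustration, a very simple estimate divides the remaining distance by the user's speed, scaled by a congestion factor; the factor and its range are assumptions made for this sketch, and a production system would more likely call a dedicated ETA service.

```python
def estimate_eta_s(distance_m: float,
                   speed_mps: float,
                   congestion_factor: float = 1.0) -> float:
    """Rough ETA in seconds to the next broadcast location point.

    congestion_factor is a hypothetical multiplier in (0, 1]:
    1.0 means free-flowing traffic, 0.5 means traffic moves at half speed.
    """
    effective_speed = speed_mps * congestion_factor
    if effective_speed <= 0:
        return float("inf")  # the user is not moving, so arrival cannot be estimated
    return distance_m / effective_speed


free_flow = estimate_eta_s(3000.0, 15.0)        # 200 seconds
congested = estimate_eta_s(3000.0, 15.0, 0.5)   # 400 seconds
```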
  • the next non-navigation audio to be played is selected according to the estimated arrival time.
  • The key requirement is that the selected non-navigation audio, or at least its core content, can finish playing within the estimated arrival time.
  • the core content of the non-navigation audio refers to the part that can embody the theme of the non-navigation audio, and the user will generally understand the content of the audio after listening to the core content.
  • the core content of a news audio is the part that can reflect the theme of the news
  • the core content of a cross talk audio is the part that contains the main content of the cross talk
  • the core content of a song audio is the part that contains the main song of the song.
  • the core content of a joke is the part of the joke that contains the joke.
  • Non-navigation audio can be obtained and selected from an audio pool, where the audio pool can be an audio pool maintained by a service provider of a map application, or an audio pool provided by a service provider of a third-party application with which it has a cooperative relationship.
  • the audio pool contains various types of non-navigation audio, including but not limited to news, novels, music, songs, jokes, and so on.
  • the audio pool also maintains the audio duration and core content identifiers of each non-navigation audio.
  • the core content identification refers to the identification of the start time and the end time of the core content of the non-navigation audio, and the playback duration of the core content can be determined by the identification.
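  • An audio pool entry carrying the audio duration and the core content identifiers could look like the sketch below. The `AudioItem` type and its field names are hypothetical; only the idea of storing the start and end time of the core content comes from the text above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AudioItem:
    title: str
    kind: str                             # e.g. "news", "music", "joke"
    duration_s: float                     # full audio duration
    core_start_s: Optional[float] = None  # core content start identifier
    core_end_s: Optional[float] = None    # core content end identifier

    def core_duration_s(self) -> float:
        # Without identifiers, treat the whole audio as its core content.
        if self.core_start_s is None or self.core_end_s is None:
            return self.duration_s
        return self.core_end_s - self.core_start_s

    def fits(self, eta_s: float) -> bool:
        # The audio, or its core content, must be shorter than the ETA.
        return self.duration_s < eta_s or self.core_duration_s() < eta_s


song = AudioItem("song", "music", 240.0, core_start_s=60.0, core_end_s=150.0)
news = AudioItem("news", "news", 100.0)
```

  • In this example the song's core content lasts 90 seconds, so it qualifies for a 120-second gap even though the full track is longer.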
  • non-navigation audio can also be selected based on the user's playback needs. Several preferred options are provided below:
  • Method 1: If the estimated arrival time is greater than the preset first duration threshold, the next navigation audio is considered to be far away, and the user-demand-priority selection method is adopted: from the non-navigation audio whose audio duration or core content playback duration is less than the estimated arrival time, select the non-navigation audio required by the user.
  • If the estimated arrival time lies between the second duration threshold and the first duration threshold, the next navigation audio is considered to be relatively close, and the time-priority selection method is adopted: from the non-navigation audio whose audio duration or core content playback duration is less than and close to the estimated arrival time, select the non-navigation audio required by the user, where "close" means that the difference between the estimated arrival time and the audio duration or core content playback duration is less than the second duration threshold.
  • If the estimated arrival time is less than the second duration threshold, the next navigation audio is considered to be about to play, and no non-navigation audio is inserted.
  • The first duration threshold is greater than the second duration threshold.
  • For example, the first duration threshold may be 4 minutes and the second duration threshold may be 10 seconds. If, after a navigation audio or non-navigation audio is played, the estimated arrival time at the next navigation audio is determined to be 6 minutes, which is greater than 4 minutes, the user-demand-priority selection method is adopted: non-navigation audio whose playback duration exceeds 6 minutes is filtered out of the audio source (which includes various non-navigation audio), and the audio that best meets the user's needs is selected from the remainder.
  • If the estimated arrival time falls between the two thresholds, the time-priority selection method is adopted. For an estimated arrival time of 3 minutes, for example, find in the audio source the non-navigation audio whose audio duration or core content playback duration is between 2 minutes 50 seconds and 3 minutes, and then determine from these the non-navigation audio that meets the user's needs.
  • If the estimated arrival time is less than 10 seconds, non-navigation audio is no longer selected as the next audio; playback waits for the next navigation audio.
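  • The threshold-based selection can be sketched as below, using the example thresholds of 4 minutes and 10 seconds. The tuple layout and the numeric "need score" are assumptions for illustration; the duration in each tuple stands for whichever of the audio duration or core content duration is being compared.

```python
from typing import List, Optional, Tuple

# (title, duration in seconds, need score); the score is a hypothetical
# measure of how well the audio matches what the user wants right now.
Candidate = Tuple[str, float, float]

FIRST_THRESHOLD_S = 240.0   # 4 minutes, as in the example
SECOND_THRESHOLD_S = 10.0   # 10 seconds, as in the example


def select_next(pool: List[Candidate], eta_s: float,
                first_s: float = FIRST_THRESHOLD_S,
                second_s: float = SECOND_THRESHOLD_S) -> Optional[Candidate]:
    if eta_s < second_s:
        return None  # navigation audio is imminent: insert nothing
    if eta_s > first_s:
        # User-demand priority: anything shorter than the ETA qualifies.
        fitting = [c for c in pool if c[1] < eta_s]
    else:
        # Time priority: shorter than the ETA and within second_s of it.
        fitting = [c for c in pool if c[1] < eta_s and eta_s - c[1] < second_s]
    return max(fitting, key=lambda c: c[2], default=None)


pool = [("news", 300.0, 0.9), ("novel", 400.0, 1.0),
        ("song", 170.0, 0.5), ("joke", 175.0, 0.7)]
far = select_next(pool, 360.0)    # 6 minutes away
near = select_next(pool, 180.0)   # 3 minutes away
imminent = select_next(pool, 5.0)
```

  • With this pool, the 6-minute gap selects the news item (the novel is too long), the 3-minute gap selects the 175-second joke, and the 5-second gap selects nothing.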
  • Alternatively, when the estimated arrival time is greater than the second duration threshold, a non-navigation audio with the most appropriate duration is directly selected from the non-navigation audio required by the user. For example, after a navigation audio or non-navigation audio is played, the estimated arrival time at the next navigation audio is determined to be 5 minutes, which is greater than 10 seconds. All the non-navigation audio required by the user is then determined from the audio source, and from it the audio whose audio duration or core content playback duration is less than and closest to 5 minutes is selected, such as a news item of 4 minutes 55 seconds.
  • Otherwise, non-navigation audio is no longer selected as the next audio; playback waits for the next navigation audio.
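  • This second approach amounts to a best-fit search over the audio the user wants. The sketch below assumes the user-required audio has already been determined and is given as `(title, duration)` pairs; the function name and data layout are illustrative only.

```python
from typing import List, Optional, Tuple

Needed = Tuple[str, float]  # (title, audio or core content duration in seconds)


def pick_best_fit(needed: List[Needed], eta_s: float,
                  second_s: float = 10.0) -> Optional[Needed]:
    if eta_s <= second_s:
        return None  # too close to the next navigation audio
    # Keep audio shorter than the ETA and take the one closest to it.
    fitting = [a for a in needed if a[1] < eta_s]
    return max(fitting, key=lambda a: a[1], default=None)


needed = [("news", 295.0), ("song", 200.0), ("novel", 320.0)]
best = pick_best_fit(needed, 300.0)  # 5-minute gap
```

  • For the 5-minute gap in the example, the 4 minute 55 second news item is chosen.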
  • The non-navigation audio required by the user can be determined according to at least one of the destination, environmental conditions, route conditions, user driving conditions, and user preference information.
  • the destination is mainly the type information of the destination, such as company, home, supermarket, transportation hub, scenic spot, etc. For example, users prefer warm music when they go home, news audio when they go to the company, and cheerful music when they go to scenic spots, and so on.
  • The environmental conditions can include the current time, date, whether it is a holiday or a working day, weather, and so on. These environmental conditions may affect the audio the user wants. For example, if the weather is clear, users tend to prefer warm music, and users tend to prefer warm music if the weather is gloomy. For another example, users prefer song audio on holidays and news audio on weekdays, and so on.
  • The route conditions may include the congestion state, road grade, length, and so on of the current route. These conditions may also affect the audio the user wants. For example, on a congested route, the user prefers soothing music or news about road conditions. For another example, on a flat and long route, users prefer novel audio, and so on.
  • The user's driving conditions may include the user's driving time, driving mileage, the congestion status of the road sections travelled, and so on. These conditions reflect user fatigue to a certain extent and also affect the audio the user wants. For example, when the user has been driving for a long time or over a long distance, the user needs to stay alert and has a greater need for invigorating audio, such as rock music.
  • the user preference information may include the user's preference tag for the audio type, preference vector, etc. For example, the user prefers news type audio, or the user prefers jazz music, and so on.
  • the user preference information can be determined by a tag set by the user, or can be determined based on the user's behavior feedback on the audio file (for example, the behavior of switching audio files, the behavior of collecting audio files, the behavior of listening to complete, etc.).
  • At least one of the above factors can be combined to determine the non-navigation audio required by the user.
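  • One way to combine these factors is a simple additive score over audio tags, sketched below. The rule table reuses examples from the text (warm music for home, news for the company, soothing music in congestion), but the tag names, equal weights, and the 2-hour fatigue cutoff are assumptions made for illustration, not the application's method.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class Context:
    destination_type: str = ""   # e.g. "home", "company", "scenic spot"
    congested: bool = False
    driving_hours: float = 0.0
    preferred_tags: Set[str] = field(default_factory=set)


def wanted_tags(ctx: Context, long_drive_hours: float = 2.0) -> Dict[str, float]:
    weights: Dict[str, float] = {}

    def bump(tag: str, weight: float) -> None:
        weights[tag] = weights.get(tag, 0.0) + weight

    destination_rules = {"home": "warm music", "company": "news",
                         "scenic spot": "cheerful music"}
    if ctx.destination_type in destination_rules:
        bump(destination_rules[ctx.destination_type], 1.0)
    if ctx.congested:
        bump("soothing music", 1.0)
        bump("road condition news", 1.0)
    if ctx.driving_hours >= long_drive_hours:
        bump("invigorating music", 1.0)
    for tag in ctx.preferred_tags:
        bump(tag, 1.0)
    return weights


def need_score(audio_tags: Set[str], ctx: Context) -> float:
    # Sum the weights of every tag the audio shares with the current context.
    weights = wanted_tags(ctx)
    return sum(weights.get(tag, 0.0) for tag in audio_tags)


ctx = Context("company", congested=True, preferred_tags={"news"})
news_score = need_score({"news"}, ctx)
```

  • A score like this could serve as the "need" ranking when choosing among audio that fits the available gap.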
  • A switching prompt sound can be played between non-navigation audio and navigation audio. That is, when playback switches from non-navigation audio to navigation audio, a prompt sound is added so that the user can anticipate the navigation audio and avoid missing a turn at an intersection, committing a traffic violation, and so on.
  • the switching prompt sound can be, for example, a short beep sound, a human voice prompt, and so on.
  • the specific form of the prompt tone is not specifically limited here.
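  • As a small sketch of this rule, the function below inserts a prompt item whenever a navigation audio directly follows a non-navigation audio. The `(kind, title)` tuples and the `"nav"`, `"non_nav"`, and `"prompt"` labels are hypothetical names used only here.

```python
from typing import List, Tuple

Item = Tuple[str, str]  # (kind, title); kind is "nav", "non_nav", or "prompt"


def with_switch_prompts(sequence: List[Item],
                        prompt: Item = ("prompt", "beep")) -> List[Item]:
    out: List[Item] = []
    for item in sequence:
        # Add the prompt only on a non-navigation -> navigation transition.
        if item[0] == "nav" and out and out[-1][0] == "non_nav":
            out.append(prompt)
        out.append(item)
    return out


playlist = [("non_nav", "news A"), ("non_nav", "music a"),
            ("nav", "Keep right"), ("nav", "Turn left"),
            ("non_nav", "music b"), ("nav", "Exit the highway")]
played = with_switch_prompts(playlist)
```

  • Two consecutive navigation audios get no prompt between them, since the user is already attending to navigation.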
  • FIG. 4 is a structural diagram of a navigation audio playback device provided by an embodiment of the application.
  • The device can be implemented on the server side, for example as a server-side application, or as a functional unit such as a plug-in or software development kit (SDK) located in a server-side application. Alternatively, if the terminal device has sufficient computing power, it can also be implemented on the terminal device side.
  • the device may include: a navigation determination unit 00 and a broadcast processing unit 10, wherein the main functions of each component unit are as follows:
  • the navigation determination unit 00 is responsible for determining the navigation audio and the broadcast location point to be broadcast in the navigation route.
  • the navigation determining unit 00 may select a navigation audio with an importance level matching the familiarity level from the navigation audio of the navigation route as the navigation audio to be broadcast according to the user's familiarity with the navigation route and the importance of the navigation audio.
  • the user's familiarity with the navigation route can be determined based on the number of times the user has navigated the route in history. If the number of times the user has navigated the route in history exceeds the preset number threshold, it can be considered that the user is familiar with the navigation route.
  • The broadcast processing unit 10 is responsible for broadcasting the corresponding navigation audio at each broadcast location point, and for selecting the non-navigation audio to be played according to the gap time between broadcast location points.
  • the broadcast processing unit 10 may specifically include: a scene judgment subunit 11 and a content recommendation subunit 12.
  • the scene judgment subunit 11 is responsible for determining the location of the user when the current navigation audio or non-navigation audio is played; and according to the user's location, determining the estimated time of arrival to the broadcast location of the next navigation audio.
  • the estimated arrival time to the broadcast location of the next navigation audio can be estimated based on the distance between the location of the user and the broadcast location of the next navigation audio, the user's speed, and road conditions.
  • the scene judgment subunit 11 may provide the user's location and the broadcast position of the next navigation audio to the ETA service by calling the ETA service interface, and the ETA service will estimate the estimated arrival time and return it to the scene judgment subunit 11.
  • the content recommendation subunit 12 is responsible for selecting the next non-navigation audio to be played according to the estimated arrival time.
  • the content recommendation subunit 12 may use, but is not limited to, the following methods to select the next non-navigation audio to be played.
  • Method 1: If the estimated arrival time is greater than the preset first duration threshold, the next navigation audio is considered to be far away, and the user-demand-priority selection method is adopted: from the non-navigation audio whose audio duration or core content playback duration is less than the estimated arrival time, select the non-navigation audio required by the user.
  • If the estimated arrival time lies between the second duration threshold and the first duration threshold, the next navigation audio is considered to be close, and the time-priority selection method is adopted: from the non-navigation audio whose audio duration or core content playback duration is less than and close to the estimated arrival time, select the non-navigation audio required by the user, where the difference between the estimated arrival time and the audio duration or core content playback duration is less than the second duration threshold.
  • The first duration threshold is greater than the second duration threshold.
  • The content recommendation subunit 12 can obtain non-navigation audio from an audio pool for selection, where the audio pool can be maintained by the service provider of a map application, or provided by the service provider of a third-party application with which it has a cooperative relationship.
  • the audio pool contains various types of non-navigation audio, including but not limited to news, novels, music, songs, jokes, and so on.
  • the audio pool also maintains the audio duration and core content identifiers of each non-navigation audio.
  • the core content identification refers to the identification of the start time and the end time of the core content of the non-navigation audio, and the playback duration of the core content can be determined by the identification.
  • the content recommendation subunit 12 may determine the non-navigation audio required by the user according to at least one of destination, environmental conditions, route conditions, user driving conditions, and user preference information.
  • The broadcast processing unit 10 can also play a switching prompt sound between non-navigation audio and navigation audio, alerting the user to listen to the navigation audio that follows, so as to prevent the user from missing a turn at an intersection, committing a traffic violation, and so on.
  • Navigation route 1 is user A's commuting route from home to the company. Because user A is familiar with the route, the navigation broadcast content mainly focuses on turning points, each intersection is announced only once, and electronic eye information is not prompted.
  • The audio types that the user prefers are mainly news, music, and jokes.
  • News A is recommended first; the playing time of news A is less than the estimated time for the user to reach broadcast location point 1.
  • When the estimated arrival time at broadcast location point 1 is less than 4 minutes, music a, which the user is interested in, is played to fill the time.
  • When the estimated arrival time is less than 10 seconds, no other non-navigation audio is inserted; the switching prompt sound is played for the user, followed by the navigation audio of broadcast location point 1: "Keep right uphill, enter the highway, and head for G6".
  • Music b, which the user is interested in, is then played to fill the time.
  • When the estimated arrival time at broadcast location point 2 is less than 10 seconds, no other non-navigation audio is inserted.
  • The navigation audio of broadcast location point 2 is played: "Keep left ahead and enter the North Fifth Ring Road".
  • News G then starts to play.
  • The estimated arrival time at broadcast location point 3 is less than 4 minutes. Since the user has been driving for a long time and is tired, joke c, which fits the scene at that time, is played to help the user refresh.
  • News H/I/J/K/L continues to be played.
  • The navigation audio of broadcast location point 4 is then broadcast directly: "Keep right ahead, exit the highway, and head towards the Shangdi West Road exit".
  • After leaving the highway, the user is about to enter an extremely slow road section. To keep the user from becoming distracted and causing an accident, calming music d/e/f is played. Then, after the switching prompt sound, "turn left" is broadcast at broadcast location point 5, and so on until the user reaches the destination.
  • the present application also provides an electronic device and a readable storage medium.
  • FIG. 6 is a block diagram of an electronic device for the navigation audio playback method according to an embodiment of the present application.
  • Electronic devices are intended to represent various forms of digital computers, such as laptop computers, desktop computers, workstations, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers.
  • Electronic devices can also represent various forms of mobile devices, such as personal digital assistants, cellular phones, smart phones, wearable devices, and other similar computing devices.
  • The components shown herein, their connections and relationships, and their functions are merely examples, and are not intended to limit the implementation of the application described and/or claimed herein.
  • the electronic device includes one or more processors 601, a memory 602, and interfaces for connecting various components, including a high-speed interface and a low-speed interface.
  • the various components are connected to each other using different buses, and can be installed on a common motherboard or installed in other ways as needed.
  • the processor may process instructions executed in the electronic device, including instructions stored in or on the memory to display graphical information of the GUI on an external input/output device (such as a display device coupled to an interface).
  • Multiple processors and/or multiple buses can be used together with multiple memories.
  • multiple electronic devices can be connected, and each device provides part of the necessary operations (for example, as a server array, a group of blade servers, or a multi-processor system).
  • In FIG. 6, one processor 601 is taken as an example.
  • the memory 602 is a non-transitory computer-readable storage medium provided by this application.
  • the memory stores instructions executable by at least one processor, so that the at least one processor executes the navigation audio playback method provided in this application.
  • the non-transitory computer-readable storage medium of the present application stores computer instructions, and the computer instructions are used to make the computer execute the navigation audio playback method provided by the present application.
  • the memory 602 can be used to store non-transitory software programs, non-transitory computer-executable programs and modules, such as program instructions/modules corresponding to the navigation audio playback method in the embodiment of the present application.
  • The processor 601 executes various functional applications and data processing of the server by running the non-transitory software programs, instructions, and modules stored in the memory 602, that is, it implements the navigation audio playback method in the foregoing method embodiment.
  • the memory 602 may include a program storage area and a data storage area.
  • the program storage area may store an operating system and an application program required by at least one function; the data storage area may store data created according to the use of the electronic device.
  • the memory 602 may include a high-speed random access memory, and may also include a non-transitory memory, such as at least one magnetic disk storage device, a flash memory device, or other non-transitory solid-state storage devices.
  • the memory 602 may optionally include memories remotely provided with respect to the processor 601, and these remote memories may be connected to the electronic device through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, corporate intranets, local area networks, mobile communication networks, and combinations thereof.
  • the electronic device may further include: an input device 603 and an output device 604.
  • the processor 601, the memory 602, the input device 603, and the output device 604 may be connected by a bus or in other ways. In FIG. 6, the connection by a bus is taken as an example.
  • the input device 603 can receive input digital or character information, and generate key signal input related to the user settings and function control of the electronic device, such as touch screen, keypad, mouse, track pad, touch pad, indicator stick, one or more A mouse button, trackball, joystick and other input devices.
  • the output device 604 may include a display device, an auxiliary lighting device (for example, LED), a tactile feedback device (for example, a vibration motor), and the like.
  • the display device may include, but is not limited to, a liquid crystal display (LCD), a light emitting diode (LED) display, and a plasma display. In some embodiments, the display device may be a touch screen.
  • Various implementations of the systems and techniques described herein can be implemented in digital electronic circuit systems, integrated circuit systems, application specific ASICs (application specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: being implemented in one or more computer programs, the one or more computer programs may be executed and/or interpreted on a programmable system including at least one programmable processor, the programmable processor It can be a dedicated or general-purpose programmable processor that can receive data and instructions from the storage system, at least one input device, and at least one output device, and transmit the data and instructions to the storage system, the at least one input device, and the at least one output device. An output device.
  • machine-readable medium and “computer-readable medium” refer to any computer program product, device, and/or device used to provide machine instructions and/or data to a programmable processor ( For example, magnetic disks, optical disks, memory, programmable logic devices (PLD)), including machine-readable media that receive machine instructions as machine-readable signals.
  • machine-readable signal refers to any signal used to provide machine instructions and/or data to a programmable processor.
  • the systems and techniques described here can be implemented on a computer that has: a display device for displaying information to the user (for example, a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) ); and a keyboard and a pointing device (for example, a mouse or a trackball) through which the user can provide input to the computer.
  • a display device for displaying information to the user
  • LCD liquid crystal display
  • keyboard and a pointing device for example, a mouse or a trackball
  • Other types of devices can also be used to provide interaction with the user; for example, the feedback provided to the user can be any form of sensory feedback (for example, visual feedback, auditory feedback, or tactile feedback); and can be in any form (including Acoustic input, voice input, or tactile input) to receive input from the user.
  • the systems and technologies described herein can be implemented in a computing system that includes back-end components (for example, as a data server), or a computing system that includes middleware components (for example, an application server), or a computing system that includes front-end components (for example, A user computer with a graphical user interface or a web browser, through which the user can interact with the implementation of the system and technology described herein), or includes such back-end components, middleware components, Or any combination of front-end components in a computing system.
  • the components of the system can be connected to each other through any form or medium of digital data communication (for example, a communication network). Examples of communication networks include: local area network (LAN), wide area network (WAN), and the Internet.
  • the computer system can include clients and servers.
  • the client and server are generally far away from each other and usually interact through a communication network.
  • the relationship between the client and the server is generated by computer programs that run on the corresponding computers and have a client-server relationship with each other.

Abstract

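A method, apparatus, device and computer storage medium for playing navigation audio, relating to the field of big data technology. The method includes: determining the navigation audios to be broadcast on a navigation route and their broadcast positions (201); broadcasting the corresponding navigation audio at each broadcast position, and selecting the non-navigation audio to play in the gap time between broadcast positions according to the gap duration (202). The method ensures that navigation audio is played while suitable non-navigation audio is inserted into the gaps between navigation audios, so the listening experience for non-navigation audio is preserved as well.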

Description

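A Method, Apparatus, Device and Computer Storage Medium for Playing Navigation Audio
This application claims priority to Chinese patent application No. 2020104398935, filed on May 22, 2020 and entitled "A Method, Apparatus, Device and Computer Storage Medium for Playing Navigation Audio".
Technical Field
This application relates to the field of computer application technology, and in particular to the field of big data technology.
Background
To relieve the fatigue and boredom of driving, drivers often play audio to learn something or pass the time. Meanwhile, users increasingly rely on the navigation services of map applications while driving. Because the two kinds of audio files exist independently and are played by different applications, "collisions" are unavoidable when both play at once. When two audio files sound at the same time, one of them is generally chosen for playback according to priority. If navigation audio is given priority, the non-navigation audio plays intermittently and the listening experience is poor. If non-navigation audio is given priority, the user misses navigation broadcasts and may miss intersections, take detours or violate traffic rules, and may even create safety hazards.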
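Summary
In view of this, this application provides a method, apparatus, device and computer storage medium for playing navigation audio to solve the above technical problem.
In a first aspect, this application provides a method for playing navigation audio, the method including:
determining the navigation audios to be broadcast on a navigation route and the broadcast positions;
broadcasting the corresponding navigation audio at each broadcast position, and selecting the non-navigation audio to play in the gap time between the broadcast positions according to the gap duration.
In a second aspect, this application provides an apparatus for playing navigation audio, the apparatus including:
a navigation determining unit, configured to determine the navigation audios to be broadcast on a navigation route and the broadcast positions;
a broadcast processing unit, configured to broadcast the corresponding navigation audio at each broadcast position, and to select the non-navigation audio to play in the gap time between the broadcast positions according to the gap duration.
In a third aspect, this application provides an electronic device, including:
at least one processor; and
a memory communicatively connected to the at least one processor; wherein
the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor to enable the at least one processor to perform any of the methods described above.
In a fourth aspect, this application provides a non-transitory computer-readable storage medium storing computer instructions, characterized in that the computer instructions are used to cause a computer to perform any of the methods described above.
The technical solution of this application ensures that navigation audio is played while suitable non-navigation audio is inserted into the gaps between navigation audios, which also preserves the listening experience for non-navigation audio.
Other effects of the above optional implementations are described below with reference to specific embodiments.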
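Brief Description of the Drawings
The drawings are provided for a better understanding of the solution and do not limit this application. In the drawings:
FIG. 1 shows an exemplary system architecture to which embodiments of the present invention can be applied;
FIG. 2 is a flowchart of the main method provided by an embodiment of this application;
FIG. 3 is a flowchart of a method for determining non-navigation audio provided by an embodiment of this application;
FIG. 4 is a structural diagram of the navigation audio playing apparatus provided by an embodiment of this application;
FIG. 5 is an example diagram of the broadcasts along a navigation route provided by an embodiment of this application;
FIG. 6 is a block diagram of an electronic device used to implement embodiments of this application.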
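Detailed Description
Exemplary embodiments of this application are described below with reference to the drawings. They include various details of the embodiments to aid understanding and should be regarded as merely exemplary. Those of ordinary skill in the art should therefore recognize that various changes and modifications can be made to the embodiments described here without departing from the scope and spirit of this application. Likewise, descriptions of well-known functions and structures are omitted below for clarity and conciseness.
FIG. 1 shows an exemplary system architecture to which embodiments of the present invention can be applied. As shown in FIG. 1, the system architecture may include terminal devices 101 and 102, a network 103 and a server 104. The network 103 is the medium that provides communication links between the terminal devices 101 and 102 and the server 104. The network 103 may include various connection types, such as wired links, wireless communication links or fiber-optic cables.
A user may use the terminal devices 101 and 102 to interact with the server 104 through the network 103. Various applications may be installed on the terminal devices 101 and 102, such as map applications, voice interaction applications, web browser applications and communication applications.
The terminal devices 101 and 102 may be various electronic devices, including but not limited to smartphones, tablets, smart speakers and smart wearable devices. The navigation audio playing apparatus provided by the present invention may be deployed and run on the server 104, or on the terminal device 101 or 102. It may be implemented as multiple pieces of software or software modules (for example, to provide distributed services) or as a single piece of software or software module, which is not specifically limited here.
For example, if the navigation audio playing apparatus is deployed and runs on the server 104, it determines the navigation audios and non-navigation audios to be played on the navigation route in the manner provided by embodiments of the present invention and provides them to the terminal device 101 or 102 for playback.
The server 104 may be a single server or a server group composed of multiple servers. It should be understood that the numbers of terminal devices, networks and servers in FIG. 1 are merely illustrative. There may be any number of terminal devices, networks and servers as the implementation requires.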
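FIG. 2 is a flowchart of the main method provided by an embodiment of this application. In this embodiment, navigation audio and non-navigation audio are no longer controlled and played by different applications. Instead, both are controlled and played by the navigation audio playing apparatus, for example by a map application with a navigation function. As shown in FIG. 2, the method may include the following steps:
In 201, the navigation audios to be broadcast on the navigation route and the broadcast positions are determined.
Typically, after a user initiates a route planning request, routes are planned based on the start position, end position and travel mode entered by the user, and the route planning result is returned to the user. The user can select one of the routes for navigation. This application starts from the point where the user selects a route for navigation; the selected route is the navigation route. A navigation route has multiple broadcast positions at which navigation audio is played.
In this application, all navigation audios on the navigation route may be taken as the navigation audios to be broadcast. A broadcast position is the specific geographic location at which a navigation audio is broadcast. For example, a navigation audio about turning or going straight at an intersection is broadcast at a position before the intersection, and the speed limit of a road section is broadcast at the entrance to that section.
To keep the user safe, navigation broadcasts on a route are usually fairly dense, but not every user needs all of them. If the user is familiar with the route, a few key navigation audios are often enough. If the user is unfamiliar with it, more navigation content is needed. As a preferred implementation, therefore, navigation audios whose importance matches the user's familiarity with the route may be selected from the route's navigation audios as the ones to be broadcast, based on that familiarity and on the importance of each navigation audio, so that the broadcasts during navigation match the user's navigation needs.
The user's familiarity with the navigation route may be determined from the number of times the user has previously navigated that route. If that number exceeds a preset count threshold, the user can be considered very familiar with the route.
For example, if user A's navigation route is a familiar commute, the broadcasts focus on turning points, each intersection is announced only once, and traffic camera ("electronic eye") information is not announced. As another example, user B is unfamiliar with the same route, so in addition to the main turning point information the broadcasts include detailed explanations to aid judgment, and traffic camera information is announced.
Besides simply dividing familiarity into familiar and unfamiliar, a finer-grained division may be used: familiarity is divided into multiple levels, and each level corresponds to different types of navigation audio.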
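The familiarity-based selection in step 201 can be illustrated with a minimal Python sketch. The `NavPrompt` type, the integer importance scale and the count threshold of 5 are assumptions made for illustration; the source only states that familiarity is derived from how often the user has navigated the route and that the selected prompts should match it.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class NavPrompt:
    text: str
    importance: int  # hypothetical scale: 2 = key turn, 1 = extra detail, 0 = camera info


def is_familiar(past_navigations: int, count_threshold: int = 5) -> bool:
    # The source only says "exceeds a preset count threshold"; 5 is a placeholder.
    return past_navigations > count_threshold


def select_prompts(prompts: List[NavPrompt], past_navigations: int,
                   count_threshold: int = 5) -> List[NavPrompt]:
    # Familiar users hear only key prompts; unfamiliar users hear everything.
    min_importance = 2 if is_familiar(past_navigations, count_threshold) else 0
    return [p for p in prompts if p.importance >= min_importance]


route = [
    NavPrompt("Turn right ahead", 2),
    NavPrompt("Move into the right lane early", 1),
    NavPrompt("Traffic camera ahead", 0),
]
print([p.text for p in select_prompts(route, past_navigations=12)])
print(len(select_prompts(route, past_navigations=1)))
```

A finer-grained variant would map several familiarity levels to different minimum importance values instead of the two used here.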
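In 202, the corresponding navigation audio is broadcast at each broadcast position, and the non-navigation audio to play in the gap time between broadcast positions is selected according to the gap duration.
In the embodiments of this application, besides broadcasting navigation audio, the navigation apparatus plays non-navigation audio in the gap time between navigation audios. In one implementation, after the navigation audios to be broadcast on the route are determined, the whole sequence of non-navigation audios to play along the route is determined, and playback then follows that sequence. This amounts to fixing the audio sequence in advance. While the user is travelling, however, changes in road conditions, speed changes for personal reasons, stops and so on alter the time at which the user reaches each broadcast position. In other words, the gap duration changes, so the predetermined sequence no longer fits and must be adjusted. Another, preferred implementation may therefore be used: each time a navigation or non-navigation audio is played, the next non-navigation audio to play is determined in real time.
Specifically, when each non-navigation audio to play is determined one by one, the flow shown in FIG. 3 may be executed:
In 301, the user's position at the time the current navigation or non-navigation audio finishes playing is determined.
If this flow is applied to the real-time selection approach above, the (i+1)-th audio to play may be determined while the i-th audio is playing, where an audio may be a navigation audio or a non-navigation audio. In this case, the user's position when the i-th audio finishes can be estimated from the remaining duration of the i-th audio and the user's current speed. Here i is a positive integer.
Alternatively, the (i+1)-th audio may be determined once the i-th audio has finished playing. In this case, the user's position when the i-th audio finishes is simply the user's current position.
If this flow is applied to the predetermined-sequence approach above, the audios to play are determined one by one. For example, if the first audio is a navigation audio, it is taken as the current navigation audio, and it is determined whether the next audio is a navigation or a non-navigation audio and, if it is a non-navigation audio, which one. Once the next audio is determined, it becomes the current audio and the same determination is repeated, and so on. For each current audio, the user's position when it finishes playing can be estimated from the average speed of the user's travel mode and the playback duration of each audio.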
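In 302, the estimated arrival duration to the broadcast position of the next navigation audio is determined from the user's position.
Here the estimated arrival duration to the broadcast position of the next navigation audio (ETA, Estimated Time of Arrival) can be estimated from information such as the distance between the user's position and that broadcast position, the user's speed and the road conditions. Any existing ETA estimation method may be used for this part, which is not detailed here.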
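Steps 301 and 302 can be sketched in Python as follows. This is a deliberately simple constant-speed model, not the ETA service the source leaves unspecified; positions measured in metres along the route and the `congestion_factor` parameter are assumptions made for illustration.

```python
def position_after_playback(current_position_m: float, speed_mps: float,
                            remaining_s: float) -> float:
    # Step 301: where the user will be when the current audio finishes,
    # assuming the current speed is held for the remaining playback time.
    return current_position_m + speed_mps * max(remaining_s, 0.0)


def estimated_arrival_s(position_m: float, next_broadcast_m: float,
                        speed_mps: float, congestion_factor: float = 1.0) -> float:
    # Step 302: time needed to reach the next broadcast position.
    # congestion_factor > 1 stretches the estimate for slow traffic (hypothetical).
    distance_m = max(next_broadcast_m - position_m, 0.0)
    if speed_mps <= 0:
        return float("inf")  # a stopped vehicle never arrives under this model
    return distance_m / speed_mps * congestion_factor


end_position = position_after_playback(1000.0, 10.0, 30.0)
print(end_position)
print(estimated_arrival_s(end_position, 2500.0, 10.0))
```

In a real deployment the second function would be replaced by a call to an ETA service that accounts for live road conditions.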
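In 303, the next non-navigation audio to play is selected according to the estimated arrival duration.
The core of the selection strategy in this step is that the selected non-navigation audio must finish playing within the estimated arrival duration, or that its core content must finish playing within the estimated arrival duration. The core content of a non-navigation audio is the part that conveys its gist: once the user has heard the core content, the user broadly understands what the audio is about. For example, the core content of a news audio is the part that conveys the gist of the news, the core content of a crosstalk (xiangsheng) audio is the part containing its main punchlines, the core content of a song audio is the part containing the song's verse, and the core content of a joke is the part containing its punchline.
Non-navigation audios may be obtained and selected from an audio pool, which may be maintained by the service provider of the map application or provided by the service provider of a cooperating third-party application.
The audio pool contains various types of non-navigation audio, including but not limited to news, novels, music, songs and jokes. Besides the non-navigation audios themselves, the pool also maintains each audio's duration, core content marker and so on. A core content marker identifies the start time and end time of the core content of a non-navigation audio, from which the playback duration of the core content can be determined.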
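One way to represent an audio pool entry with its core content marker is sketched below in Python. The class and field names are hypothetical, and the core content playback duration is taken here as end time minus start time, which is one reading of the description above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PoolAudio:
    title: str
    category: str                          # e.g. "news", "music", "joke"
    duration_s: float                      # full audio duration
    core_start_s: Optional[float] = None   # core content marker: start time
    core_end_s: Optional[float] = None     # core content marker: end time

    def core_duration_s(self) -> Optional[float]:
        # Derived from the marker; None when the audio has no marker.
        if self.core_start_s is None or self.core_end_s is None:
            return None
        return self.core_end_s - self.core_start_s

    def fits_within(self, eta_s: float) -> bool:
        # True if the whole audio, or at least its core content, ends in time.
        core = self.core_duration_s()
        return self.duration_s < eta_s or (core is not None and core < eta_s)


news = PoolAudio("news A", "news", 300.0, core_start_s=0.0, core_end_s=90.0)
song = PoolAudio("song a", "music", 240.0)
print(news.core_duration_s(), news.fits_within(120.0), song.fits_within(120.0))
```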
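In addition to ensuring that playback, or playback of the core content, finishes within the estimated arrival duration, the non-navigation audio may further be selected in light of the user's playback needs. Several preferred selection modes follow:
Mode 1: If the estimated arrival duration is greater than a preset first duration threshold, the next navigation audio is considered far away and a demand-first selection is used. That is, a non-navigation audio that the user wants is selected from those whose audio duration or core content playback duration is less than the estimated arrival duration.
If the estimated arrival duration is greater than or equal to a preset second duration threshold and less than or equal to the first duration threshold, the next navigation audio is considered near and a duration-first selection is used. That is, a non-navigation audio that the user wants is selected from those whose audio duration or core content playback duration is less than and close to the estimated arrival duration, where "close" means that the difference between the estimated arrival duration and the audio duration or core content playback duration is less than the second duration threshold.
If the estimated arrival duration is less than the second duration threshold, the next navigation audio is considered about to play, and no non-navigation audio is inserted.
The first duration threshold is greater than the second duration threshold.
For example, the first duration threshold may be 4 minutes and the second duration threshold 10 seconds. If, after a navigation or non-navigation audio finishes, the estimated arrival duration of the next navigation audio is determined to be 6 minutes, which is greater than 4 minutes, the demand-first selection may be used. Non-navigation audios in the audio source (which contains all kinds of non-navigation audio) whose playback duration exceeds 6 minutes are filtered out, and the audio that best matches the user's needs is selected from the rest.
If, after a navigation or non-navigation audio finishes, the estimated arrival duration of the next navigation audio is determined to be 3 minutes, which lies between 10 seconds and 4 minutes, the duration-first selection is used. Non-navigation audios whose playback duration or core content playback duration is between 2 minutes 50 seconds and 3 minutes are found in the audio source, and the one that matches the user's needs is then determined from among them.
If, after a navigation or non-navigation audio finishes, the estimated arrival duration of the next navigation audio is determined to be less than 10 seconds, no non-navigation audio is selected as the next audio, and the apparatus waits to play the next navigation audio.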
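Mode 1 can be sketched in Python as below, using the example thresholds of 4 minutes and 10 seconds. The `Audio` type and its numeric `demand` score are assumptions standing in for however "the audio the user wants" is actually ranked; the source does not define a scoring scheme.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Audio:
    title: str
    duration_s: float
    core_s: Optional[float] = None  # core content playback duration, if marked
    demand: float = 0.0             # hypothetical user-demand score, higher is better


def fits(audio: Audio, eta_s: float) -> bool:
    return audio.duration_s < eta_s or (audio.core_s is not None and audio.core_s < eta_s)


def is_close(audio: Audio, eta_s: float, margin_s: float) -> bool:
    # "Less than and close": shorter than the ETA by less than the second threshold.
    lengths = [audio.duration_s] + ([audio.core_s] if audio.core_s is not None else [])
    return any(0 < eta_s - length < margin_s for length in lengths)


def select_mode_one(pool: List[Audio], eta_s: float,
                    first_s: float = 240.0, second_s: float = 10.0) -> Optional[Audio]:
    if eta_s < second_s:
        return None                                   # navigation audio is imminent
    if eta_s > first_s:
        candidates = [a for a in pool if fits(a, eta_s)]                 # demand first
    else:
        candidates = [a for a in pool if is_close(a, eta_s, second_s)]   # duration first
    return max(candidates, key=lambda a: a.demand, default=None)


pool = [
    Audio("news A", 300.0, demand=0.9),
    Audio("song a", 175.0, demand=0.5),
    Audio("novel chapter", 420.0, demand=1.0),
    Audio("joke c", 60.0, demand=0.7),
]
for eta in (360.0, 180.0, 5.0):
    choice = select_mode_one(pool, eta)
    print(eta, choice.title if choice else None)
```

With a 6-minute gap the highest-scoring audio that fits is chosen; with a 3-minute gap only audios ending within the last 10 seconds of the gap are considered.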
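Mode 2: If the estimated arrival duration is greater than the preset second duration threshold, the non-navigation audio whose audio duration or core content playback duration is less than and closest to the estimated arrival duration is selected from the non-navigation audios the user wants. If the estimated arrival duration is less than the second duration threshold, no non-navigation audio is selected.
In this mode, demand-first and duration-first are no longer distinguished: as long as the estimated arrival duration is greater than the second duration threshold, the non-navigation audio with the most suitable duration is selected directly from those the user wants. For example, if after a navigation or non-navigation audio finishes the estimated arrival duration of the next navigation audio is determined to be 5 minutes, which is greater than 10 seconds, all non-navigation audios the user wants are determined from the audio source, and the one whose audio duration or core content playback duration is less than and closest to 5 minutes is selected, for example a news item lasting 4 minutes 55 seconds.
Likewise, if after a navigation or non-navigation audio finishes the estimated arrival duration of the next navigation audio is determined to be less than 10 seconds, no non-navigation audio is selected as the next audio, and the apparatus waits to play the next navigation audio.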
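Mode 2 reduces to a closest-fit search over the audios the user wants, sketched here in Python. The tuple layout is an assumption, and because the source does not say what happens when the estimated arrival duration equals the second threshold, this sketch skips insertion in that case.

```python
from typing import List, Optional, Tuple

# (title, full duration in seconds, core content duration in seconds or None)
Audio = Tuple[str, float, Optional[float]]


def best_fit_s(audio: Audio, eta_s: float) -> Optional[float]:
    # Longest of the audio's durations (full or core) that still ends before the ETA.
    _, duration_s, core_s = audio
    fitting = [d for d in (duration_s, core_s) if d is not None and d < eta_s]
    return max(fitting) if fitting else None


def select_mode_two(demanded: List[Audio], eta_s: float,
                    second_s: float = 10.0) -> Optional[Audio]:
    if eta_s <= second_s:   # equality is unspecified in the source; treated as "too short"
        return None
    best, best_len = None, -1.0
    for audio in demanded:
        length = best_fit_s(audio, eta_s)
        if length is not None and length > best_len:
            best, best_len = audio, length
    return best


demanded = [
    ("news item", 295.0, None),
    ("song b", 200.0, None),
    ("crosstalk", 600.0, 280.0),
]
print(select_mode_two(demanded, 300.0))
print(select_mode_two(demanded, 290.0))
print(select_mode_two(demanded, 8.0))
```

For a 5-minute gap this picks the 4 minute 55 second news item, matching the example in the text.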
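Of course, other modes may be used besides the two above. Only two preferred modes are listed here, and others are not enumerated.
In both modes, the non-navigation audio the user wants may be determined from at least one of the destination, environmental conditions, route conditions, the user's driving conditions and user preference information.
The destination mainly refers to the type of the destination, such as company, home, shopping mall or supermarket, transport hub or scenic spot. For example, a user heading home tends to prefer warm music, a user heading to the company tends to prefer news audio, and a user heading to a scenic spot tends to prefer cheerful music.
Environmental conditions may include the current time, the date, whether it is a holiday or a working day, the weather and so on. All of these can affect the audio the user wants. For example, in sunny weather the user tends to prefer upbeat music, and in gloomy weather the user tends to prefer gentle music. As another example, on holidays the user tends to prefer song audio, and on working days news audio.
Route conditions may include the congestion state, road grade and length of the current route. These can also affect the audio the user wants. For example, in congestion the user tends to prefer soothing music or traffic news. As another example, on a flat and very long route the user tends to prefer novel audio.
The user's driving conditions may include driving time, driving mileage, the congestion of road sections already passed and so on. These reflect the user's fatigue to some extent and also affect the audio the user wants. For example, when the user has been driving for a long time or over a long distance, the user needs a lift and is better served by invigorating cheerful music, rock music and the like.
User preference information may include the user's preference tags or preference vectors for audio types, for instance that the user prefers news audio or prefers jazz. It may be determined from tags set by the user, or from the user's behavioral feedback on audio files (for example switching away from an audio file, bookmarking one, or listening to one in full).
At least one of the above factors may be combined to determine the non-navigation audio the user wants.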
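A rule-based way of combining a few of these factors is sketched below in Python. The tendencies encoded in the rules come from the examples above, but the function signature, the category labels and the priority order (fatigue, then congestion, then destination, then stored preferences) are assumptions; the source does not specify how the factors are weighted.

```python
from typing import List

# Example tendencies from the description; the dictionary keys are hypothetical labels.
DESTINATION_HINT = {
    "home": "warm music",
    "company": "news",
    "scenic spot": "cheerful music",
}


def preferred_categories(destination_type: str, congested: bool,
                         long_drive: bool, preference_tags: List[str]) -> List[str]:
    ranked: List[str] = []
    if long_drive:
        ranked.append("cheerful music")   # a tired driver needs a lift
    if congested:
        ranked.append("soothing music")   # congestion favours calming audio
    hint = DESTINATION_HINT.get(destination_type)
    if hint:
        ranked.append(hint)
    ranked.extend(preference_tags)        # the user's own tags come last
    # Drop duplicates while keeping the first occurrence of each category.
    return list(dict.fromkeys(ranked))


print(preferred_categories("company", False, False, ["news", "jazz"]))
print(preferred_categories("scenic spot", True, True, []))
```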
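During the audio playback described above, the user may be caught off guard by a switch from non-navigation audio to navigation audio and fail to hear the navigation audio clearly. In this application, therefore, a transition cue may be played between non-navigation audio and navigation audio: when playback switches from non-navigation audio to navigation audio, a cue sound is added to prepare the user, so that the user does not miss a turn, commit a traffic violation and so on.
The transition cue may be, for example, a short beep or a spoken prompt. Its specific form is not limited here.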
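The cue insertion can be sketched in Python as a pass over a playback sequence. The `(kind, title)` tuple representation and the `"cue"` kind are assumptions made for illustration.

```python
from typing import List, Tuple

Item = Tuple[str, str]  # (kind, title), where kind is "nav", "non_nav" or "cue"


def with_transition_cues(sequence: List[Item], cue_title: str = "beep") -> List[Item]:
    result: List[Item] = []
    for index, (kind, title) in enumerate(sequence):
        # Insert a cue only when switching from non-navigation to navigation audio.
        if kind == "nav" and index > 0 and sequence[index - 1][0] == "non_nav":
            result.append(("cue", cue_title))
        result.append((kind, title))
    return result


playlist = [
    ("non_nav", "news A"),
    ("nav", "Keep right, enter the expressway"),
    ("nav", "Keep left ahead"),
    ("non_nav", "music b"),
]
for item in with_transition_cues(playlist):
    print(item)
```

Back-to-back navigation audios get no cue between them, since the user is already attending to navigation at that point.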
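The method provided by this application has been described in detail above. The apparatus provided by this application is described in detail below with reference to an embodiment.
FIG. 4 is a structural diagram of the navigation audio playing apparatus provided by an embodiment of this application. The apparatus may be implemented on the server side, for example as a server-side application or as a functional unit such as a plug-in or software development kit (SDK) in a server-side application. Alternatively, if the terminal device has enough computing power, the apparatus may be implemented on the terminal device side. As shown in FIG. 4, the apparatus may include a navigation determining unit 00 and a broadcast processing unit 10. The main functions of the units are as follows:
The navigation determining unit 00 is responsible for determining the navigation audios to be broadcast on the navigation route and the broadcast positions.
Specifically, the navigation determining unit 00 may, based on the user's familiarity with the navigation route and the importance of each navigation audio, select from the route's navigation audios those whose importance matches the familiarity as the navigation audios to be broadcast.
The user's familiarity with the navigation route may be determined from the number of times the user has previously navigated that route. If that number exceeds a preset count threshold, the user can be considered very familiar with the route.
The broadcast processing unit 10 is responsible for broadcasting the corresponding navigation audio at each broadcast position and for selecting the non-navigation audio to play in the gap time between broadcast positions according to the gap duration.
The broadcast processing unit 10 may specifically include a scene judging subunit 11 and a content recommending subunit 12.
The scene judging subunit 11 is responsible for determining the user's position at the time the current navigation or non-navigation audio finishes playing, and for determining, from the user's position, the estimated arrival duration to the broadcast position of the next navigation audio.
Here the estimated arrival duration to the broadcast position of the next navigation audio can be estimated from information such as the distance between the user's position and that broadcast position, the user's speed and the road conditions. The scene judging subunit 11 may pass the user's position and the broadcast position of the next navigation audio to an ETA service by calling the ETA service interface; the ETA service estimates the arrival duration and returns it to the scene judging subunit 11.
The content recommending subunit 12 is responsible for selecting the next non-navigation audio to play according to the estimated arrival duration.
Specifically, the content recommending subunit 12 may select the next non-navigation audio to play in, but not limited to, the following modes.
Mode 1: If the estimated arrival duration is greater than a preset first duration threshold, the next navigation audio is considered far away and a demand-first selection is used. That is, a non-navigation audio that the user wants is selected from those whose audio duration or core content playback duration is less than the estimated arrival duration;
If the estimated arrival duration is greater than or equal to a preset second duration threshold and less than or equal to the first duration threshold, the next navigation audio is considered near and a duration-first selection is used. That is, a non-navigation audio that the user wants is selected from those whose audio duration or core content playback duration is less than and close to the estimated arrival duration, where "close" means that the difference between the estimated arrival duration and the audio duration or core content playback duration is less than the second duration threshold;
If the estimated arrival duration is less than the second duration threshold, the next navigation audio is considered about to play, and no non-navigation audio is selected. The first duration threshold is greater than the second duration threshold.
Mode 2: If the estimated arrival duration is greater than the preset second duration threshold, the non-navigation audio whose audio duration or core content playback duration is less than and closest to the estimated arrival duration is selected from the non-navigation audios the user wants.
If the estimated arrival duration is less than the second duration threshold, no non-navigation audio is selected.
The content recommending subunit 12 may obtain non-navigation audios from an audio pool for selection. The audio pool may be maintained by the service provider of the map application or provided by the service provider of a cooperating third-party application.
The audio pool contains various types of non-navigation audio, including but not limited to news, novels, music, songs and jokes. Besides the non-navigation audios themselves, the pool also maintains each audio's duration, core content marker and so on. A core content marker identifies the start time and end time of the core content of a non-navigation audio, from which the playback duration of the core content can be determined.
When determining the non-navigation audio the user wants, the content recommending subunit 12 may do so based on at least one of the destination, environmental conditions, route conditions, the user's driving conditions and user preference information.
Furthermore, the broadcast processing unit 10 may play a transition cue between non-navigation audio and navigation audio to prepare the user, reminding the user to listen to the navigation audio about to be played so that the user does not miss a turn, commit a traffic violation and so on.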
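A concrete example follows:
As shown in FIG. 5, user A is driving on navigation route 1, which is user A's commute from home to the company. Because user A is very familiar with this route, the broadcasts focus on turning points, each intersection is announced only once, and traffic camera information is not announced. The user's preferred audio types are mainly news, music and jokes.
After the user sets off and is driving on a long straight section, news A is recommended first, and the playback duration of news A is less than the user's estimated arrival duration to broadcast position 1.
After news A finishes, the user's estimated arrival duration to broadcast position 1 is less than 4 minutes, so music a, which the user is interested in, is played to fill the time.
After music a finishes, the user is very close to broadcast position 1 and the estimated arrival duration is less than 10 seconds. No other non-navigation audio is inserted. The transition cue is played, and playback switches to the navigation audio for broadcast position 1: "Keep right up the ramp, enter the expressway, toward G6".
After turning onto G6, the user enters a long straight stretch, and news B/C/D/E/F are recommended and played in sequence.
When news F finishes, the user's estimated arrival duration to broadcast position 2 is less than 4 minutes, so music b, which the user is interested in, is played to fill the time.
After music b finishes, the estimated arrival duration to broadcast position 2 is less than 10 seconds, and no other non-navigation audio is inserted. After the transition cue, the navigation audio for broadcast position 2 is played: "Keep left ahead, enter the North Fifth Ring Road".
After the user turns onto the interchange for the Fifth Ring Road, news G starts playing. When news G finishes, the user's estimated arrival duration to broadcast position 3 is less than 4 minutes. Because the user has been driving for a long time and is rather tired, joke c, which fits the situation, is played to perk the user up.
After the transition cue, the navigation audio for broadcast position 3 is played: "Keep right ahead, enter the expressway, toward G7".
After the user turns onto G7, news H/I/J/K/L play continuously. After news L finishes, the time to broadcast position 4 is very short, so the navigation audio for broadcast position 4 is broadcast directly: "Keep right ahead, exit the expressway, toward the Shangdi West Road exit".
After leaving the expressway, the user is about to enter an extremely slow-moving section. To keep the user from being distracted and causing an accident, calming music d/e/f is played. Then, after the transition cue, "Turn left ahead" is broadcast at broadcast position 5, and this continues until the user reaches the destination.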
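According to embodiments of this application, this application also provides an electronic device and a readable storage medium.
FIG. 6 is a block diagram of an electronic device for the navigation audio playing method according to an embodiment of this application. The electronic device is intended to represent various forms of digital computers, such as laptop computers, desktop computers, workstations, personal digital assistants, servers, blade servers, mainframe computers and other suitable computers. The electronic device may also represent various forms of mobile devices, such as personal digital assistants, cellular phones, smartphones, wearable devices and other similar computing devices. The components shown here, their connections and relationships, and their functions are examples only and are not intended to limit the implementations of this application described and/or claimed here.
As shown in FIG. 6, the electronic device includes one or more processors 601, a memory 602, and interfaces for connecting the components, including high-speed interfaces and low-speed interfaces. The components are interconnected by different buses and may be mounted on a common motherboard or in other ways as needed. The processor can process instructions executed within the electronic device, including instructions stored in or on the memory to display graphical information for a GUI on an external input/output device (such as a display device coupled to an interface). In other implementations, multiple processors and/or multiple buses may be used together with multiple memories, if needed. Likewise, multiple electronic devices may be connected, each providing part of the necessary operations (for example, as a server array, a group of blade servers or a multiprocessor system). FIG. 6 takes one processor 601 as an example.
The memory 602 is the non-transitory computer-readable storage medium provided by this application. The memory stores instructions executable by at least one processor, so that the at least one processor performs the navigation audio playing method provided by this application. The non-transitory computer-readable storage medium of this application stores computer instructions, which are used to cause a computer to perform the navigation audio playing method provided by this application.
As a non-transitory computer-readable storage medium, the memory 602 can be used to store non-transitory software programs, non-transitory computer-executable programs and modules, such as the program instructions/modules corresponding to the navigation audio playing method in the embodiments of this application. By running the non-transitory software programs, instructions and modules stored in the memory 602, the processor 601 executes the various functional applications and data processing of the server, that is, it implements the navigation audio playing method of the method embodiments above.
The memory 602 may include a program storage area and a data storage area. The program storage area may store an operating system and the application program required by at least one function; the data storage area may store data created through use of the electronic device. In addition, the memory 602 may include high-speed random access memory and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device or other non-transitory solid-state storage device. In some embodiments, the memory 602 may optionally include memories located remotely from the processor 601, and these remote memories may be connected to the electronic device through a network. Examples of such networks include, but are not limited to, the Internet, corporate intranets, local area networks, mobile communication networks and combinations thereof.
The electronic device may further include an input device 603 and an output device 604. The processor 601, the memory 602, the input device 603 and the output device 604 may be connected by a bus or in other ways; FIG. 6 takes connection by a bus as an example.
The input device 603 can receive entered numeric or character information and generate key signal input related to the user settings and function control of the electronic device. Examples include a touch screen, keypad, mouse, trackpad, touchpad, pointing stick, one or more mouse buttons, trackball and joystick. The output device 604 may include a display device, an auxiliary lighting device (for example, an LED), a haptic feedback device (for example, a vibration motor) and so on. The display device may include, but is not limited to, a liquid crystal display (LCD), a light-emitting diode (LED) display and a plasma display. In some implementations, the display device may be a touch screen.
Various implementations of the systems and techniques described here can be realized in digital electronic circuitry, integrated circuit systems, application-specific integrated circuits (ASICs), computer hardware, firmware, software and/or combinations thereof. These implementations may include implementation in one or more computer programs that can be executed and/or interpreted on a programmable system including at least one programmable processor. The programmable processor may be a special-purpose or general-purpose programmable processor that can receive data and instructions from a storage system, at least one input device and at least one output device, and transmit data and instructions to the storage system, the at least one input device and the at least one output device.
These computer programs (also called programs, software, software applications or code) include machine instructions for a programmable processor and can be implemented in high-level procedural and/or object-oriented programming languages and/or in assembly/machine language. As used here, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, device and/or apparatus (for example, magnetic disks, optical disks, memory or programmable logic devices (PLDs)) used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as machine-readable signals. The term "machine-readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.
To provide interaction with a user, the systems and techniques described here can be implemented on a computer that has a display device for displaying information to the user (for example, a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) and a keyboard and pointing device (for example, a mouse or trackball) through which the user can provide input to the computer. Other kinds of devices can also be used to provide interaction with the user. For example, the feedback provided to the user can be any form of sensory feedback (for example, visual, auditory or haptic feedback), and input from the user can be received in any form (including acoustic, voice or tactile input).
The systems and techniques described here can be implemented in a computing system that includes back-end components (for example, a data server), or one that includes middleware components (for example, an application server), or one that includes front-end components (for example, a user computer with a graphical user interface or web browser through which the user can interact with implementations of the systems and techniques described here), or one that includes any combination of such back-end, middleware or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (for example, a communication network). Examples of communication networks include local area networks (LANs), wide area networks (WANs) and the Internet.
A computer system can include clients and servers. A client and a server are generally remote from each other and usually interact through a communication network. The client-server relationship arises from computer programs that run on the respective computers and have a client-server relationship with each other.
It should be understood that steps may be reordered, added or deleted using the various forms of flow shown above. For example, the steps described in this application may be performed in parallel, sequentially or in a different order, as long as the desired result of the technical solution disclosed in this application can be achieved; no limitation is imposed here.
The specific implementations above do not limit the scope of protection of this application. Those skilled in the art should understand that various modifications, combinations, sub-combinations and substitutions can be made according to design requirements and other factors. Any modification, equivalent replacement or improvement made within the spirit and principles of this application shall fall within the scope of protection of this application.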

Claims (18)

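  1. A method for playing navigation audio, including:
    determining the navigation audios to be broadcast on a navigation route and the broadcast positions;
    broadcasting the corresponding navigation audio at each broadcast position, and selecting the non-navigation audio to play in the gap time between the broadcast positions according to the gap duration.
  2. The method according to claim 1, wherein determining the navigation audios to be broadcast on the navigation route includes:
    based on the user's familiarity with the navigation route and the importance of each navigation audio, selecting from the navigation audios of the navigation route those whose importance matches the familiarity as the navigation audios to be broadcast.
  3. The method according to claim 1, wherein selecting the non-navigation audio to play in the gap time between the broadcast positions according to the gap duration includes:
    determining the user's position at the time the current navigation audio or non-navigation audio finishes playing;
    determining, from the user's position, the estimated arrival duration to the broadcast position of the next navigation audio;
    selecting the next non-navigation audio to play according to the estimated arrival duration.
  4. The method according to claim 3, wherein selecting the next non-navigation audio to play according to the estimated arrival duration includes:
    if the estimated arrival duration is greater than a preset first duration threshold, selecting a non-navigation audio that the user wants from the non-navigation audios whose audio duration or core content playback duration is less than the estimated arrival duration;
    if the estimated arrival duration is greater than or equal to a preset second duration threshold and less than or equal to the first duration threshold, selecting a non-navigation audio that the user wants from the non-navigation audios whose audio duration or core content playback duration is less than and close to the estimated arrival duration, where "close" means that the difference between the estimated arrival duration and the audio duration or core content playback duration is less than the second duration threshold;
    the first duration threshold being greater than the second duration threshold.
  5. The method according to claim 3, wherein selecting the next non-navigation audio to play according to the estimated arrival duration includes:
    if the estimated arrival duration is greater than a preset second duration threshold, selecting, from the non-navigation audios the user wants, the non-navigation audio whose audio duration or core content playback duration is less than and closest to the estimated arrival duration.
  6. The method according to claim 4 or 5, wherein selecting the next non-navigation audio to play according to the estimated arrival duration further includes:
    if the estimated arrival duration is less than the second duration threshold, selecting no non-navigation audio.
  7. The method according to claim 4 or 5, wherein the non-navigation audio the user wants is determined from at least one of the destination, environmental conditions, route conditions, the user's driving conditions and user preference information.
  8. The method according to claim 1, further including:
    playing a transition cue between non-navigation audio and navigation audio.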
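  9. An apparatus for playing navigation audio, including:
    a navigation determining unit, configured to determine the navigation audios to be broadcast on a navigation route and the broadcast positions;
    a broadcast processing unit, configured to broadcast the corresponding navigation audio at each broadcast position, and to select the non-navigation audio to play in the gap time between the broadcast positions according to the gap duration.
  10. The apparatus according to claim 9, wherein the navigation determining unit is specifically configured to, based on the user's familiarity with the navigation route and the importance of each navigation audio, select from the navigation audios of the navigation route those whose importance matches the familiarity as the navigation audios to be broadcast.
  11. The apparatus according to claim 9, wherein the broadcast processing unit specifically includes:
    a scene judging subunit, configured to determine the user's position at the time the current navigation audio or non-navigation audio finishes playing, and to determine, from the user's position, the estimated arrival duration to the broadcast position of the next navigation audio;
    a content recommending subunit, configured to select the next non-navigation audio to play according to the estimated arrival duration.
  12. The apparatus according to claim 11, wherein the content recommending subunit is specifically configured to:
    if the estimated arrival duration is greater than a preset first duration threshold, select a non-navigation audio that the user wants from the non-navigation audios whose audio duration or core content playback duration is less than the estimated arrival duration;
    if the estimated arrival duration is greater than or equal to a preset second duration threshold and less than or equal to the first duration threshold, select a non-navigation audio that the user wants from the non-navigation audios whose audio duration or core content playback duration is less than and close to the estimated arrival duration, where "close" means that the difference between the estimated arrival duration and the audio duration or core content playback duration is less than the second duration threshold;
    the first duration threshold being greater than the second duration threshold.
  13. The apparatus according to claim 11, wherein the content recommending subunit is specifically configured to:
    if the estimated arrival duration is greater than a preset second duration threshold, select, from the non-navigation audios the user wants, the non-navigation audio whose audio duration or core content playback duration is less than and closest to the estimated arrival duration.
  14. The apparatus according to claim 12 or 13, wherein the content recommending subunit is further configured to:
    if the estimated arrival duration is less than the second duration threshold, select no non-navigation audio.
  15. The apparatus according to claim 12 or 13, wherein the content recommending subunit is further configured to determine the non-navigation audio the user wants from at least one of the destination, environmental conditions, route conditions, the user's driving conditions and user preference information.
  16. The apparatus according to claim 9, wherein the broadcast processing unit is further configured to play a transition cue between non-navigation audio and navigation audio.
  17. An electronic device, characterized by including:
    at least one processor; and
    a memory communicatively connected to the at least one processor; wherein
    the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor to enable the at least one processor to perform the method according to any one of claims 1-8.
  18. A non-transitory computer-readable storage medium storing computer instructions, characterized in that the computer instructions are used to cause a computer to perform the method according to any one of claims 1-8.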
PCT/CN2020/131319 2020-05-22 2020-11-25 Method, apparatus, device and computer storage medium for playing navigation audio WO2021232726A1 (zh)

Priority Applications (5)

Application Number Priority Date Filing Date Title
KR1020217027814A KR20210114537A (ko) 2020-05-22 2020-11-25 Navigation audio broadcasting method, apparatus, device and computer storage medium
SG11202107063XA SG11202107063XA (en) 2020-05-22 2020-11-25 Method, apparatus, device for playing navigation audios, and computer storage medium
US17/419,013 US20220308826A1 (en) 2020-05-22 2020-11-25 Method, apparatus, device for playing navigation audios
EP20900748.3A EP3940341A4 (en) 2020-05-22 2020-11-25 METHOD, APPARATUS AND DEVICE FOR AUDIO PLAYBACK OF NAVIGATION, AND COMPUTER RECORDING MEDIUM
JP2021538075A JP7383026B2 (ja) 2020-05-22 2020-11-25 Navigation audio playback method, apparatus, device and computer storage medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010439893.5A CN111735472A (zh) 2020-05-22 2020-05-22 Method, apparatus, device and computer storage medium for playing navigation audio
CN202010439893.5 2020-05-22

Publications (1)

Publication Number Publication Date
WO2021232726A1 (zh) 2021-11-25

Family

ID=72647558

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/131319 WO2021232726A1 (zh) 2020-05-22 2020-11-25 Method, apparatus, device and computer storage medium for playing navigation audio

Country Status (2)

Country Link
CN (1) CN111735472A (zh)
WO (1) WO2021232726A1 (zh)

Also Published As

Publication number Publication date
CN111735472A (zh) 2020-10-02

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 2021538075

Country of ref document: JP

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2020900748

Country of ref document: EP

Effective date: 20210625

ENP Entry into the national phase

Ref document number: 20217027814

Country of ref document: KR

Kind code of ref document: A

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20900748

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE