WO2009129775A1 - Method for operating an electronic assistance system

Info

Publication number: WO2009129775A1
Application number: PCT/DE2009/000502
Authority: WO
Inventors: Jochen Katzer
Original assignee: Navigon Ag
Priority date: 2008-04-21
Filing date: 2009-04-20
Publication date: 2009-10-29
Also published as: DE102008019967A1

Abstract

The invention relates to a method for operating an electronic assistance system having speech output, comprising the following steps: a) calculating at least one instruction for the user, wherein the instruction contains at least one piece of numerical data, and wherein the numerical data is defined by a multi-digit sequence of digits; b) acoustically outputting the instruction on an output device. The context of the numerical data in the instruction is analyzed and, depending on the result of the context analysis, the numerical data is output during the acoustic output of the instruction either a) as a contiguous number word, or b) as a word sequence comprising a plurality of digit words, or c) as a combination of at least one contiguous number word and at least one word sequence comprising at least one digit word.

Description

Method for operating an electronic assistance system

The invention relates to a method for operating an electronic assistance system with voice output according to the preamble of claim 1.

In many assistance systems, such as navigation systems, instructions are commonly calculated and output acoustically. An instruction can be, for example, a maneuvering instruction that tells the driver of a vehicle which maneuver is necessary to follow a route. In many cases, the instructions to be output to the user also include numbers, for example to indicate a distance or to specify a street number or the like. In generic assistance systems, the instruction is calculated programmatically and then output audibly on an acoustic output device, such as a loudspeaker. For the acoustic output of number sequences, known assistance systems use either a direct pronunciation as one contiguous number word or a composite pronunciation as a word sequence of several digit words. If, for example, the number "2156" is to be output acoustically by a known assistance system, this can be done either in direct pronunciation as the number word "two thousand one hundred and fifty-six" or in composite pronunciation as the digit words "two-one-five-six".

However, outputting numbers either as a contiguous number word or as a word sequence of several digit words does not take into account that the way numbers are spoken can depend strongly on the context in question. In addition, whether the user understands the acoustic output of a number well also depends on the language used, since language habits differ greatly between languages. If, for example, a telephone number is to be output as additional information for a specific destination, outputting it as a contiguous number word is generally completely incomprehensible; the user expects the telephone number to be output as a word sequence of several digit words. If the assistance system outputs a distance, for example the distance to the next maneuver, the user generally expects the distance as a contiguous number word, whereas a word sequence of several digit words would remain completely incomprehensible.

Starting from this prior art, the object of the invention is therefore to propose a new method for operating an electronic assistance system that makes the voice output of numbers more intuitively understandable.

This object is achieved by a method according to the teaching of claim 1.

Advantageous embodiments of the invention are the subject of the dependent claims.

The method according to the invention is based on the idea that the context of the number in the instruction is analyzed. Depending on the result of the context analysis, the number is then output in the acoustic output of the instruction either as a contiguous number word, or as a word sequence of several digit words, or as a combination of at least one contiguous number word and at least one word sequence with at least one digit word. In other words, the context analysis determines the manner of speaking in which the number should be output in order to increase intelligibility. If, for example, the context analysis determines that the number is a telephone number, the telephone number can be output as a word sequence of several digit words. If the context analysis shows that the number is a distance, for example the distance to the maneuver, the number can be output as a contiguous number word. In addition, output as a combination of at least one contiguous number word and at least one word sequence with at least one digit word is possible. Depending on the particular context, the acoustic output of a number can thus also be a hybrid of contiguous number words and word sequences of several digit words.
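The context-dependent choice described above can be illustrated with a minimal sketch. The names `Context`, `number_word`, `digit_words` and `render_number` are hypothetical and do not come from the patent; the sketch assumes English output, values up to 9999 and only two contexts.

```python
from enum import Enum

ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def number_word(n: int) -> str:
    """Spell 0..9999 as one contiguous English number word."""
    if not 0 <= n <= 9999:
        raise ValueError("sketch only covers 0..9999")
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] + ("-" + ONES[ones] if ones else "")
    if n < 1000:
        hundreds, rest = divmod(n, 100)
        return ONES[hundreds] + " hundred" + (" and " + number_word(rest) if rest else "")
    thousands, rest = divmod(n, 1000)
    if rest == 0:
        return ONES[thousands] + " thousand"
    separator = " " if rest >= 100 else " and "
    return ONES[thousands] + " thousand" + separator + number_word(rest)


def digit_words(digits: str) -> str:
    """Spell a digit string as a sequence of single digit words."""
    return "-".join(ONES[int(d)] for d in digits)


class Context(Enum):  # hypothetical, only two of the contexts named in the text
    DISTANCE = "distance"
    PHONE = "phone"


def render_number(digits: str, context: Context) -> str:
    """Pick the manner of speaking from the context of the number."""
    if context is Context.PHONE:
        return digit_words(digits)
    return number_word(int(digits))
```

With this sketch, `render_number("2156", Context.DISTANCE)` yields the contiguous number word and `render_number("2156", Context.PHONE)` yields the digit-word sequence, mirroring the two pronunciations of "2156" discussed in the description.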

The language preset for the speech output has proved to be an outstandingly important analysis parameter in the context analysis. The way in which numbers are spoken varies greatly with the habits of the different languages. If, for example, the number of a numbered road is output acoustically in German, for example in the instruction "Please turn at the next intersection onto the B 417", the number "417" is output as the contiguous number word "four hundred and seventeen", since this manner of speaking corresponds to German usage. If, on the other hand, the number "417" is output in English, for example in the maneuver announcement "Turn right on Highway 417", the road number "Highway 417" is output as "Highway four-seventeen", in accordance with English usage for road numbers. This example shows that, in German usage, road numbers are output as contiguous number words, whereas English usage calls for a combined form consisting of a digit word and a number word.

Finally, any number of analysis parameters can be considered in the context analysis. Of particular importance is the analysis of whether the number concerns the indication of a distance, for example the distance to a maneuver; the indication of a numbered road, for example "B 417"; the indication of a number in a street name, for example "Street of the 17th of June"; the indication of a house number; the indication of a zip code; the indication of a telephone number; the indication of opening hours; the indication of passable times; or the indication of evaluation characteristics, for example restaurant stars. Depending on the particular context, and in particular taking into account what is customary in the language set in each case, the output of the number is to be adapted accordingly according to the invention.

Of particular importance for the customary output of a number is the digit "0". If a number contains the digit "0", the customary acoustic output of this number can vary greatly, especially depending on the preset language. If the number contains a "0", the acoustic instruction should therefore be output differently depending on the position of the "0" in the number and/or depending on the context of the number and/or depending on the preset language.

How the acoustic output of the instruction is generated is basically arbitrary. According to a first variant of the method, the instruction can be assembled word by word from a plurality of word elements stored in a memory. In other words, a plurality of acoustic word elements is stored in the memory. For example, corresponding audio files may be available for all numbers from 0 to 99, with the acoustic instruction then being composed of these audio files. Alternatively, the acoustic output and the necessary spoken word elements can also be synthesized in a speech synthesis module using known text-to-speech (TTS) technology.
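The word-based variant can be sketched as follows, assuming one stored audio clip per number from 0 to 99 as in the example above. The function name `clips_for_groups` and the file naming scheme `num_NN.wav` are hypothetical; the digit groups are assumed to have been chosen already by the context analysis.

```python
from typing import List


def clips_for_groups(groups: List[str]) -> List[str]:
    """Map each digit group (one or two digits) to the name of a stored clip.

    Assumption: the memory holds one clip per value 0..99, named num_00.wav
    to num_99.wav. Longer groups would need further clips (for example a
    clip for "hundred"), which this sketch does not model.
    """
    clips = []
    for group in groups:
        if len(group) not in (1, 2) or not group.isdecimal():
            raise ValueError(f"no stored clip for group {group!r}")
        # int() drops a leading zero, so "07" would select the clip for 7.
        clips.append(f"num_{int(group):02d}.wav")
    return clips


# "four-seventeen" is assembled from the clips for 4 and 17.
playlist = clips_for_groups(["4", "17"])
```

The resulting list of clip names would then be played back in order by the output device.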

Which function the electronic assistance system performs is basically arbitrary. The linguistically appropriate acoustic output of numbers is of great importance in navigation systems, however, since a large number of numbers must be output there in different contexts and in different languages.

When the method according to the invention is carried out on a navigation system, it is furthermore of utmost importance that the maneuvering instructions of the navigation system reproduce numbers correctly for their context. The method is preferably incorporated into the normal process flow of a navigation system in such a way that, when a maneuver instruction is prepared, the text to be output, or tokens that represent the individual components of the output text, are passed in machine-readable form from the maneuver instruction creation unit to the maneuver instruction output unit. In this case, the maneuver instruction creation unit supplements the machine-readable form of the maneuver to be output with further parameters. These parameters include at least the language of the instruction text and the context of each number sequence contained (e.g., whether it is a distance, a street indication, etc.). Optionally, the country in which the output is made, as well as other user-specific preference information, can also be transferred. As is customary in generic systems, a position identifier is also provided as a parameter, which allows the maneuver instruction output unit to output the instruction at the correct time before the maneuver. With the parameters passed, the maneuver instruction output unit can then output the maneuver accordingly.
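One possible shape for such a machine-readable hand-over is sketched below. All class and field names (`NumberToken`, `ManeuverMessage`, `position_id` and so on) are hypothetical and chosen only for illustration; the patent names the kinds of parameters but not a concrete data format.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class NumberToken:
    digits: str   # the digit sequence exactly as it appears in the text
    context: str  # e.g. "distance" or "road_number" (hypothetical labels)


@dataclass
class ManeuverMessage:
    text_tokens: List[str]         # components of the output text
    numbers: List[NumberToken]     # one entry per number sequence contained
    language: str                  # language of the instruction text
    position_id: str               # lets the output unit speak at the right time
    country: Optional[str] = None  # optional: country of the output
    preferences: Optional[dict] = None  # optional: user-specific preferences


# Message the creation unit could pass to the output unit.
message = ManeuverMessage(
    text_tokens=["Turn right on Highway", "{0}"],
    numbers=[NumberToken(digits="417", context="road_number")],
    language="en",
    position_id="maneuver-0001",
)
```

The output unit would read `language` and each `NumberToken.context` to decide how the digits are spoken; the placeholder `{0}` is an assumed convention for marking where the first number belongs in the text.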

Alternatively, the maneuver instruction creation unit could itself carry out the method according to the invention and, as a result, generate a coding of the maneuvering instruction in machine-readable form in which the numbers are coded for the maneuver instruction output unit in such a way that no further knowledge of the context is necessary and the acoustic output can be reproduced correctly in the desired form.

Various aspects of the invention are shown in the drawing and are explained below by way of example.

In the drawing:

FIG. 1 shows a table for determining the number output format as a function of the context analysis.

FIG. 1 shows a possibility for determining the speech output format of numbers as a function of a context analysis.

Column 1 determines the respective input format of a number. It can be a one-digit, two-digit, three-digit or four-digit number, with exactly one row being determined by whether the number contains a "0" and at which position the "0" occurs. If, for example, the number "230" is to be output, this number is assigned to row 5 of the table with the input format XX0. If, on the other hand, the number "4321" is to be output, the row to be selected follows from the input format XXXX.
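Deriving the input format from a number can be sketched in a few lines. The function name `input_format` is hypothetical; the sketch writes each zero position as the character `0` and every other digit as `X`, and it accepts only the one- to four-digit numbers mentioned above.

```python
def input_format(digits: str) -> str:
    """Return the input format of a digit string, e.g. "230" -> "XX0"."""
    if not (digits.isascii() and digits.isdigit() and 1 <= len(digits) <= 4):
        raise ValueError("expected one to four digits")
    # Keep zeros visible, replace every other digit by the placeholder X.
    return "".join("0" if d == "0" else "X" for d in digits)
```

The returned string can then serve as the row key of the table.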

After the correct row has been determined from the input format of the number to be output, a distinction is made as to whether the number is to be assembled word by word from stored audio segments or synthesized in a speech synthesis module. Columns 2 to 5 are selected for word-based speech output, and columns 6 to 9 for TTS-based speech synthesis. In the following, the further context analysis is explained on the basis of word-based speech output, as determined by columns 2 to 5. TTS-based speech synthesis proceeds accordingly.

If it has been determined that the speech output is to be word-based, it is then analyzed whether the number is to be spoken in German or in English. Columns 2 and 3 are selected for German-language output, columns 4 and 5 for English-language output. Finally, a content-related context analysis is required, which checks whether the speech output concerns the indication of a distance, for example the distance to the next maneuver, or the indication of a road number. The respective speech output format is found at the intersection of the row determined by the input format of the number and the column determined by the content context, the language and the type of speech generation. The square brackets in the speech output format each enclose one contiguous number word or digit word; several bracketed groups in a row form a word sequence.
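The table lookup can be sketched as a dictionary keyed by row and column criteria. The full content of FIG. 1 is not reproduced in the text, so the entries below are limited to the three-digit German and English examples given in the description; the key labels (`"word"`, `"de"`, `"en"`, `"distance"`, `"road"`) and the names `FORMAT_TABLE` and `output_format` are hypothetical.

```python
from typing import Dict, List, Tuple

# Key: (input format, speech generation, language, context).
# Value: sizes of the digit groups; each group is spoken as one number word.
FORMAT_TABLE: Dict[Tuple[str, str, str, str], List[int]] = {
    ("XXX", "word", "de", "distance"): [3],
    ("XXX", "word", "de", "road"): [3],
    ("XXX", "word", "en", "distance"): [3],
    ("XXX", "word", "en", "road"): [1, 2],
}


def output_format(digits: str, mode: str, language: str, context: str) -> str:
    """Look up the speech output format and return it in bracket notation."""
    fmt = "".join("0" if d == "0" else "X" for d in digits)  # row of the table
    sizes = FORMAT_TABLE[(fmt, mode, language, context)]     # row x column
    groups, start = [], 0
    for size in sizes:
        groups.append(digits[start:start + size])
        start += size
    return "".join(f"[{group}]" for group in groups)
```

For example, `output_format("417", "word", "en", "road")` gives `[4][17]`, while the same number as an English distance or in German gives `[417]`. A combination missing from the table raises `KeyError` in this sketch.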

As can be seen from the table, German usage does not distinguish between the pronunciation of road numbers and distances. The number "217", for example, is output as a contiguous number word both when outputting a distance, for example in the instruction "Please follow the road for two hundred and seventeen km", and when indicating a road number, for example in the output "Turn right onto the B two hundred and seventeen".

In English usage, on the other hand, there is a big difference between distances and road numbers. In a distance specification, for example in the output "Please follow the road for four hundred seventeen miles", the distance is output as a contiguous number word. For a road number, however, the number is output as "four-seventeen".

In particular, taking the digit "0" into account is of crucial importance for the linguistic acceptance of the acoustic output. For example, for the instructions "... on Highway 4620" and "... on Highway 4602", the acoustic output is "Highway four-six-twenty" for the first and "Highway four-six-zero-two" for the second.
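The English road-number examples can be reproduced with a small sketch. The rule used here is an assumption inferred only from the three examples in the description (417, 4620, 4602) and is not stated in the patent: leading digits are spoken singly, and the last two digits are spoken as one number word unless the tens digit is zero. The function names are hypothetical.

```python
ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def two_digit_word(n: int) -> str:
    """Spell 10..99 as one English number word."""
    if n < 20:
        return ONES[n]
    tens, ones = divmod(n, 10)
    return TENS[tens] + ("-" + ONES[ones] if ones else "")


def speak_road_number_en(digits: str) -> str:
    """Spell a three- or four-digit road number using the assumed rule."""
    if not (digits.isascii() and digits.isdigit() and len(digits) in (3, 4)):
        raise ValueError("sketch only covers three- and four-digit numbers")
    head, tail = digits[:-2], digits[-2:]
    words = [ONES[int(d)] for d in head]  # leading digits, one word each
    if tail[0] == "0":
        # Assumed rule: a zero in the tens position forces digit-by-digit output.
        words += [ONES[int(d)] for d in tail]
    else:
        words.append(two_digit_word(int(tail)))
    return "-".join(words)
```

A real system would take these formats from the table of FIG. 1 instead of a hard-coded rule; the sketch only shows how the position of the zero changes the output.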

Claims

1. A method for operating an electronic assistance system with speech output, comprising the following steps: a) calculating at least one instruction for the user, wherein the instruction contains at least one number, and wherein the number is defined by a multi-digit sequence of digits; b) acoustically outputting the instruction on an output device, characterized in that the context of the number in the instruction is analyzed and, depending on the result of the context analysis, the number is output in the acoustic output of the instruction either a) as a contiguous number word or b) as a word sequence with several digit words or c) as a combination of at least one contiguous number word and at least one word sequence with at least one digit word.

2. The method according to claim 1, characterized in that in the context analysis the language preset for the acoustic output of the instruction is taken into account as an analysis parameter.

3. The method according to claim 1 or 2, characterized in that it is analyzed in the context analysis whether the number specified in the instruction concerns

- the indication of a distance or

- the indication of a numbered street or

- the indication of a number in a street name or

- the indication of a house number or

- the indication of a postal code or

- the indication of a telephone number or

- the indication of opening hours or

- the indication of passable times or

- the indication of evaluation characteristics.

4. The method according to any one of claims 1 to 3, characterized in that it is analyzed in the context analysis whether the number contains zeros, the acoustic instruction being output differently depending on the position of the zero in the number and/or depending on the context of the number and/or depending on the preset language.

5. The method according to any one of claims 1 to 4, characterized in that, for the acoustic output of the instruction, a plurality of spoken word elements stored in a memory are combined.

6. The method according to any one of claims 1 to 5, characterized in that for the acoustic output of the instruction the necessary spoken word elements are synthesized in a speech synthesis module.

7. The method according to any one of claims 1 to 6, characterized in that the electronic assistance system is a navigation system.

8. The method according to claim 7, characterized in that the instruction is a maneuvering instruction of the navigation system, which contains information for the user to drive on a route.