WO2008018274A1 - Character converting device and character converting device control method - Google Patents

Character converting device and character converting device control method Download PDF

Info

Publication number
WO2008018274A1
WO2008018274A1 PCT/JP2007/064209 JP2007064209W WO2008018274A1 WO 2008018274 A1 WO2008018274 A1 WO 2008018274A1 JP 2007064209 W JP2007064209 W JP 2007064209W WO 2008018274 A1 WO2008018274 A1 WO 2008018274A1
Authority
WO
WIPO (PCT)
Prior art keywords
character
word
kanji
input
information
Prior art date
Application number
PCT/JP2007/064209
Other languages
French (fr)
Japanese (ja)
Inventor
Hitoshi Hirose
Mitsuhiro Hakaridani
Original Assignee
Sharp Kabushiki Kaisha
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sharp Kabushiki Kaisha filed Critical Sharp Kabushiki Kaisha
Priority to JP2008528762A priority Critical patent/JP4890551B2/en
Publication of WO2008018274A1 publication Critical patent/WO2008018274A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • G06F40/129Handling non-Latin characters, e.g. kana-to-kanji conversion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/109Font handling; Temporal or kinetic typography

Definitions

  • Character conversion device control method of character conversion device
  • the present invention is an apparatus for a user to input characters or a character string using a pointing device such as a keyboard, a touch panel, a tablet, and a mouse, and receives a conversion instruction and receives the character or characters input.
  • a character conversion device related to a character conversion device which selectively displays a converted character or character string according to a column and the user can select a displayed character or character string to input a desired character or character string. It is a thing.
  • the information device increases the time to search the lexical data ability for the kanji that matches the input reading. Therefore, after the user inputs a reading and instructs conversion, the response from displaying the conversion result is deteriorated, and as a result, it is difficult to use the information device.
  • Patent Document 1 Japanese Patent Laid-Open Publication No. Hei 6-149787 (May 31, 1994 published) Disclosure of Invention
  • the number of characters of the input reading is within the maximum number of characters of the reading of the word starting with the reading of the first character of the input reading character string recorded in the index information.
  • the present invention has been made in view of the above problems, and an object thereof is to provide a character conversion device, a character input method, a character input program, and a character conversion device capable of shortening a word search time for input phonetic characters.
  • a computer readable recording medium is provided.
  • the character conversion device converts an input character string consisting of one or more input phonetic characters into a converted character, and determines a word that is the converted character.
  • Character conversion device that can
  • word correspondence information in which word identification information for identifying a word is associated with word phonetic information indicating the phonetic sound of the word; and (b-2) a head at the beginning of the word
  • the head conversion character identification information for identifying a conversion character, and the word storage destination information indicating the storage destination in the word correspondence information of the word having the head conversion character indicated by the head conversion character identification information A word dictionary having associated word indexes,
  • a word acquiring unit which refers to the word dictionary based on the acquired head converted character identification information and acquires a word having the converted character indicated by the head converted character identification information at the head And are characterized.
  • the first conversion character acquisition means first searches for the conversion character from the first phonetic character of the input character string with reference to the conversion character dictionary, and then the word acquisition means With reference to the word dictionary, it is possible to acquire a word having the conversion character acquired by the start conversion character acquisition means at the beginning. Therefore, the time concerning the search process of a word can be shortened. That is, since the number of comparisons between the input character string and the phonetic (reading) information can be limited, it is possible to provide a character conversion device with high response speed.
  • the conversion character indicates the character after conversion.
  • the converted character is kanji
  • the first character of the phonetic character (reading) is as follows. For example, in Chinese
  • the first conversion character the first conversion character
  • Converted phonetic information is, for example, “memian” to “for“ 1 ”.
  • the converted character identification information may be a converted character itself which is any information that can identify a converted character which is a character after conversion. For example, if the converted character is kanji, it may be kanji itself or kanji code. The same applies to the word identification information and the head conversion character identification information.
  • FIG. 1 shows an embodiment of the present invention, and is a block diagram showing the main configuration of a character conversion device.
  • FIG. 2 It is a figure showing the flow of processing about conversion of the character which the said character conversion device performs.
  • FIG. 3 is a diagram for explaining a kanji index according to an embodiment of the present invention.
  • FIG. 4 is a view for explaining kanji correspondence information according to an embodiment of the present invention.
  • FIG. 5 is a diagram for explaining a primary index according to an embodiment of the present invention.
  • FIG. 6 is a diagram for explaining a secondary index according to an embodiment of the present invention.
  • FIG. 7 is a diagram for explaining word correspondence information according to an embodiment of the present invention.
  • FIG. 8 is a diagram for explaining a Pinyin dictionary index according to an embodiment of the present invention.
  • FIG. 9 is a diagram for explaining a Pinyin dictionary according to an embodiment of the present invention.
  • FIG. 10 is a diagram showing an example of pin-in division processing according to an embodiment of the present invention.
  • FIG. 11 is a diagram showing an example of pin-in division processing excluding pin-in determined in the pin-in division processing according to an embodiment of the present invention.
  • FIG. 12 is a block diagram showing the main configuration of a character conversion device according to an embodiment of the present invention.
  • FIG. 13 is a block diagram showing a schematic configuration of a multifunction printer to which the character conversion device of the present invention is applied.
  • FIG. 14 is a view showing an example of an operation screen at the time of character input processing, displayed on the operation panel of the character conversion device.
  • FIG. 15 is a view showing an example of a kanji dictionary recorded in the kanji dictionary recording unit, which is referred to when the candidate extraction unit of the character conversion device extracts candidate kanji.
  • FIG. 16 is a block diagram showing a main configuration of a character conversion device for extracting candidate idiom according to another embodiment of the present invention.
  • FIG. 17 is a diagram showing an example of candidate idiom displayed on the display unit of the operation panel of the character conversion device.
  • FIG. 18 A diagram showing an example of a idiom dictionary recorded by the idiom dictionary recording unit.
  • FIG. 19 is a flowchart showing a process flow of the character conversion device according to another embodiment of the present invention.
  • 20] is a diagram showing a display example of a candidate character display area and a confirmed character display area on the operation panel.
  • FIG. 21 is a block diagram showing the main configuration of a character conversion device provided with a learning function according to another embodiment of the present invention.
  • FIG. 22 is a block diagram showing a more detailed configuration of a candidate order determination unit 32.
  • FIG. 23 is a diagram showing an example of a connectivity degree table indicating the connectivity degree with the immediately preceding (forward) character string for each kanji (idiom) recorded in the connectivity degree recording unit.
  • FIG. 24 is a diagram showing a classification procedure of candidates when the candidate order determination unit of the character conversion device determines the display order.
  • FIG. 25 is a diagram showing an example of a data structure for storing evaluation values of display order calculated for each candidate.
  • FIG. 26 is a diagram showing an example of priority information recorded in a candidate order determination element recording part.
  • Fig. 27 is a diagram showing an example of a kanji written in a character input frame of an operation panel, and an example of coordinate information of the kanji.
  • FIG. 28 is a view showing a display example of the operation panel
  • FIG. 28 (a) is a view showing an example of kanji written in the character input frame of the display integrated type tablet.
  • FIG. 28 is a diagram showing an example of a recognized character candidate displayed in the candidate character display area
  • FIG. 28 (c) is a diagram showing an example of a candidate idiom displayed in the candidate character display area.
  • FIG. 29 is a diagram showing an example of the recognition result of the handwritten data “extra character 2”.
  • FIG. 30 is a block diagram showing an essential configuration of a character conversion device for inputting a traditional Chinese character according to another embodiment of the present invention.
  • FIG. 31 is a view showing an example of an operation screen at the time of a phonetic character input process for traditional character input according to another embodiment of the present invention.
  • FIG. 33 is a diagram showing an example of a idiom dictionary recorded by the idiom dictionary recording unit of the character conversion device according to another embodiment of the present invention.
  • FIG. 34 An example of a connectivity table showing connectivity with the immediately preceding (forward) character string for each kanji (words) recorded in the connectivity recording part of the character conversion device in another embodiment of the present invention.
  • FIG. 35 is a diagram showing an example of an operation screen displayed on the display unit of the operation panel.
  • FIG. 36 is a view showing an example of an operation screen displayed on the display unit of the operation panel.
  • FIG. 37 is a diagram showing an example of an operation screen displayed on the display unit of the operation panel.
  • FIG. 38 is a diagram showing an example of an operation screen displayed on the display unit of the operation panel.
  • FIGS. 12 to 38 One embodiment (embodiments 1 to 4) of the present invention will be described below with reference to FIGS. 12 to 38.
  • the character conversion device of the present invention is applied to a multi-function printer (copier) having a touch panel in which an input device and an output device are integrated will be described.
  • the copying machine described in the present embodiment is an example, and the character conversion device of the present invention can be applied to a mobile phone, a portable terminal, a personal computer, a remote control, and the like.
  • the character conversion device of the present invention is not limited to Chinese characters.
  • Phonetic writing power of the desired ideographic character can be obtained.
  • the above pinyin notation is described in Chinese, in which there are four voices representing four different pronunciation. The concept of four voices will be omitted for simplicity of explanation.
  • FIG. 13 is a block diagram showing a schematic configuration of a multi-function printer to which the character conversion device 1 of the present invention is applied.
  • the multifunction printer includes a control unit 10, an operation panel (tablet) 2, a temporary storage unit 3, a recording unit 40, an image reading unit 4, an image transfer unit 5, and a paper feeding unit 6.
  • a control unit 10 an operation panel (tablet) 2
  • a temporary storage unit 3 a recording unit 40
  • an image reading unit 4 an image transfer unit 5
  • the configuration of the multifunction printer described here is shown as an example for implementing the present invention, and does not limit the scope of the present invention.
  • the control unit 10 centrally controls the multi-function printer.
  • the control unit 10 reads various programs recorded in the recording unit 40 and controls the image reading unit 4, the image transfer unit 5, the paper feeding unit 6 and the like to function as a copying machine, Perform processing such as. Further, each unit for performing character input processing according to the present invention is also included in the control unit 10.
  • the recording unit 40 records an operation panel 2, a control program of the above-mentioned various units for operating as a copying machine, and a control program of the character conversion device 1, and an ROM (read only memory) Etc.) are realized.
  • the temporary storage unit 3 is a working area for processing a program, and the power used for temporarily storing read images.
  • temporary storage such as learning characters (strings) at the time of character input processing is used. It is used. It is realized by volatile memory such as random access memory (RAM).
  • Operation panel 2 is used by the user to instruct the multi-function printer to execute various processes.
  • a touch panel in which a display device and an input device are integrally formed or a display integrated tablet.
  • display-integrated tablet It may have a handwritten character recognition function that recognizes written characters. It also has a function to detect information such as instructed characters, character strings, and other function keys based on the position indicated by the user on the display integrated tablet.
  • the user can input a desired alphabet to the multi-function printer by lightly touching each key corresponding to the alphabet (phonetic characters) displayed on operation panel 2 with a pen or a finger. Is possible.
  • the multifunction printer may be a keyboard.
  • (First Input Unit Z Third Input Unit) 7 (for example, a general-purpose keyboard used in a PC) may be provided. In this case, it is possible to input alphabets using any of the displayed keys of the operation panel 2 and the keys of the keyboard 7.
  • the input unit (first input unit) having a key for inputting phonetic characters can be realized simultaneously by both the operation panel 2 and the keyboard 7 or can be realized only by the keyboard 7. Alternatively, it may be configured to switch whether to input the phonetic characters on the operation panel 2 or to the keyboard 7.
  • FIG. 12 is a block diagram showing the main configuration of the character conversion device 1 according to the present embodiment.
  • the character conversion device 1 is configured to include an operation panel 2, a control unit 10, a recording unit 40, and a temporary storage unit 3 as shown in FIG.
  • the operation panel 2 is realized by a touch panel in which the display device and the input device are integrally formed !.
  • FIG. 14 is a diagram showing an example of an operation screen displayed on the operation panel 2 during character input processing.
  • the display unit 12 has a character input area 121 for displaying a keyboard for the user to input characters, and a candidate sentence for displaying characters (or character strings) listed as input candidates.
  • a character (column) display area 122 and a confirmed character (column) display area 123 displaying an input confirmed character that has already been input and confirmed are displayed.
  • the input unit 11 is a user in China It is for inputting pinyin (phonetic character string z phonetic information) of word pronunciation notation, and when the key of the alphabet displayed in the character input area 121 is pressed by the user, the pressed position on the panel is displayed. The signal of the alphabet corresponding to is input to the input / output control unit 20 of the character conversion device 1. Further, the input unit 11 is also for the user to select a character that the user wants to confirm the input. When a kanji (ideographic character Z converted character) displayed in the candidate character display area 122 is pressed, a signal of the kanji displayed in the pressed position is input to the input / output control unit 20, and the kanji character is displayed. The input is confirmed.
  • pinyin phonetic character string z phonetic information
  • the control unit 10 controls the input / output control unit 20 and the character input control unit 30.
  • the input / output control unit 20 controls input / output of signals exchanged between the operation panel 2 and the control unit 10, and includes an input receiving unit 21, a display control unit (display control means) 22, and A supplementary selection acceptance unit 23 is included.
  • the character input control unit 30 executes character input processing, and the candidate extraction unit (character candidate extraction unit Z character string candidate extraction unit) 31, candidate order determination unit (candidate order determination unit) 32, and character confirmation Part 33 (character input confirmation means) is included.
  • each unit in the control unit 10 is realized by a central processing unit (CPU) of a computer and software executed by the CPU. This software is also stored in the above recording unit 40, read out to the temporary storage unit 3, and executed by the CPU.
  • the input receiving unit 21 of the input / output control unit 20 receives signals of characters (such as alphabets) input from the input unit 11 and supplies the signals to the candidate extracting unit 31.
  • the display control unit 22 controls the display unit 12 so that information to be displayed is displayed in each of the character input area 121, the candidate character display area 122, and the confirmed character display area 123 of the display unit 12. is there.
  • Candidate selection accepting portion 23 accepts the signal of the selected candidate kanji in candidate character display area 122 input from input portion 11 and the instruction signal for determining the input of the selected candidate kanji, It is supplied to the determination unit 33.
  • the candidate extraction unit 31 of the character input control unit 30 stores candidate kanji (or) words from the respective dictionary units stored in the recording unit 40 based on the input characters. .
  • the candidate extraction unit 31 extracts a kanji as a candidate from the kanji dictionary unit (character recording unit) 41 of the recording unit 40 based on the input alphabet (string).
  • pinyins corresponding to input alphabets (rows) are identified,
  • the kanji corresponding to each pinyin identified is extracted as a candidate kanji.
  • the candidate extraction unit 31 stores the extracted candidate kanji in an internal buffer (not shown), and instructs the candidate order determination unit 32 and the display control unit 22 to display the candidate kanji.
  • kanji dictionary unit 41 to which the candidate extraction unit 31 refers, alphabet (row) -pinyin-vague characters are recorded in association with one another. Details of the kanji dictionary unit 41 will be described later.
  • the candidate order determination unit 32 determines the display order of the candidate kanji extracted by the candidate extraction unit 31 based on a predetermined condition.
  • the display order is an order of displaying candidate kanji in the candidate character display area 122 as shown in FIG. If all extracted candidate vowels can be displayed at once in the candidate character display area 122, the order is higher at the upper part of the candidate character display area 122 as the upper candidate kanji is selected, and the user selects It should be easy. In addition, when the number of candidate kanji exceeds the number that can be displayed at one time, the page may be switched to display candidate kanji in lower order.
  • the candidate order determination unit 32 determines the order such that the extracted candidate kanji are displayed in order from those likely to be selected by the user based on a predetermined condition. Based on what conditions the order of candidate kanji will be determined, and will be described in detail later.
  • the character determination unit 33 receives from the candidate selection reception unit 23 the signal of the candidate kanji selected by the user input from the input unit 11 and the instruction signal for determining the input of the selected candidate kanji, The input of the candidate kanji selected by the user is determined. First, the selected candidate kanji is displayed in the definite character display area 123 together with the underline indicating the selected state before confirmation. Furthermore, the candidate kanji determined by the character determination unit 33 according to the instruction signal for input determination is displayed in the determined character display area 123 via the display control unit 22 without being underlined as an input determined character.
  • FIG. 15 is a diagram showing an example of a vague dictionary recorded in the kanji dictionary unit 41 which is referred to when the candidate extraction unit 31 extracts candidate kanji.
  • the kanji dictionary 140 according to the present embodiment is an alphabet piny correspondence table (AZP correspondence table 141) in which correspondence between the alphabets (rows) received by the input reception unit 21 and pinyins is recorded.
  • AZP correspondence table 141 an alphabet piny correspondence table
  • PZC correspondence table 142 pinyin-kanji correspondence table in which correspondence between pinyins and kanji is recorded. This makes it possible to directly extract candidate kanji corresponding to the input alphabet (column).
  • the candidate extraction unit 31 first identifies pinyin that matches “m” ahead (in the example of FIG. 15, according to AZP correspondence table 141 of the kanji dictionary 140). “Ma” Wmei ”) is identified, and the address of PZC correspondence table 142 in which the kanji corresponding to the pinyin is recorded is detected (“ cp ⁇ ml ”and“ cp ⁇ mi ”J).
  • the candidate extraction unit 31 selects the kanji corresponding to pinyin of “m” and the forward match.
  • the extracted kanji is supplied to the candidate order determination unit 32 as a candidate kanji as described above.
  • the candidate extraction unit 31 narrows down on the kanji that is in front agreement with “mei”, and extracts candidate kanji from the kanji dictionary unit 41.
  • the candidate extraction unit 31 receives an alphabet input by the user, and specifies pinyin including the alphabet.
  • the candidate extraction unit 31 can extract, from the kanji dictionary unit 41, candidates for kanji that the user desires to input based on the identified pin-ins.
  • the candidate order determination unit 32 determines the kanji candidate extracted by the candidate extraction unit 31 under a predetermined condition. Sort based on and determine the display order of the candidates.
  • the display control unit 22 controls the display unit 12 so that the display order is displayed in the display order determined by the candidate order determination unit 32 of the Chinese character candidate.
  • the character determination unit 33 determines the input of the designated kanji.
  • kanji characters that are candidates to be extracted by the candidate extraction unit 31 can be stored in an external storage device (not shown) and read from an external storage device by user's alphabet input operation. Even if the kanji is read, it may be used when displaying the candidates.
  • the character conversion device 1 is a kanji string (hereinafter referred to as a word (ideographic character string) in which one or more kanji powers having one meaning beginning with the kanji powers are selected after the ⁇ kanji is selected as the input determined characters
  • the candidate character display area 122 (FIG. 14) is displayed as a candidate word.
  • the user can obtain a kanji that can form a word with the ⁇ kanji in the candidate character display area 122, and thus can obtain a desired kanji without separately inputting the pinyin of the following kanji. It becomes. Therefore, it is possible to simplify the input operation for obtaining the desired kanji.
  • FIG. 16 is a block diagram showing the main configuration of the character conversion device 1 that outputs candidate words.
  • the reference numerals attached to the respective components in FIG. 16 correspond to the reference numerals attached to the respective components in FIG. 12, and the same reference numerals indicate the same components. Therefore, the description of the components already described in the above embodiments will not be repeated.
  • the character conversion device 1 shown in FIG. 16 differs from that shown in FIG. 12 in that the control signal is transmitted from the character determination unit 33 to the candidate extraction unit 31, and the word dictionary unit (character string recording unit 42) is included ⁇ ⁇ .
  • FIG. 17 is a diagram showing an example of candidate words displayed on the display unit 12 of the operation panel 2.
  • a word that starts with “extra character 2” is displayed as a candidate word in the candidate character display area 122 of the display unit 12.
  • Kanji words are recorded in association with each other in the word dictionary unit 42 to which the candidate extraction unit 31 refers.
  • FIG. 18 is a diagram showing an example of a word dictionary recorded by the word dictionary unit 42.
  • the word dictionary in this embodiment is a kanji word correspondence table (CZI correspondence table 143) in which correspondences between vague characters recorded in the kanji dictionary unit 41 and words starting from the kanji are recorded. Contains). This makes it possible to further extract candidate words corresponding to the selected kanji.
  • the candidate extraction unit 31 detects the address of “extra character 2” in the PZC correspondence table 142 (FIG. 15) (“ wp—mi + j ”). Subsequently, referring to the word dictionary unit 42, the words starting with "extra character 2" ("extra character 2" month, "extra character 2” heaven ' ⁇ ⁇ ⁇ are extracted based on the address "wp-mi + j". The selected word is supplied as a candidate word to the candidate order determination unit 32 as in the configuration described above, and displayed in the candidate character display area 122 (FIG. 17).
  • the character following the kanji may be a kanji that constitutes a word that also starts its kanji power than a kanji without any context. high. Therefore, if a word starting from a kanji whose input has been confirmed is displayed as a candidate, the user only selects the desired word from among the word candidates and does not separately input Chinese Pinyin. It becomes possible to input. As a result, the user can obtain a desired kanji input with a smaller number of operations related to the input of pinyin.
  • input reception unit 21 receives an input The signal of the alphabet “m” is supplied to the candidate extraction unit 31.
  • the candidate extraction unit 31 specifies pinyin corresponding to the input alphabet, and extracts a kanji corresponding to the specified pinyin from the kanji dictionary unit 41 as a candidate kanji (S202).
  • the candidate order determination unit 32 determines the display order of the above extracted candidate kanji based on predetermined conditions (details will be described later) (S203). Subsequently, the display control unit 22 determines the display position so as to display n candidate kanji in the display order determined by the candidate order determination unit 32, and outputs the candidate kanji to the display unit 12 to present it to the user. (S204).
  • an instruction to display the next candidate kanji other than the kanji displayed at the first time in the candidate character display area 122 by the user (because there is no desired kanji “extra character 2” in (a) of FIG. 20)
  • the process returns to S204, and the display control unit 22 displays a list of the next candidate kanji (see FIG. 20 (b)).
  • the character determination unit 33 instructs the display control unit 22 to display the selected “extra character 2”, and, for the candidate extraction unit 31, extracts a word starting with “extra character 2” next. Point.
  • the candidate extraction unit 31 subsequently extracts a word beginning with “extra character 2” as a candidate word from the word dictionary unit 42 (S207) ((c) in FIG. 20).
  • the candidate order determination unit 32 determines the display order of the extracted candidate words based on predetermined conditions. The order is determined (details will be described later) (S208). Subsequently, the display control unit 22 outputs the candidate words to the display unit 12 in the display order determined by the candidate order determination unit 32 and presents them to the user (S209).
  • the process when the user inputs an instruction to display the next candidate word (NO in S210, YES in S211), the process returns to S209, and the display control unit 22 displays a list of the next candidate words. Do. Furthermore, neither the instruction to display the next candidate word nor the instruction to select the candidate word is input (S210, S211: NO), and if an alphabet is newly input, the process returns to S202 and the candidate extraction unit 31 The pinyin corresponding to the alphabet input so far is specified, and the kanji corresponding to the specified pinyin is extracted as a candidate kanji.
  • the candidate selection reception unit 23 receives the signal of the selected candidate word ““ extra digit 2 ”heaven” and supplies the signal to the character determination unit 33.
  • the character determination unit 33 instructs the display control unit 22 to display the selected candidate word, and the display control unit 22 displays the selected candidate word on the display unit 12 (S 212) (S 212) (S 212) Figure 20 (d)). At the same time, it instructs the candidate extraction unit 31 to extract a word starting from the candidate word. If an instruction to end the input of characters is not input (NO in S213), the candidate extraction unit 31 returns to S207 and extracts a word starting from the selected candidate word as a candidate word from the word dictionary unit 42.
  • the character conversion apparatus 1 of this invention is provided with the structure which records the kanji (word) with which the input was decided by user's selection.
  • the character conversion device 1 is based on the kanji input in the past, that is, based on the kanji recorded by the user.
  • Candidate kanji can be displayed. Kanji used in the past are likely to be used again by the user. Therefore, by using the list of candidate kanji in which such usage history is reflected, the user can obtain a desired kanji with a small number of operations.
  • FIG. 21 is a block diagram showing the main configuration of the character conversion device 1 having a learning function.
  • the reference numerals attached to the respective components in FIG. 21 correspond to the reference numerals attached to the respective components in FIG. 12, and the same reference numerals indicate the same components. Therefore, the description of the components already described in the above embodiments will not be repeated.
  • the character conversion device 1 shown in FIG. 21 differs from that in FIG. 12 in that the character input control unit 30 further includes a learning control unit (learning control means) 34 and a learning dictionary in the recording unit 40.
  • a recording unit (learning word recording unit) 43 and a pair learning dictionary recording unit (learning word recording unit) 44 are included.
  • the learning control unit 34 controls the recording (learning) of the kanji input (used) to the character conversion device 1 according to the instruction of the user.
  • the kanji used by the user is stored as learning kanji in the learning dictionary recording unit 43 (hereinafter referred to as single learning (function)), or the kanji used by the user is
  • the pair is stored in the pair learning dictionary recording unit 44 in association with the kanji input immediately before (hereinafter referred to as pair learning (function)). Since kanji characters that have been input by the user in the past are likely to be used again by the user, by using such learned kanji characters as candidate kanji characters, the user can input desired kanji characters with fewer operations. The probability of getting is increased.
  • the learning dictionary recording unit 43 records kanji (input determined characters) used in the past by the user as single learning kanji.
  • the learning dictionary recorded in the learning dictionary recording unit 43 contains information on the date and time when the single learning kanji was recorded, information on the frequency of use (how many times it has been used in the past, and how many times it has been used within a predetermined period. Etc.) may be recorded in association with each kanji.
  • the learning control unit 34 When receiving the signal of the candidate kanji selected by the user from the character determination unit 33, the learning control unit 34 writes the selected candidate kanji in the temporary storage unit 3.
  • the character determination unit 33 determines the input in accordance with the determination instruction signal
  • the learning control unit 34 receives an instruction of the single learning process of the input determined character from the character determination unit 33.
  • the learning control unit 34 temporarily writes The candidate kanji is read from the memory unit 3 into the internal buffer, and deletion of the candidate kanji read from the temporary storage unit 3 and updating of the contents are performed. Then, the candidate vowels stored in the internal buffer are stored in the learning dictionary recording unit 43 as single learning kanji.
  • Pair learning dictionary recording unit 44 associates a kanji used in the past by the user with information (preceding kanji) indicating which kanji follows the kanji and the pair learning vague Is recorded as The immediately preceding kanji has already been recorded as a single learning kanji in the learning dictionary recording unit 43.
  • the preceding kanji is associated and recorded for each pair learning kanji.
  • Information of the recorded date and time, and information of frequency of use may be associated with each kanji character and recorded.
  • the learning control unit 34 receives from the character determination unit 33 an instruction of a pair learning process of the input determined character.
  • the learning control unit 34 reads the input determined character and its immediately preceding kanji from the temporary storage unit 3 to the internal buffer, and deletes the character string read from the temporary storage unit 3 and updates the contents. Then, the input determined character stored in the internal buffer is associated with the immediately preceding kanji, and is stored in the pair learning dictionary recording unit 44 as a pair learning kanji.
  • a word may be recorded in a dictionary. That is, the word used may be recorded in the learning dictionary recording unit 43 by the single learning function of the learning control unit 34. For example, when the word “water supply” is used, the word “water supply” is recorded in the learning dictionary recording unit 43 as a single learning kanji (word) in addition to the kanji "supply".
  • the pair learning function of the learning control unit 34 when the words “college” and “professor” are continuously input, the kanji “teaching” is It can also be recorded as paired learning kanji in association with kanji (word). Furthermore, the word “professor” is associated with “ ⁇ ” and “Z” or “University” 's last kanji (word) and recorded as a paired learning word.
  • the candidate order determination unit 32 (FIGS. 12, 16, and 21) of the character conversion device 1 is a candidate (candidate kanji Z A predetermined condition (hereinafter referred to as a candidate order determining element) for determining the display order of the candidate words will be described.
  • the display unit 12 provided on the operation panel 2 can display more candidates at one time.
  • the operation panel 2 is realized by a display Z input integrated input / output device such as a touch panel, it is possible to significantly reduce the number of operations for the user to obtain a desired kanji.
  • the burden of searching for a desired candidate is increased. Therefore, when displaying a limited number of candidates in a limited display area, it is desirable that they be displayed in the order of the level of possibility (selecting) that the user desires (selects) kanji (word) power.
  • the order of candidates to be displayed is determined so that the candidates are displayed in the display order with higher accuracy and in accordance with the preference of the user.
  • a character conversion device capable of simplifying the input operation for obtaining a desired kanji.
  • the display order of deviations of the Z candidate words is also determined in the same manner.
  • the candidate order determination unit 32 determines the display order of candidates based on the following seven candidate order determination factors.
  • the appearance frequency is information that is an index of how often it appears in a character (string) of a certain language. The higher the frequency of occurrence, the more often it is used in the language usage environment. Therefore, the higher the frequency of appearance, the higher the display order.
  • the appearance frequency may be fixed information using known statistical data.
  • the display order is determined based on whether or not the number of characters of the input alphabet (column) matches the number of characters of the corresponding candidate pinyin.
  • Pinyin length ranks the display order of candidates whose number matches the number of input Pinyin characters.
  • the input confirmation character that has already confirmed input up to that point and Z or at this time Increase the display order of the possibility (continuity with the previous character string) immediately after the selected character (string) and the candidate.
  • the display order is determined depending on whether it is a single learning kanji (word) recorded in the learning dictionary recording unit 43 or not. If it is a kanji (word) that has been input (used) in the past, it is likely to be used again. Therefore, the display order of the candidate V recorded as a single learning kanji (word) is ranked high.
  • the display order is determined according to whether or not the pair learning kanji (word) is recorded in the pair learning dictionary recording unit 44 and whether or not it is a pair learning kanji (word). If it is a combination of kanji (word) that has been input (used) in the past, the possibility of being used in the combination is high again. Therefore, as a pair learning kanji (word), the display order of candidates is ranked high.
  • the display order is determined based on the time when the learning kanji (word) matching the candidate kanji (word) is recorded in the learning dictionary recording unit 43 and Z or the pair learning dictionary recording unit 44.
  • the candidate character display area 122 (FIG. 14) can be displayed preferentially as a candidate with a high possibility of being selected by the user. It is possible to obtain a desired kanji with a small number of key operations.
  • the candidate order determining element is information necessary for the candidate order determining unit 32 to determine the display order of candidate kanji Z candidate words. More specifically, candidate order decision The determination unit 32 is information used to calculate an evaluation value indicating the degree of possibility that the candidate kanji character Z candidate word is selected by the user. Information on candidate order determination elements is associated with each candidate (candidate kanji Z candidate word) and stored in each (recording) unit of the recording unit 40. The candidate order determining unit 32 refers to each unit of the recording unit 40 It is then possible to identify candidate order decision factors for each candidate. Furthermore, the recording unit 40 may store an evaluation value calculation rule that defines how to calculate the evaluation value from the candidate order determination element specified by the candidate order determination unit 32. As a result, the candidate order determination unit 32 can obtain an evaluation value for each candidate according to the specified candidate order determination element and the evaluation value calculation rule, and can determine the candidate display order. The evaluation value calculation method of the candidate order determination unit 32 will be described later.
  • FIG. 22 is a block diagram showing a more detailed configuration of the candidate order determination unit 32.
  • the candidate order determination unit 32 includes a learning kanji determination unit 131, a pinyin length determination unit 132, a connectivity degree determination unit 133, an appearance frequency determination unit 134, and a learning time determination unit 135.
  • the recording unit 40 (FIGS. 12, 16, and 21) includes an appearance frequency recording unit 45 and a connectivity degree recording unit (association degree recording unit) 46 in addition to the above-described storage units.
  • Temporary storage unit 3 internally includes extraction candidate temporary storage unit 51, evaluation value temporary storage unit 52, and display order temporary storage unit 53.
  • An extraction candidate temporary storage unit 51 in the temporary storage unit 3 temporarily holds kanji characters and Z or words as candidates extracted by the candidate extraction unit 31 (hereinafter referred to as extraction candidates). .
  • the candidate order determination unit 32 reads out the extraction candidates from the extraction candidate temporary storage unit 51, and performs processing of determining the display order of each extraction candidate.
  • the evaluation value temporary storage unit 52 temporarily holds the evaluation value given to each of the extraction candidates calculated based on the candidate order determination element and the evaluation value calculation rule.
  • the display order may be determined based on the finally calculated evaluation value.
  • the display order temporary storage unit 53 temporarily holds information of the display order determined for each of the extraction candidates.
  • the candidate order determination unit 32 stores the determined display order in the display order temporary storage unit 53.
  • Information on the display order of the display order temporary storage unit 53 is referred to by the display control unit 22 when displaying the extraction candidates on the display unit 12.
  • the learning kanji judging unit 131 of the candidate order determining unit 32 judges whether the extracted candidate is a learning kanji (word) and whether it is single learning or pair learning. It is.
  • the learning kanji judgment unit 131 reflects the judgment result on the evaluation value of the evaluation value temporary storage unit 52 or the display order of the display order temporary storage unit 53. This makes it possible to determine the display order based on elements E and Z or element F.
  • the pin-in length determination unit 132 determines whether or not the number of characters of the alphabet (row) input to the input reception unit 21 (FIGS. 12, 16, 21) matches the number of characters of the corresponding candidate pinyin. To determine the For example, suppose that you have entered up to "me” in order to obtain the kanji "" ⁇ ⁇ 2 "(pinyin: mei). At this point in time, the number of input characters is 2, and the pinyin length of the corresponding candidate kanji character "" 2 "(mei;)" is 3, so the pinyin length determination unit 132 determines that the pinyin lengths do not match.
  • Connectivity determination section 133 determines the degree of connection with the input determined character and the character Z or the character currently selected (kanji Z word) whose input has already been determined by that time, that is, according to the degree of connection. Next, the character desired by the user is predicted to be judged as a candidate.
  • the connectivity determination unit 133 refers to the connectivity table 150 recorded in the connectivity recording unit 46 to determine the connectivity with the determined or selected front character string and Z or the previous character string. The higher the degree of connectivity with the forward character string and Z or the immediately preceding character string, the higher the probability that the character is the character desired by the user, and the display order is made higher. Thereby, the display order based on element C and Z or element D can be determined. Details of the connectivity degree table 150 and the connectivity degree determination unit 133 of the connectivity degree recording unit 46 will be described later.
  • the appearance frequency determination unit 134 uses known statistical data to be recorded in the appearance frequency recording unit 45. Based on it, the frequency with which candidate kanji (words) appear (is used) is determined. The more frequently they occur, the more likely they will be used by the user, since the character will be used more frequently. Therefore, the higher the frequency of appearance, the higher the display order. Thereby, the display order based on element A can be determined.
  • the learning time determination unit 135 determines the time when the learning kanji (word) matching the candidate kanji (word) is recorded in the learning dictionary recording unit 43 and Z or the pair learning dictionary recording unit 44. is there. The newer the time when the learned kanji (word) is recorded, the more likely it is to be selected by the user. Therefore, the display order of the candidates is determined so that the learning time is higher for the latest kanji (word). This makes it possible to determine the display order based on the element G.
  • the candidate order determination unit 32 can determine the candidate display order based on the various conditions described above.
  • connection degree determination unit 133 Next, the operation of the connection degree determination unit 133 will be described in more detail.
  • FIG. 23 is a diagram showing an example of the connectivity degree table 150 recorded in the connectivity degree recording unit 46 and showing the connectivity degree with the immediately preceding (forward) character string for each kanji (word).
  • the continuity degree table 150 is stored for each kanji (word), and is used to detect the address of continuity information with the immediately preceding character string and the address of relevance information with the forward character string.
  • Connectivity degree correspondence table 151 and continuity table 1 52 that stores continuity with immediately preceding character string for each kanji (word), and forward relevance that stores association with forward character string for each kanji (word) Table 15 contains three.
  • connection degree determination unit 133 when the word “traffic” is selected by the user, the connection degree determination unit 133 has a high probability of continuing after “traffic” as the immediately preceding character string, and the kanji (word) has a degree of connectivity Search from Table 150.
  • the connectivity determination unit 133 refers to the connectivity correspondence table 151 and stores the continuity information strength with the immediately preceding character string “traffic” in the address “npi + l” of the continuity table 152 Detect that
  • the connectivity determination unit 133 refers to the address “npi + 1” of the continuity table 152, and “rule (continuity 21)”, “college (16)”, “tool (15)” ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ Detects that the continuity is high. Finally, it is decided that the display order of each candidate word mentioned above is to be in the order of "rule” "college” "tool” ⁇ ⁇ ⁇ ⁇ .
  • the connectivity determination unit 133 is input in connection with “Shanghai” as a forward character string. Search the connectivity table 150 for ⁇ kanji (words). The connectivity determination unit 133 refers to the connectivity correspondence table 151, and stores the relevancy information with the forward character string “Shanghai” in the address “ppi + s” of the forward relevancy table 153, Detect
  • the connectivity determination unit 133 refers to the address “ppi + s” of the forward relevancy table 153 and selects “University (Relevancy 10)”, “Rule (5)”, “Police (3) ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ Detect that the relevance is high. Finally, it is decided that the display order of each candidate word mentioned above should be in the order of "college” "rule” "police” ⁇ ⁇ ⁇ .
  • the display order of candidate kanji can be determined as well as the words that the user often inputs
  • the candidate kanji 'words are displayed in the display order that reflects the user's preference. It can be done. As a result, it is possible to reduce the number of key operations of the user and simplify the user's input operation for obtaining a desired kanji.
  • the judgment result based on element D may be prioritized .
  • the candidate word “rule” is not always prioritized and displayed, but if "Shanghai” is input ahead, it is The supplementary word “University” can be displayed preferentially.
  • the word “Shanghai” is displayed on the determined character display area 123 (FIG. 14).
  • the candidate extraction unit 31 of the character conversion device 1 extracts candidate words input following the input determined character "Shanghai".
  • the candidate order determination unit 32 refers to the connectivity recording unit 46 in FIG. 22 to determine the display order of candidate words in order to display the high probability word following “Shanghai” in the upper order.
  • the connectivity determination unit 133 refers to the address “npi + s” of the continuity table 152 based on the connectivity correspondence table 151 (FIG. 23), and directly transmits the word candidate to the word “Shanghai”. Arrange in descending order of continuity probability. That is, in the example shown in FIG. 23, candidates are obtained in the order of “University (15)”, “Transportation (13)”, “Police (12)”, “Rule (8)”,. Subsequently, the connectivity determination unit 133 rearranges the display order of candidate words following “Shanghai” based on the relevancy to the word located in front of the input determined character “Shanghai”. However, here, since there are no kanji characters or words located in front of “Shanghai”, the change in display order due to the relationship with the front character string is not executed.
  • the character conversion device 1 When “traffic” is selected by the user from among the candidate words displayed in the above procedure, the character conversion device 1 subsequently ranks candidate words that are directly consecutive to the input determined character “traffic”. Determine the display order to display.
  • the connectivity determination unit 133 refers to the address “npi + l” of the continuity table 152, and based on the continuity, “rule (21)”, “college (16)”, “tool (15)” , “Police (14)”...
  • the address “ppi + s” of the forward relevance table 153 based on the relevance (degree of relevance) with the kanji and words (here, “Shanghai”) located ahead of “traffic” Rearrange the display order of candidate words that follow “Cross”. More specifically, in the example shown in FIG. 23, according to the forward relevance table 153, the relevance of the candidate word to "Shanghai” is, in order, "University (10)", “Rule (5)”, “Police (3)” and “Tool (2)”..
  • the degree-of-connection determining unit 133 gives priority to the degree of association of the candidate word, and changes the display order in the order of the degree of association.
  • the display order of candidate words following “Shanghai traffic” is “University”, “Rule”, “Police” in the order of “Rule”, “University”, “Tool”, “Police”... , "Tool” ... is changed in order.
  • change the display order based on the sum of connectivity and relevancy. You may change it.
  • the display order may be changed based on the average value and the maximum value of connectivity and relevance.
  • an evaluation value calculation rule that specifies that the sum of the degree of connectivity and the degree of association is calculated as an evaluation value is stored in the recording unit 40, and the candidate order determination unit 32 determines in accordance with the evaluation value calculation rule.
  • the sum of connectivity and relevance can be calculated as the evaluation value, and the display order can be determined based on the evaluation value.
  • the above-mentioned candidate order determining elements are combined to determine the display order of the extracted candidates.
  • the present invention is not limited to the combination described below, and the display order can be determined by appropriately combining the above-described respective elements.
  • FIG. 24 is a diagram showing a classification procedure of candidates when the candidate order determination unit 32 determines the display order.
  • the candidate order determination unit 32 first classifies the candidates into several groups for each display order according to a predetermined condition (the above-mentioned elements A to G and their combination), and the final display order in that group. Decide.
  • the candidate word G extracted by the candidate extraction unit 31 is first stored in the extraction candidate temporary storage unit 51 (FIG. 22) of the temporary storage unit 3.
  • the learning kanji judgment unit 131 judges whether or not the learning word is Ka (step S301).
  • the pinyin length determination unit 132 compares the pinyin length (number of characters of the alphabet) inputted when the candidate word G is extracted with each pinyin length of the candidate word G (S302). .
  • candidate words G candidate words that are learning words and that are determined to have a matching pinyin length are classified into group A in the highest display order.
  • candidate words that are determined to be learning words but do not match the pinyin length are classified into group A in the upper layer dullup B next.
  • candidate words that are not learning words but whose pinyin length is determined to match are classified into group C under group B.
  • Even Pinyin long as learning word The candidate words judged to be unmatched ⁇ are classified into the lowest group D.
  • learning kanji judgment unit 131 further judges whether it is a single learning word or a pair learning word ( S303). Further, the connectivity degree determination unit 133 determines whether or not there is continuity with the immediately preceding character string (S 304).
  • candidate words that are paired learning words and determined to have continuity with the immediately preceding character string are classified into the topmost group A1.
  • Candidate words that are judged as pair learning words but have no continuity with the immediately preceding character string are classified into the upper group A2.
  • Candidate words that are judged not to be paired learning words but have continuity with the immediately preceding character string are classified into lower group A3.
  • Candidate words judged not to be consecutive with the immediately preceding character string that is not a paired learning word are classified into the lowest group A4.
  • Duppe B candidate words are classified similarly.
  • connectivity degree determining section 133 determines the presence or absence of continuity with the immediately preceding character string (S 305), and upper group C (D 1) and the lower group C (D) 2
  • all candidate words (G) are, in order of upper group strength, group Al, group A2, groupo A3, ancestral A4, group Bl, group B2, group B2, group B3, group B4, group Cl, group It is divided into small groups of C2, group Dl and group D2.
  • the order of each small group in groups A and B is further determined by the learning time (element G) (the higher the kanji learned recently, the higher rank), and the small groups in groups C and D have their frequency of occurrence (element A). Determines the order (most commonly used words are higher)
  • the learned candidate kanji 'words are rearranged under a predetermined condition (that is, in the order of newly learned), but the candidate kanji extracted as a candidate is You may arrange later as the candidate word learned in the past.
  • the words learned in the past may be arranged so that they are displayed prior to candidate kanji. If there are multiple words used in the past, including candidate kanji, they may be arranged in ascending order of the number of characters. Alternatively, it may be arranged in the order of learning or in the reverse order as time lapse. (Modification 3—Input of Four Voices)
  • an AZP correspondence table 141 shown in FIG. 15 may be created including a four-voice code (for example, one to four voices are represented by one to four voices and five light voices for PC input). More specifically, in the AZP correspondence table 141 and the PZC correspondence table 142 shown in FIG. 15, it can be seen that pinyin "shang" is associated with the Chinese character "upper".
  • pin-in with the four voices of “upper” is “three-voice shang”
  • “sh ang” corresponding to “cp— si” in the AZP correspondence table 141 should be described as “511 & 11 8 3”. Then, it is possible to associate pinyin "shang3" with "four voices" with "on”.
  • the character conversion device 1 of the present invention further adds priority information to the above-described candidate order determination element, and the user uses the condition (evaluation value calculation rule) for determining the display order of candidate kanji (words). It may be configured to be set arbitrarily. Since the conditions for determining the display order can be arbitrarily set by the user, it is possible to display the candidates in the display order that matches the user's preference on the multifunction printer equipped with the character conversion device 1. Become. As a result, the input operation for obtaining the desired kanji can be simplified.
  • the character conversion device 1 in the present embodiment includes a candidate order determination element setting unit (candidate order determination element setting unit) 35 in the character input control unit 30 in addition to the configuration of the above-described embodiment 1,
  • the candidate order determination factor recording unit 47 is further included in the recording unit 40 (FIG. 12).
  • evaluation values calculated for each candidate are temporarily stored in the evaluation value temporary storage unit 52 shown in FIG. 22, and the display order is determined based on the evaluation values. That is, the higher the priority order candidate determination element set by the candidate order determination element setting unit 35, the larger the influence on the calculation of the evaluation value.
  • the candidate order determining element set by the candidate order determining element setting unit 35 and the priority thereof are recorded in the candidate order determining element recording unit 47. Will be recorded.
  • FIG. 25 is a diagram showing an example of a data structure for storing evaluation values of display order calculated for each candidate.
  • the evaluation value for each candidate is expressed in 16 bits. That is, values of “0” or “1” indicating characteristics (elements A to G) of a candidate kanji (word) are stored in 16 areas shown in FIG.
  • the bit position is associated with the high and low of the evaluation value, and as it becomes the upper bit, it represents the high-order digit of the evaluation value. Therefore, the judgment result on the candidate order determination element with high priority is reflected in the upper bits.
  • the evaluation value calculation rule specifies how to calculate the value of “0” or “1” from the candidate order determining element element of the candidate kanji (word), and also determines the value thus calculated. It defines which bit position of the evaluation value is to be reflected.
  • the candidate order determination element recording section 47 is for recording priority information of candidate order determination elements for determining the display order of candidates.
  • (A) of FIG. 26 and (b) of FIG. 26 are diagrams showing an example of priority information recorded in the candidate order determining element recording section 47.
  • the priority information includes, for each candidate order determination element, bit position information indicating which bit position of the data structure of the evaluation value the determination result on the candidate order determination element is to be reflected.
  • bit position information indicating which bit position of the data structure of the evaluation value the determination result on the candidate order determination element is to be reflected.
  • the determination result of the element B is set to be stored in the area of the bit position 12 shown in FIG. That is, when the pin-in length matches, "1" is stored in the area of bit position 12; otherwise, "0" is stored as the evaluation value. Evaluation of candidate with higher priority for candidate order determinant assigned higher bit position The impact on determining the value is large.
  • the evaluation value is expressed in 16 bits and the evaluation value is calculated using the area of bit positions 0 to 12, but the present invention is not limited to this example.
  • the present invention is not limited to this example.
  • the bit length is also possible to make the bit length sufficient to accommodate them.
  • the appearance frequency is expressed by 8 bits
  • 16 bits are secured to clarify the difference in the appearance frequency or It is possible to reduce to 4 bits, etc., in order not to emphasize the difference in frequency.
  • empty bit positions 13 to 15 may be used.
  • the candidate order determination element setting unit 35 sets the priority of the candidate order determination element specified by the user via the input unit 11 (FIG. 12), and the priority information is stored in the candidate order determination element recording unit 47. It is something to store.
  • the display unit 12 of the operation panel 2 displays the candidate Display an operation screen prompting to specify the priority of the order determining element. For example, along with a message such as “Indicate in the order you want each element (A to G) to be prioritized”, a list of candidate order determining elements for which priority can be set may be displayed.
  • a message asking whether to determine the power is displayed. For example, you may display a message such as "Please press the Enter key to make a change.”
  • candidate order determination element setting unit 35 determines a candidate order determination element having a low priority designated by the user from the low bit position according to the input instruction signal. Allocating in order as the area of, change the information of the bit position storing the judgment result of each candidate order decision element.
  • the user determines element E, element F in order with the pair learning kanji Z word of element F as the candidate order determining element with the highest priority. If element B, element C, element D, and element AZG are specified, the priority information is changed as shown in (b) of FIG. In the example shown in (b) of FIG. 26, the judgment result of element F is the range of bit position 12 Will be stored in the area.
  • each part (FIG. 22) of candidate order determination unit 32 determines the determination result based on the priority information of the candidate order determination elements specified by candidate order determination element storage unit 47.
  • the evaluation value is stored in the bit position of the designated evaluation value temporary storage unit 52.
  • the evaluation value for determining the display order can be calculated based on the candidate order determination factor that the user wants to prioritize, so the candidates can be displayed in a display order that more closely matches the user's preference. Can be displayed.
  • the priority information of the candidate order determining element may be set separately for the candidate kanji and the candidate word, or may be similarly applied to the display order of both candidate kanji 'candidate words.
  • the configuration may be such that the degree information can be set. If the configuration is such that the candidate kanji and the candidate word can be set separately, the priority change operation can be changed by changing the priority of the candidate order determination element in order of the candidate kanji and the candidate word by one priority setting menu call operation. You may be able to select the priority setting menu for candidate kanji and candidate words, and change the priority.
  • a setting end key may be provided, and desired priority information may be changed with respect to the contents up to when the setting end key is pressed. For example, when the setting end key is pressed in the middle of changing the priority of the candidate order determination element, it is possible not to reflect the content of the change in priority but to reflect only the change of the candidate order determination element. is there.
  • the candidate extracted by the candidate extraction unit 31 is temporarily stored in the extraction candidate temporary storage unit 51.
  • the evaluation value temporary storage unit 52 for each candidate stored, a 16-bit area (FIG. 25) representing the evaluation value is secured.
  • the learning time determination unit 135 calculates the learning time or the appearance frequency, respectively, otherwise the appearance frequency determination unit 134 calculates the learning time or the appearance frequency.
  • the appearance frequency is about various document character data such as a large amount of newspapers and magazines. It is the frequency of use statistically determined for each letter and word, and the larger the frequency, the more commonly used letter 'word' is assumed. In this embodiment, since the area of appearance frequency is secured at 8 bits, it is possible to set the evaluation value of the candidate between 0 and 255 based on the appearance frequency. .
  • the learning time can be determined between 0 and 255 so that the more recently learned one has a higher evaluation value.
  • the learning time indicates when the candidate kanji (word) was learned. It will be as follows if an evaluation value calculation rule is explained concretely. For example, assuming that the learning time is X, the maximum number of learning is N, the learning kanji and the word learning time are n, and the current learning time is m, the learning time for any kanji and word is X It is determined by
  • the value of X in the above equation exceeds the bit length range (8 bits in this embodiment) secured as the evaluation value of the learning time, the value of X is set to the bit length range. You can do numerical conversion of or change the bit length.
  • Bit positions 8 to 12 store the determination results for elements D, C, F, E, and B, respectively.
  • the learning kanji determination unit 131, the pinyin length determination unit 132, and the connection degree determination unit 133 store the determination results for elements D, C, F, E, and B, respectively.
  • the candidate order determination unit 32 rearranges the candidates in order of the evaluation value magnitude stored in the evaluation value temporary storage unit 52, and displays the final display order Store in display order temporary storage unit 53. Finally, the display control unit 22 (FIG. 12) displays the candidates based on the display order stored in the display order temporary storage unit 53. Display on section 12.
  • the character conversion device 1 of the present invention is not limited to this.
  • the input unit 11 (FIG. 12) of the operation panel 2 also has a function of inputting handwritten characters, and the input / output control unit 20 recognizes the handwritten characters to constitute the character conversion device 1 for inputting Chinese characters. It is possible.
  • the input unit (written character input unit) 11 for inputting handwritten characters may display a frame for inputting handwritten characters on the display area of the operation panel 2 as a display integrated type, for example. It can be realized from this.
  • the character conversion device 1 is a character recognition unit (character recognition means) for recognizing writing information (coordinate data) of characters written by the user in the character input box in the input / output control unit 20. Not shown).
  • the recording unit 40 further includes a recognition dictionary (not shown) used to match the input character pattern recognized by the character recognition unit. Then, the display control unit 22 is configured to output the recognition result of the input character! .
  • operation panel 2 detects the coordinate information 60 (FIG. 27) of the written character, and the input / output control unit 20 internal buffer
  • the character recognition unit matches the coordinate information stored in the internal buffer with the dictionary pattern of the recognition dictionary (for example, DP matching), and obtains the similarity (for example, distance) between the input pattern and the dictionary pattern.
  • the character code corresponding to and the similarity are associated and stored in the internal buffer.
  • the character recognition unit matches the input pattern with each pattern of the recognition dictionary, the character recognition unit rearranges the recognition results of the internal buffer in descending order of similarity. Then, the recognition result is supplied to the candidate extraction unit 31 as a candidate kanji (word).
  • operation panel 2 detects the coordinate data of “extra character 2” and stores it in the internal buffer. (Fig. 27).
  • the character recognition unit performs DP matching between the coordinate data of “external character 2” in the internal buffer and each dictionary pattern of the recognition dictionary, calculates distances, and rearranges them in ascending order of distance (FIG. 29).
  • On The output control unit 20 transfers the recognition result to the display control unit 22.
  • Display control unit 22 determines the display position of n characters (here, kanji) as recognition character candidates based on the recognition result, and displays information on display positions and n recognition character candidates. Is displayed on the display unit 12 of the operation panel 2 ((b) in FIG. 28). Operation panel 2 associates the displayed recognized character candidate with the display position and stores it in the internal buffer.
  • n characters here, kanji
  • the function of the character recognition unit of the character conversion device 1 for recognizing handwritten characters is not limited to this, with the multifunction printer taken as an example.
  • the above-described character recognition unit can be applied to any device provided with a touch panel integrated with a display. Therefore, it is possible to realize the character conversion device of the present invention capable of inputting a desired kanji 'word by inputting a written character, on any device other than the above-mentioned multifunction printer. .
  • the character conversion device 1 of the present invention is not limited to the above configuration.
  • the character conversion device 1 of the present invention as a character conversion device for inputting the traditional characters mainly used in Taiwan.
  • a vowel character (jutsu character's letter) as a phonetic character is accepted, and a character variation is presented to the user as a candidate kanji character (word) as a traditional kanji character as an ideographic character.
  • the replacement device 1 will be described.
  • FIG. 30 is a block diagram showing the main configuration of the character conversion device 1 according to the present embodiment.
  • the character conversion device 1 shown in FIG. 30 differs from those in FIG. 12 and FIG. 22 in that the input unit 11 is provided with a key for inputting a note character, which is associated with each note character.
  • the kanji dictionary unit 41 includes a phononic character input acceptance unit 24 for accepting the input phononic character and supplying it to the candidate extraction unit 31. It is a point that contains a correspondence table that records the correspondences between the ring tones and the traditional characters of the ring tones.
  • FIG. 31 is a diagram showing an example of the operation screen at the time of the phonetic character input process for traditional character input in the present embodiment.
  • the display integrated type operation panel 2 in the present embodiment may display the vulgar character in addition to displaying the alphabet (character input area 121). It is common to enter vowels instead of the alphabet for traditional Chinese input.
  • the phonetic characters are assigned to the keys of the keyboard, so that it is possible to input a vocal sound corresponding to the traditional characters.
  • FIG. 32 is a diagram showing an example of a kanji dictionary (in Japanese) recorded in the kanji dictionary unit 41 (FIG. 30) according to the present embodiment.
  • FIG. 33 is a diagram showing an example of a word dictionary recorded in the word dictionary unit 42 in the present embodiment.
  • FIG. 34 is a diagram showing an example of the connectivity degree table 150a recorded in the connectivity degree recording unit 46 in the present embodiment.
  • the calculation of the learning order Y may be performed by the learning time determination unit 135 shown in FIG.
  • the learning number represents a unique number to be assigned to the newly learned kanji, and this number can be recorded together with the learning kanji as a learning counter to know when the kanji has been learned. It is. Assuming that it is possible to record 500 learning kanji (words) in the learning dictionary recording unit 43 (or the pair learning dictionary recording unit 44) of FIG. 21, the learning numbers are prepared from 0 to 499. do it.
  • the learning timing determination unit 135 may perform processing of rounding the learning order ⁇ obtained for each candidate, in order to save time and effort of rearranging the candidates.
  • the learning order can be rounded according to the following procedure.
  • the learning kanji learned in 43 has a learning counter equal to the current learning number Search for learning kanji. Delete the kanji at the address ID of the retrieved learning kanji and set a new learning kanji. If there is no learning kanji with a learning counter, etc., etc., for the current learning number, record both the learning kanji and the learning counter at the ID position. Then, add 1 to the learning number.
  • the learning control unit 34 overwrites the new learning vague “large” with “learning” recorded along with the learning counter 0. Record with At this time, learning counter 0 is recorded in association with "large”. Subsequently, the learning number is updated to 2 as described above.
  • a group identifier (GId value) generated from the character code of the kanji constituting the word is assigned to the learning word recorded in the learning dictionary recording unit 43, and the learning word is associated with the GId value. Are recorded. By using this GId value, the processing efficiency for searching for a target learning word from the learning dictionary recording unit 43 is improved.
  • both the kanji code and the GId value for uniquely identifying a kanji are represented by 16 bits.
  • the GId value is composed of the upper 4 bits indicating the number of characters of the word and the lower 12 bits indicating the numerical value obtained by calculation of the kanji code of each kanji constituting the word.
  • the data structure of the GId value used in the character conversion device 1 of the present invention is not limited to this.
  • the learning control unit 34 First, generate the GId value of the word “Shanghai".
  • the GId value generated as described above is a word having any number of characters (if the number of characters is represented by 4 bits as in the above example, a maximum of 8 characters can be used as a word). It will be summarized as a fixed length (eg, 16 bits). Therefore, since it is possible to execute the search process after narrowing down the target word by the GId value configured with the fixed length, it becomes possible to improve the search efficiency of the learning word.
  • the operation screen for character input processing displayed on operation panel 2 is not limited to the example shown in FIG. 14 or FIG. Another example of the operation screen displayed on the operation panel 2 will be described.
  • the alphabet keys for inputting pinyin displayed on operation panel 2 may be a key arrangement imitating a PC keyboard as shown in FIG. 35, or as shown in FIG. 36, an input character Let's fit in alphabetical order to m X n key array.
  • a candidate character display area 122 for displaying candidate kanji may be superimposed on the determined character display area 123 and displayed ((a) to (c) in FIG. 37).
  • second candidate character display area 122 ′ for displaying candidate words starting from the selected kanji is superimposed on candidate character display area 122 and determined character display area 123. It may be displayed (FIG. 38 (a)) or may be displayed side by side with the candidate character display area 122 (FIG. 38 (b)).
  • FIGS. 1 to 11. One embodiment of the present invention will be described below with reference to FIGS. 1 to 11. .
  • a device for inputting a reading (phonetic characters) of kanji characters (converted characters, characters after converted) to determine words including vague characters is used, but the present invention is not limited thereto. That is, the embodiments described below are merely examples for implementing the present invention, and are not limited to the implementation of the present invention.
  • FIG. 1 is a block diagram showing a configuration of character conversion device 1 according to the present embodiment.
  • the character conversion device 1 includes an input unit 11, a dictionary search unit 310, a display control unit 22, a kanji dictionary unit (conversion character dictionary) 41, a word dictionary unit (word dictionary) 42, a temporary storage unit 3, a control unit 10, a display It has part 12.
  • the reference numerals attached to the respective constituent elements in FIG. 1 correspond to the reference numerals attached to the respective constituent elements in FIG. 12, FIG. 16, FIG. 21, FIG. 22 and FIG. It shows the elements.
  • the character conversion device 1 shown in FIG. 1 may be equipped with the components shown in FIG. 12, FIG. 16, FIG. 21, and FIG. 22, FIG. 30, which are not shown in FIG.
  • the input unit 11 includes a keyboard and the like so that the user can input a desired input character string (phonetic character), selected kanji described later, selected words, and the like. It is configured. Further, the input unit 11 is configured to allow the user to input various instructions to the character conversion device 1.
  • the input unit 11 and the display unit 12 may be used as the operation panel 2 (touch panel).
  • Display control unit 22 displays the kanji, words, and their candidates searched by dictionary search unit 310 on display unit 12 in a selectable manner based on the readings and readings input from input unit 11. .
  • the display unit 12 displays various data under the control of the display control unit 22.
  • the candidate extraction unit 31 includes a dictionary search unit 310.
  • the dictionary search unit 310 searches each dictionary unit (Kanji dictionary unit 41 and the word dictionary unit 42) stored in the storage unit 40, and detects candidate kanji (or words).
  • the dictionary search unit 310 further includes a kanji acquisition unit (head conversion character acquisition unit) 311 and a word acquisition unit (word acquisition unit) 312.
  • the kanji acquisition unit 311 receives the character string of the reading accepted by the input acceptance unit (acceptance means) 21 as an input, and searches the kanji dictionary unit 41 to find the kanji that matches the desired conditions (or, described below.
  • the kanji information is stored in the temporary storage unit 3 as a search result.
  • the word acquisition unit 312 receives the search result of the kanji acquisition unit 311 and the character string of the reading accepted by the input accepting unit 21.
  • the word dictionary unit 42 is searched as a force, and a word matching the desired condition (or word information described below) is stored in the temporary storage unit 3 as a search result. Details of kanji acquisition and word acquisition will be described later.
  • the kanji dictionary unit 41 holds kanji correspondence information 144 in which the yomi and the kanji are associated.
  • a kanji index (kanji index table) 145 in which the first character of reading and the address information of the storage destination in the kanji correspondence information 144 of the kanji whose head is the first character is associated is held.
  • the kanji index 145 is arranged in the order of kanji reading.
  • the word dictionary unit 42 holds word correspondence information 162 in which a word and a reading of the word are associated with each other. Furthermore, a word index (word index table) 161 in which the head character which can be the head of the word and the address information of the storage destination in the word correspondence information 162 of the word having the head character at the head are associated is held. ing. The word correspondence information 162 and the word index 161 are grouped by the first letter of the word! .
  • the kanji correspondence information 144 stores auxiliary information in association with kanji.
  • the word correspondence information 162 stores auxiliary information in association with a word.
  • the auxiliary information includes, for example, the number of characters of reading (for example, the pin-in length (element B) in each of the embodiments described above) (search auxiliary information), the frequency of occurrence (element A described above) (display auxiliary information), etc. .
  • the above auxiliary information is a numerical value obtained by converting the number of characters and the frequency of occurrence of reading into a predetermined range, for example, 1 to 100.
  • kanji information refers to kanji and information (reading and supplementary information) associated with the kanji.
  • word information refers to a word and information (reading and supplementary information) associated with the word.
  • auxiliary information is, of course, not limited to the above, and may be other information.
  • the candidate order determining elements (element C to element G) described in each of the above-described embodiments may be stored as auxiliary information in association with the candidate (kanji Z word).
  • the control unit 10 centrally controls the operation of various components in the character conversion device 1. For example, the number of characters of the character string input from the input unit 11 is counted. Also, the dictionary search unit 310 controls the display control unit 22. In addition, an input receiving unit 21 is provided so that the user can receive a character string input by the input unit 11! Next, the processing operation of the character conversion device 1 will be described using the flowchart shown in FIG.
  • Control unit 10 counts the number of characters of the read character string transferred from input unit 11, and stores the read character string and the number of characters in temporary storage unit 3. Then, the control unit 10 instructs the dictionary search unit 310 to search for kanji (S101).
  • the Chinese character acquisition unit 311 of the dictionary search unit 310 reads the read character string and the number of characters of the read character string from the temporary storage unit 3 into an internal buffer (not shown) of the dictionary search unit 310.
  • the kanji acquisition unit 311 refers to the kanji index 145, determines the address (iNum) of the end position of the search target range to be searched in the kanji correspondence information 144 based on the first character of the read character string, and internally It holds in the buffer (S102). Further, the address (iStart) of the start position of the search target range to be searched in the kanji correspondence information 144 is obtained and held in the internal buffer (S103).
  • the kanji acquisition unit 311 reads kanji information by referring to the kanji correspondence information 144 based on the address held in the internal buffer of the dictionary search unit 310, and reads the kanji correspondence information 144 and the input characters.
  • the column reading is compared (S104). If the reading of the input character string matches the reading in the kanji correspondence information 144 (YES in S104), the kanji acquisition unit 311 stores the kanji information satisfying the reading condition in the temporary storage unit 3. It memorizes (S105).
  • iStart is updated (S106). Thereafter, the kanji acquisition unit 311 compares the iNum stored in the internal buffer with the iS tart updated in S 106 (S 107). As a result of comparison, in the kanji to be searched, iStart becomes iNum or more! /, And (in S 107,! /, NO), notifies the control unit 10 that the kanji search is finished.
  • control unit 10 When control unit 10 receives the notification that the kanji search is completed from dictionary search unit 310, control unit 10 rearranges the kanji search results stored in temporary storage unit 3 in a predetermined order (S 108), Information on the position is added and stored in temporary storage unit 3. Power! ],
  • the display control unit 22 is a display unit 1
  • the kanji search result is displayed at step 2 (S109).
  • sorting in the predetermined order it may be sorted in the order of appearance frequency of the auxiliary information, or as described in the above embodiments, based on the candidate order determination factor (and the evaluation value calculation rule).
  • the candidate order determination unit 32 calculates the evaluation value, and based on it, it may be rearranged!
  • the control unit 10 compares the position information received by the input receiving unit 21 with the display position information of the kanji search result stored in the temporary storage unit 3 and detects the input kanji (S 11). In addition, the control unit 10 transfers the kanji designated by the user to the dictionary search unit 310, and instructs to perform a word search Note that, at this time, the candidate selection reception unit 23 is selected by the user.
  • the kanji may be detected, and the learning control means may store it as a learning kanji in the learning dictionary recording unit 43 or the pair learning dictionary recording unit 44.
  • the word acquisition unit 312 of the dictionary search unit 310 When the word acquisition unit 312 of the dictionary search unit 310 receives a word search instruction, the word acquisition unit 312 refers to the word index 161 and, based on the acquired identification information (character code) of the kanji, in the word correspondence information 162.
  • the address (iNum) of the end position of the search target range to be searched in is determined and stored in the internal buffer (S112). Further, the address (iStart) of the start position of the search target range to be searched in the word correspondence information 162 is obtained and held in the internal buffer (S113).
  • the word acquisition unit 312 reads the word information with reference to the word correspondence information 162, and temporarily stores the word information matching the predetermined condition. It stores in the part 3 (S 114).
  • the word acquiring unit 312 updates iStart for the word to be searched (S 114 a), holds it in an internal buffer, and compares iNum with iStart updated in S 114 a (S 115) o As a result of comparison, when iStart is iNum or more for the search target word (NO in S115), the control unit 10 is notified that the word search has ended.
  • the control unit 10 When the control unit 10 receives the notification that the word search is completed from the dictionary search unit 310, the control unit 10 arranges the word search results stored in the temporary storage unit 3 in a predetermined order (S116), The position information is added and stored in the temporary storage unit 3.
  • the display control unit 22 causes the display unit 12 to display the word search result on the display unit 12 (S117). Note that when sorting in the above-mentioned predetermined order, it may be sorted in the order of appearance frequency of the auxiliary information, or, as described in each of the above-mentioned embodiments, candidates are determined based on candidate order determination elements (and evaluation value calculation rules).
  • the order determination unit 32 may calculate the evaluation value and rearrange it based on it.
  • the user also visually checks a desired word as the word search result displayed on the display unit 12, selects the desired word, and designates and inputs the position where the word is displayed.
  • the input unit 11 transmits position information of the input position to the input reception unit 21 of the control unit 10. At this time, even if the candidate selection receiving unit 23 detects the kanji selected by the user and the learning control means stores it as a learning kanji in the learning dictionary recording unit 43 or the pair learning dictionary recording unit 44.
  • the control unit 10 compares the position information received by the input receiving unit 21 with the information on the display position of the word search result stored in the temporary storage unit 3 to search for a word specified by the user. Then, the display control unit 22 displays the word retrieved by the display unit 12.
  • the kanji dictionary unit 41 and the word dictionary unit 42 are provided with the index and the kanji correspondence information 144 and the word correspondence information 162 to group the words of the same leading kanji.
  • Auxiliary information is added to the correspondence information and the word correspondence information, and the search time can be shortened by limiting the search target kanji and words based on the auxiliary information.
  • the display order of the retrieved kanji characters (words) is likely to be selected by the user by the candidate order determination unit 32 based on the auxiliary information (candidate order determination factor) of the candidate. It is decided that things will be displayed with priority. Therefore, candidate kanji (words) rearranged in the order desired by the user can be displayed in a shorter time. As a result, user convenience can be improved.
  • the Chinese character conversion device 1 described above is converted into a Chinese character (converted character) by inputting pinyin (phonetic characters).
  • pinyin phonetic characters
  • the kanji index 145 is as shown in FIG. 3.
  • the kanji index 145 has the first character of pinyin (the first phonetic character) and the first character of this pinyin as the first of the pinyin of kanji (phonetic information).
  • the kanji address (storage destination information) in the kanji correspondence information 144 of the kanji is associated!
  • the description on the left of the Kanji index 145 (cidx-ml, cidx-m2, etc.) is an address in the vague index 145.
  • cidx-ml is an address in the kanji index 145 of "the beginning of reading is m and one voice".
  • kanji and its pinyin are associated with auxiliary information (in FIG. 4, search conditions and frequency information).
  • auxiliary information in FIG. 4, search conditions and frequency information.
  • the description (cp-ml, cp-m2, etc.) on the left of the kanji correspondence information 144 in FIG. 4 is an address in the kanji correspondence information 144.
  • the word index 161 has a primary index (word primary index table) 161a shown in FIG. 5 and a secondary index (word secondary index table) 16 lb shown in FIG.
  • the primary index 161a the kanji and its code are associated with the address at the secondary index 161b of the word having the kanji as the head.
  • the secondary index 16 lb the top Pinyin of the kanji and the second kanji is associated with the address in the word correspondence information 162.
  • the description on the left of the secondary index 161b (widx-di, widx-gi, etc.) in FIG. 6 is an address in the secondary index 161b.
  • a pin is associated with a word and its reading.
  • the description (wp ⁇ di, wp ⁇ gi, etc.) on the left of the word correspondence information 162 in FIG. 7 is an address in the word correspondence information 162.
  • the primary index 161a is not necessarily required.
  • the character conversion device 1 described above is described as a Chinese conversion device which converts pinyin to kanji, the invention is not limited thereto.
  • a hiragana or katakana
  • the apparatus may be a device that determines a word including
  • the Chinese character acquisition unit 311 of the dictionary search unit 310 reads the read character string "mei" from the temporary storage unit 3 into the internal buffer of the dictionary search unit 310. Then, the dictionary search unit 310 obtains an address in the range of “m” of the kanji index 145 according to the following expression based on the first character “m” of the reading character, and the kanji address described in those addresses Store in buffer.
  • the address is a
  • the unit size of the kanji index (kanji index table) 41 has word index (word index table) 51 in alphabetical order of the first letters of Pinyin, and the first letters of the same pinyin are arranged in the order of 1 to 5 in four voices If configured as above, it will be the byte length of the kanji address. For example, in the device operating as this implementation, it is 4 bytes.
  • the end address is calculated by “(0x6d — 0x61 + 1) Unit size of X kanji index X 5” (ie, cidx — nl in FIG. 3), and each address points
  • the kanji address (cp-ml and cp-nl) described in the kanji index is stored in the internal buffer.
  • cidx-ml in the figure represents the address in the Kanji index 145 of the m-first reading of one reading.
  • the corresponding cp-ml represents the address of the single-speech kanji correspondence information 144 with the m beginning of the reading. cidx-m5, cp-m5 has m at the beginning of the reading and corresponds to a light voice!
  • the dictionary search unit 310 sets the address cpi to be searched as cp ⁇ ml. Next, the dictionary search unit 310 reads the kanji information of the address cpi from the kanji correspondence information 144, and determines the search condition based on the number of input pinyins. As a result, if the search conditions are satisfied, the input reading and the reading of the kanji information are compared, and if the input reading matches the reading of the kanji information, the temporary storage unit 3 is regarded as a match.
  • cpi only.
  • the search condition is as follows.
  • the address of the third voice of “m” is Calculated by the above equation (1), the start address is "(0x6d-0x61) unit size of X kanji index X 5 + (3-1) unit size of X kanji index” (that is, cidx-m3 in Fig. 3), The end address is "(0x6d-0x61) X Kanji index unit size X 5 + 3 X Kanji index unit size” (that is, cidx-m4 in Fig. 3) and described in the Kanji index pointed to by each address! Store the kanji address (cp-m3 and cp-m4) in the internal buffer.
  • the user inputs the pin-in “111” of Chinese reading “extra digit 1,” the kanji search result is stored in the temporary storage unit 3, and the display control unit 22 displays it on the display device. Give instructions from “" and "”, and explain.
  • the primary index 161a of the word dictionary unit 42 is stored in ascending order of kanji code.
  • the dictionary search unit 310 searches for the word index 161 that matches the transferred kanji code, and finds the search target range in the word correspondence information 162. Ru. Specifically, the dictionary search unit 310 searches the primary index 161a for the transferred “extra character 2” by two and obtains a kanji code C3BF that matches “extra character 2”.
  • the dictionary search unit 310 is configured to set the kanji code C3C0 next to C3BF in the primary index 161a (see FIG. 5).
  • the address (word address) “wp ⁇ mi + l” in the word correspondence information 162 of “head digit 6” is the first word, “star 2” is the head. It is stored in the internal buffer as the end address for the word correspondence information 162.
  • the word acquisition unit 312 of the dictionary search unit 310 reads out the word information from “wp ⁇ mi” to “wp ⁇ mi + l”, and uses the kanji string of the word information as the search result as a temporary storage unit 3
  • the candidate order determination unit 32 rearranges the word information (candidate words) in the temporary storage unit 3 in descending order of appearance frequency, which is supplementary information (not shown), and ends the word search process.
  • this rearrangement is a mere illustration and may be rearranged by other methods.
  • the display order may be determined based on the candidate order determination factors (element A to element G).
  • the retrieved and rearranged words are displayed on the display unit 12.
  • Embodiment 5-2 describes the operation when the user inputs word reading to input a word.
  • the embodiment 5-1 while the reading of the first character of the character string to be input is input, in the present embodiment 5.2, there is a difference that the reading of the word to be input is input.
  • the process in which the input receiving unit 21 of the control unit 10 receives the read character string input from the input unit 11 is the same as that of the embodiment 5-1. Therefore, in the present embodiment, the process after the user inputs “meinian” of Pinyin, which is a Chinese reading of “extra character 1” to input “extra character 1” will be described.
  • the control unit 10 has a function of dividing pinyin of the character string of the input reading, and is assumed to be a personal computer. The pinyin division process of the input read character string will be described later.
  • control unit 10 When control unit 10 receives the reading character string "meinian” from input unit 11, it divides the reading character string into pin-ins, and as a division result, it is divided into pinyins by "mei" and "nian". The number of divisions indicating that there are two (or the number of division positions indicating that there are two division positions) 2 is stored in the temporary storage unit 3.
  • the kanji acquisition unit 311 of the dictionary search unit 310 reads the read character string “mei” from the temporary storage unit 3 and stores the read character string “mei” in the internal buffer of the dictionary search unit 310.
  • the kanji acquisition unit 311 refers to the kanji index 145 using the above equation (2) based on the initial letter “m” of “mei” to obtain an address in the range of “m”, and those addresses The kanji address described in is fed to the internal buffer. .
  • the unit size of X kanji index X 5 "(that is, calculated according to cidx ⁇ nl in FIG. 3), referring to kanji index 145 of kanji dictionary section 41, and kanji address indicated by each address (cp — ml , And cp-nl) are stored in the internal buffer.
  • the kanji acquisition unit 311 of the dictionary search unit 310 sets the address cpi to be searched as cp ⁇ ml.
  • the kanji acquisition unit 311 reads kanji information at the address cpi, refers to the kanji correspondence information 144, and determines a search condition based on the number of input pins. As a result, if the search condition is satisfied, the input reading is compared with the reading of the kanji information.
  • the temporary storage unit 3 stores that the complete matching with the kanji information (for example, 1) is associated and stored, and cpi is updated.
  • the input reading matches the reading of the kanji information, it is regarded as a match and it is stored in the temporary storage unit 3 that the kanji information and the leading match (for example, 0) are associated and stored cpi Do.
  • candidate order determination unit 32 rearranges perfect match kanji information among the kanji information stored in temporary storage unit 3 according to the appearance frequency, and Then, the front matching kanji information is rearranged at the appearance frequency so that the candidate order is lower than the perfect matching kanji information, and the kanji search processing is ended.
  • the candidate order determination unit 32 may further determine the display order based on other auxiliary information (eg, candidate order determination factors (element A to element G)).
  • kanji information of “extra character 2” is stored as a search result in the temporary storage unit 3 as kanji information that completely matches “mei” for the read character string “mei”.
  • the kanji search candidate power displayed after the above-described kanji search is described as the user specifying “extra character 2”.
  • the process of candidate selection in response to the instruction of “extra character 2” by the user is the same as the process described in the embodiment 5-1, and therefore the description thereof is omitted.
  • the primary index 161a, the secondary index 161b, and the word correspondence information 162 are arranged in the ascending order of the vague code.
  • the word acquisition unit 312 of the dictionary search unit 310 receives a word search instruction, it refers to the primary index of the word index 161, and the first index information that matches the transferred kanji code.
  • the primary index they are arranged in the kanji code order of the leading kanji of a word, and the word acquisition unit 312 searches for first index information that matches the kanji code transferred by binary search.
  • the first index information matching the transferred kanji code and the first index information of the next kanji code arranged in the primary index are stored in the internal buffer.
  • the secondary index address that matches the kanji code is widx ⁇ mi
  • the secondary index address of the next kanji code is The address is widx-mi + 1, which are stored in an internal buffer.
  • the word acquisition unit 312 of the dictionary search unit 310 reads the second and subsequent character strings of the divided read character string from the temporary storage unit 3 into the internal buffer.
  • the dictionary search unit 310 searches the search range of the word correspondence information 162 for the first divided kanji (first kanji, first kanji) and the first pinyin character of the second divided reading. In order to narrow down, based on the address of the secondary index 161b added to the primary index 161a, the one that matches the first character "n" of the second divided reading is added to the secondary index 16 lb. Search It stores the secondary index 161b searched for in the file and the next second index information arranged in the secondary index 161b.
  • the word acquisition unit 312 of the dictionary search unit 310 appends the address wpi to be searched to the second index information as the address wp-mi of the word information to be searched.
  • the word information from “wp—mi” to “wp—mi + k” is read out.
  • reading of the word information in the word correspondence information 162 corresponds to each kanji of the word. It is assumed that the reading is capitalized at the beginning of the reading corresponding to each kanji of the word as "MeiNian".
  • the word acquisition unit 312 of the dictionary search unit 310 reads the word information of the address wpi, refers to the word correspondence information 162, and compares the number of kanji of the word correspondence information 162 with the number of divisions of the input reading. . If the number of kanji in the word information is equal to or less than the number of divisions in the input reading, the reading and the divided reading corresponding to each kanji in the word information are compared from the beginning, and the number of kanji with exactly matching reading and the leading kanji reading Count the number.
  • temporary memory unit 3 has the number of kanji with perfect matching of word information and reading.
  • the word and the word memorize the number of kanji in the forward match, and wpi is updated. If “Number of Kanji in Word Information” and “Number of Kanji with Perfect Match of Reading + Number of Kanji with Forward Match” do not match, update wpi.
  • the dictionary search unit 310 ranks the word information stored in the temporary storage unit 3 under the following conditions.
  • the word search result is used. However, if the number of kanji in the word information is more than “the number of completely matching kanji + the number of leading kanji reading”, the word search result may be used. With this configuration, it is possible to input only by reading the first few characters of the desired word.
  • the word search is started by the user selecting a desired kanji from the displayed kanji search results.
  • the word search may be started for the first ranked kanji candidate without displaying the kanji search result, and the search result may be displayed when the word search is completed.
  • the word search may be started for some or all of the candidates without displaying the kanji search results, and the search results may be displayed when the word search is completed.
  • the first character strength also sets the i-th character as the first syllable, and the i + 1st character also finds the longest pinyin that holds as a rhyme, a vocal mother and a vocal mother + a rhyme, and makes it a second syllable
  • the first syllable has more characters than the second syllable
  • division processing of “meinian” will be described as a final example of division positions having upper conditions in the order of Whether a force that a part or all of the input pinyin is established as pinyin is registered in the pinyin dictionary and compared with the corresponding pinyin, it is assumed that if there is a matching pinyin, it may be established as pinyin.
  • pin-in dictionary as shown in FIG. 9, pin-ins of lengths 1 to 7 which can be established as pin-ins are registered.
  • input pinyin is a string input as pinyin
  • the position evaluation value calculated at the time of division processing determines the likelihood that the syllable is divided at that position when the division position for dividing the input pin into two syllables is determined. It is a numerical value. This position evaluation value is calculated for each syllable break position. Therefore, the evaluation value calculated for each candidate kanji Z candidate word (the evaluation value indicating the magnitude of the possibility that the candidate kanji Z candidate word is selected by the user) used in the above-described first to fourth embodiments Are distinguished.
  • the division position i of the input pinyin "meinian" is set to 1. Shown in Figure 8 with the first letter 'm' The pinyin dictionary index (pinyin dictionary index table) is referenced, and the first character is “m” and the 1-character pinyin dictionary address pr—ml and the 2 character pinyin dictionary address pr — read m2. Then, it compares with pinyin stored in pr ⁇ ml to pr ⁇ m2 of the Pinyin dictionary. In this case, since "m" exists, it is established as pinyin.
  • pinyin dictionary address pr-e6 with 7 characters and "z" as the first character is “e” with “einian” excluding "m” as the second character and "z” in 7 characters is read, pr — Compare with pinyins stored in e6 to pr-e7. If there is a matching pinyin, the position evaluation value of the first character is 7, but if there is no matching pinyin, one character is subtracted from the last character of the second syllable to create a new second syllable, and similarly, If there is a matching pin-in, compare the length of the second syllable + the length of the first syllable (1 in this case) as the position evaluation value of division position 1. If there is no matching pinyin, one more character is subtracted from the last character of the second syllable, and the process is compared with the Pinyin dictionary, and this is repeated until the position evaluation value is obtained.
  • the pin-in dictionary index table based on the first two characters "me” is compared with the pin-ins stored in the first two characters of the pin-in dictionary address pr-m2 to pr-m3. If “111 ⁇ 2” is present, let the first syllable be “me”. Next, with the exception of "me”, "inian” is taken as the second syllable, and it is similarly searched for whether there is a pinyin that matches the pinyin dictionary.
  • the processing from the top to cutout position 7 is performed, and the division position of the maximum evaluation value is evaluated. In FIG. 10, since the maximum position evaluation value is 7 and the number of division positions for which the maximum position evaluation value is 7 is 1, the final division position is determined to be 3. In FIG.
  • division number 2 for the division position 3 means that “mainian” has one eye division position between the third character i and the fourth character n. Since it is determined here that there is a second division position between the seventh character n and the subsequent character n (there is no character), it indicates that there are two division positions.
  • each Pinyin initial character is composed of seven pieces from 1 to 7 in length.
  • the maximum length of the note sound is 4, the first character of each note sound may be composed of four pieces of 1 to 4 in length.
  • the words of the same leading kanji are grouped, and the kanji dictionary and the word dictionary have information for restricting the search target according to the input pinyin.
  • the kanji dictionary has information as to whether or not it is a search target according to the pinyin length.
  • the word dictionary has the first character of the second character kanji in the index information, divides the Pinyin sequence (n characters of kanji), and the first character of the first character kanji searched for kanji and the second character also have word reading power You can limit the search target. With such a configuration, search is made for items that match the search conditions.
  • the kanji (word) candidates obtained by the above-described search processing may be selected by the user based on the auxiliary information (or the candidate order determining element) associated with each candidate. Since the higher the priority is displayed in order of priority, the response time until the candidate is displayed can be shortened, and in addition, the candidate can be displayed in the order desired by the user. It is possible to further improve the convenience of the user.
  • each block of the character conversion device 1, in particular, the candidate order determination unit 32, the dictionary search unit 310, the display control unit 22, and the control unit 10 may be configured by hardware logic. And may be realized by software using a CPU as follows. Specifically, the recording unit 40, the kanji dictionary unit 41, the word dictionary unit 42, and the temporary storage unit 3 are realized by a storage device such as a memory or a hard disk, and the candidate order determination unit 32, the dictionary search unit 310, the display control unit 22.
  • the control unit 10 is realized by a central processing unit (CPU) of a computer and software executed by the CPU. This software is also stored in the storage device, read out to the memory, and executed by the CPU.
  • CPU central processing unit
  • the character conversion device 1 executes a CPU (central processing unit) that executes instructions of a control program that realizes each function, a ROM (read only memory) with an upper d program frame, and expands the above program
  • a storage device such as a random access memory (RAM), a memory for storing the above programs and various data, and the like are provided.
  • the object of the present invention is to record the program code (the executable program, the intermediate code program, the source program) of the control program of the character conversion device 1 which is software that realizes the functions described above in a computer readable manner. It is also possible to supply the above recording medium to the character conversion device 1 and the computer (or CPU or MPU) reads out and executes the program code recorded on the recording medium.
  • Examples of the recording medium include tape-based media such as a magnetic tape and a cassette tape, magnetic disks such as a floppy disk Z hard disk, and optical disks such as a CD-ROM ZMOZ MD / DVD / CD-R.
  • a disk system, an IC card (including a memory card), a card system such as a Z optical card, or a semiconductor memory system such as a mask ROMZEPROMZEEPROM Z flash ROM can be used.
  • the character conversion device 1 may be configured to be connectable to a communication network, and the program code may be supplied via the communication network.
  • the communication network is not particularly limited.
  • the communication network is not particularly limited.
  • the Internet intranet, extranet, LAN, ISDN, VAN, CATV communication network, virtual private network, telephone network, mobile communication network And satellite communication networks are available.
  • the transmission medium constituting the communication network is not particularly limited.
  • IEEE1394, USB power line carrier, cable TV line, telephone line, wired line such as ADSL line, IrDA, infrared ray such as remote control, Bluetooth (Registered trademark), 802.11 radio, HDR, mobile phone network, satellite link, terrestrial digital network Etc.
  • the present invention can also be realized in the form of a computer data signal embedded in a carrier wave, in which the program code is embodied by electronic transmission.
  • the character conversion device converts an input character string consisting of one or more input phonetic characters into a converted character, and can change the character character of the converted character. In the replacement
  • word correspondence information in which word identification information for identifying a word is associated with word phonetic information indicating the phonetic sound of the word; and (b-2) a head at the beginning of the word
  • the head conversion character identification information for identifying a conversion character, and the word storage destination information indicating the storage destination in the word correspondence information of the word having the head conversion character indicated by the head conversion character identification information A word dictionary having associated word indexes,
  • a word acquisition unit for acquiring the word having the conversion character indicated by the head conversion character identification information at the beginning by referring to the word dictionary based on the acquired head conversion character identification information.
  • the character conversion device displays the first conversion character indicated by the first conversion character identification information acquired by the first conversion character acquisition unit on the display unit in a user-selectable manner.
  • Display control means for receiving the first conversion character selected by the user, and the word acquiring means is based on the first conversion character identification information of the selected first conversion character. Do you want to process to get the word. [0294] According to the above configuration, it is possible to acquire a word based on the head conversion character selected by the user, that is, by narrowing the head conversion character.
  • the conversion character identification information further includes a display for the conversion character indicated by the conversion character identification information.
  • the auxiliary information may be associated, and the display control means may rearrange the head conversion characters based on the display auxiliary information and display the selected character on the display unit by the user.
  • the head conversion characters can be rearranged based on the display auxiliary information, and can be displayed on the display unit in a selectable manner by the user.
  • the display auxiliary information may be, for example, information on the appearance frequency of the vague characters.
  • the display auxiliary information may be any other information as long as the display control means can rearrange the head conversion characters.
  • the word index of the word dictionary includes the head conversion character identification information for each head conversion character to be the head of the word;
  • the storage destination in the word correspondence information may be allocated to the combination with the second phonetic alphabet, which is the first phonetic alphabet of the letter.
  • the word acquiring unit searches for the word from the combination of the head converted character identification information and the second phonetic initial phonetic letter which is the first phonetic alphabet of the second word of the word. Can. Therefore, the search range can be narrowed further and quicker response is possible.
  • the converted character identification information is further acquired for the converted character indicated by the converted character identification information.
  • Auxiliary information may be associated, and the head converted character acquisition unit may acquire the converted character identification information based on the acquired auxiliary information.
  • the lead conversion character acquisition unit can acquire the conversion character identification information based on the acquisition auxiliary information on the conversion character indicated by the conversion character identification information.
  • the acquisition auxiliary information is, for example, information such as the length of a phonetic character of the head conversion character if it is information that assists the search using the conversion character dictionary of the head conversion character acquisition means.
  • the acquisition auxiliary information is It is classified into four information below.
  • the head conversion character acquisition means can acquire the head conversion character quickly and effectively.
  • those having the same conversion character at the beginning of a word may be grouped as one group.
  • the word acquisition means can search for words grouped as a group, and therefore can acquire words quickly and effectively.
  • the words may be arranged in a predetermined arrangement order of the first phonetic characters of the words.
  • the word acquiring means can arrange words in the group in the order of the predetermined arrangement of the phonetic characters at the head of the words, and can acquire more rapid words.
  • the predetermined arrangement order of the phonetic characters at the beginning of the word is, for example, alphabetical order if the word is pinyin, 50-sentence if hiragana, and the like.
  • the converted character is a kanji character
  • the phonetic character at the beginning of the kanji character is the same and the tone code is
  • the same kanji identification information, which is the above-mentioned converted character identification information, may be grouped as a group.
  • the start conversion character acquisition means increases the response speed because! /, It is necessary to search for the start conversion character! / Obtained in the group.
  • the above-mentioned converted character correspondence of the above-mentioned character conversion device used as a Chinese-language conversion device in each of the above groups, the kanji identification information is arranged in a predetermined arrangement order of the top phonetic characters of the kanji.
  • the leading converted character acquisition unit can acquire the leading converted character more quickly.
  • the first phonetic characters of the kanji are arranged in a predetermined arrangement order, and Information indicating the storage destination in the converted character correspondence information may be associated with the first phonetic character of the kanji as the converted character storage information, and may be! / !.
  • the start conversion character acquisition means can narrow down the group from the storage destination in the conversion character correspondence information of the group, and can make a search with a faster response speed.
  • the kanji characters may be simplified characters and the phonetic sounds may be expressed by pinyin.
  • the above kanji characters may be traditional characters, and the above-mentioned phonetic sounds may be expressed by a gush sound.
  • control method of the character conversion device can convert an input character string consisting of one or more input phonetic characters into a converted character and determine a word having the converted character strength.
  • (a-1) conversion character correspondence in which conversion character identification information for identifying a conversion character and conversion character phonetic information indicating the phonetic character of the conversion character are associated Information (a-2) a storage destination of the converted character identification information having the phonogram input and the converted character identification information having the phonogram as the head of the converted character phonetic information, A converted character dictionary having a converted character index associated with the indicated converted character storage destination information, (b-1) word identification information for identifying a word, and word sound information indicating the phonetic sound of the word Word correspondence information in which and are associated, and (b 2) the point of the word Head conversion character identification information for identifying a head conversion character; word storage destination information indicating a storage destination in the word correspondence information of a word having a conversion character at the head indicated by the head conversion character
  • the conversion character identification is made by referring to the conversion character dictionary and having the first phonetic character of the input character string as the first character of the converted character phonetic information.
  • a leading conversion character acquisition step for acquiring information
  • the character conversion device may be realized by a computer.
  • a character conversion program that causes the computer to realize the character conversion device by operating the computer as any of the means in any of the character conversion devices, and a computer readable recording medium storing the character conversion program Even falls within the scope of the present invention.
  • a character conversion device is a character conversion device for converting an input phonetic character or phonetic character string into a corresponding ideographic character and inputting an ideographic character or ideographic character string selected by the user.
  • a character recording unit for recording a phonogram character and an ideographic character corresponding to a phonogram string including the phonogram character, and receiving the phonogram character input by the user, the phonogram character is selected as the phonogram character.
  • Character candidate extraction means for extracting associated ideographic characters from the character recording unit as candidates for ideographic characters
  • Candidate order determination means for determining the display order of candidates extracted by the candidate extraction means based on a predetermined condition
  • display control means for outputting data so as to display the candidates based on the display order determined by the candidate order determination means.
  • the character candidate extraction unit receives the phonetic characters input by the user, and identifies a phonetic character string including the phonetic characters.
  • the phonetic strings correspond to ideographic characters.
  • the correspondence between the phonetic characters and the phonetic character strings, and the correspondence between the phonetic character strings and the ideographic characters are recorded in the character recording unit. Therefore, the character candidate extraction means can extract the candidate of the desired alphabetic character that the user desires to input based on the phonetic characters for which the user's input has been received.
  • the candidate order determination means rearranges the candidates of the ideographic character extracted by the character candidate extraction means on the basis of a predetermined condition, and determines the display order of the candidates.
  • the display control means outputs candidate data so that the ideographic character candidates are displayed in the display order determined by the candidate order determination means.
  • the user can input the ideographic character only by designating the desired ideographic character from the displayed candidates. From the above, the user can obtain desired ideographic character input with a smaller number of operations regarding the number of operations related to the input of phonetic characters.
  • phonetic characters are alphabets and ideographic characters are Chinese kanji. That is, the alphabet string is pinyin, and one pin corresponds to one or more kanji. In the character recording section, correspondences between alphabetic pinyin and pinyin-kanji are recorded.
  • the character candidate extraction unit refers to the character recording unit, identifies pinyin starting from the alphabet, and identifies one or more kanji corresponding to the identified pinyin. Extract as a candidate.
  • the kanji extracted as the above candidate is presented to the user.
  • the user may also specify the desired kanji for the presented candidate power. That is, after inputting the alphabet, it becomes possible to specify and obtain a desired kanji that does not require an operation to select the corresponding pinyin.
  • the phonogram may be a punctuation character and the ideographic character may be a traditional ideographic character. That is, the phonetic character string is a gushing sound, and one or more traditional characters correspond to one gushing sound.
  • the correspondence relationship between the vowels of vowels and the traditional vignettes of the vowels should be recorded.
  • the character conversion apparatus further receives a character string recording unit that records the ideographic character and the ideographic character string including the ideographic character in association with each other, and the ideographic character selected by the user.
  • Character string candidate extraction means for extracting an ideographic character string associated with an ideographic character from the character string recording unit as an ideographic character string candidate may be provided.
  • the character conversion device of the present invention converts the input phonetic character or phonetic character string into the corresponding ideographic character, and inputs the ideographic character selected by the user.
  • a character string recording unit for recording the ideographic character in association with an ideographic character string including the ideographic character, and an ideographic character selected by the user are accepted, and the ideographic character string associated with the ideographic character is Character string candidate extraction means for extracting from the character string recording unit as candidates for figurative character strings, candidate order determination means for determining the display order of candidates extracted by the character string candidate extraction means based on a predetermined condition, and And display control means for outputting data so as to display candidates based on the display order determined by the candidate order determination means.
  • the character string candidate extraction unit extracts an ideographic character string associated with the ideographic character as a candidate from the character string recording unit in response to the selection of the ideographic character by the user.
  • a kanji When a kanji is input, the character following the kanji (the kanji desired by the user) is more likely to be a kanji constituting a idiom in which the kanji power also starts than a kanji without any context. Therefore, if a idiom that starts from a kanji whose input has been confirmed is displayed as a candidate, the user only inputs Chinese language without separately entering pinyin simply by selecting the desired idiom of the idiom of the candidate of the idiom. It will be possible.
  • the character conversion apparatus further includes a writing character input unit for inputting ideographic characters written by the user, recognizes the ideographic characters input to the writing character input unit, and obtains information of the ideographic characters. It is preferable that a character recognition unit supplied to the character string candidate extraction unit is provided.
  • the character candidate extraction means of the character conversion device when receiving the phonetic characters input by the user, may extract the ideographic character corresponding to the continuously received phonetic character string as a candidate. Preferred.
  • the character candidate extraction means can narrow down and extract the number of corresponding ideographic character candidates significantly, and the display control means can output the narrowed-down ideographic character candidates.
  • the character conversion device is configured such that the probability that the displayed ideographic character candidate contains the ideographic character desired by the user is increased each time the phonetic character is input by the user. can do.
  • a character conversion device capable of simplifying input operations of phonetic characters to obtain desired ideographic characters.
  • the candidate order determination means of the character conversion device is configured such that the candidate of the ideographic character string extracted by the character string candidate extraction means is more than the candidate of the ideographic character extracted by the character candidate extraction means. It is preferable to determine the display order so as to be displayed preferentially.
  • the candidate ideographic character string extracted by the character string candidate extraction unit from the character string recording unit is preceded by the candidate ideographic character extracted by the character candidate extraction unit from the character recording unit.
  • the candidate order determining means determines the display order of the candidates to be displayed.
  • the character conversion apparatus further includes a learning control unit that records the ideographic character selected by the user as a learning character in the learning word recording unit, and the candidate order determining unit is extracted by the character candidate extraction unit. If the selected candidate is a learning character recorded in the learning word recording unit, the display order may be determined such that the candidate is displayed with priority.
  • a learning control means for recording the ideographic character string selected by the user as a learning character string in the learning word recording unit is provided, and the candidate order determination means extracts the character string candidate
  • the display order may be determined so that the candidate is displayed with priority.
  • the learning control means causes the learning word recording unit to record the ideographic character or ideographic character string whose input has been determined (desired by the user) in the character conversion device as a learning character (string).
  • the candidate order determining unit learns, of the extracted candidates, those recorded in the learning word recording unit. It can be judged as a character. That the candidate is a learning character means that the ideographic character (string) desired by the user in the past is more likely to be desired by the user than the candidate who is not a learning character.
  • the character conversion device further associates the ideographic character selected by the user with the immediately preceding character or the immediately preceding character string input immediately before the ideographic character, and sets the learning word recording unit as a pair learning character
  • the candidate order determination means is a pair learning character recorded in the learning word recording section, with the candidate extracted by the character candidate extraction means being associated with the immediately preceding character or the immediately preceding character string.
  • the display order may be determined such that the candidates are displayed in priority.
  • the ideographic string selected by the user is input immediately before the ideographic string!
  • a learning control means for recording in the learning word recording section as a pair learning character string in association with the immediately preceding character or the immediately preceding character string and the candidate order determining means is a candidate extracted by the character string candidate extraction means.
  • the display order may be determined such that the candidate is displayed with priority.
  • the learning control means causes the learning word recording unit to record the ideographic character or ideographic character string whose input has been determined (desired by the user) in the character conversion device as a learning character (string).
  • the input character is already determined immediately before the learning character (string) ) (Hereafter, it is referred to as the immediately preceding character (string)) and recorded as a pair learning character (string).
  • the paired learning character (string) follows the associated immediately preceding character (string), which means that there is a history desired by the user. Therefore, when the preceding character (string) is input, the pair learning character (string) is likely to be desired by the user.
  • the candidate order determination means changes the display order of the candidate to higher rank.
  • the learning control means of the character conversion device may use the learning character or learning character string as a learning word, or a group identifier for classifying the pair learning character or pair learning character string.
  • the group identifier is generated in association with each learning word and recorded in the above-described learning word recording unit, and the group identifier is assigned a predetermined length previously assigned to the ideographic letters constituting the learning word. Preferably, it is generated based on the character code of
  • the learning control means determines the learning word (learning character or learning character string, or pair learning character or pair learning character string) recorded in the learning word recording unit. Generate and assign a group identifier to classify into a plurality of groups. The learning control means generates the group identifier using a numerical value determined based on the character code, which is assigned in advance to each ideographic character constituting the learning word.
  • the group identifier generated in this way has a fixed length no matter what learning word is assigned. That is, as a specific example, even if it is a learning word consisting of a two-character idiom or a learning word consisting of a six-character idiom, group identifiers of the same size are assigned to them.
  • the data size increases as the number of learning words recorded in the learning word recording section increases, and in the case of idiom, as the number of characters increases. That is, it takes a great deal of time to search for matching learning words from the learning word recording section to determine whether the candidate is a learning word or not, and the processing efficiency of character input decreases. Occurs.
  • a fixed-length group identifier is assigned to each learning character, whether the input character (string) and the learning word match or are characters of the same group (string) in the search
  • the time to determine the score is uniform regardless of the number of characters in the learning word. From the above, it is possible to suppress the proportional increase in the time spent on the search with respect to the increase in the number of learning words, and the influence on the decrease in processing efficiency can be reduced.
  • the above-mentioned character conversion device further associates the ideographic character or ideographic character string selected by the user with the forward character or the forward character string whose input is determined prior to the ideographic character or ideographic character string.
  • the candidate order determination unit determines the display order so that the candidate is displayed with higher precedence as the degree of association with the front character or the front character string is higher.
  • the candidate order determination means determines the display order of the candidates based on the degree of association with the forward character (string).
  • the degree of association refers to the ideographic character or ideographic character string whose input is determined by the character input determination means and the forward character or character string whose input is determined before the ideographic character or ideographic character string.
  • the candidate order determination means Considering the relationship between "Shanghai” and “University”, change "University” to the top of the candidate.
  • the candidate order determining means can consider candidate ideographic characters in consideration of not only continuity with the immediately preceding character (string) but also with the preceding character (string). Since the display order of (rows) is determined, it is possible to present the candidates in an order that reflects the user's wishes more
  • the character conversion device sets a candidate order determination element for setting a condition designated by the user, which is a predetermined condition to be referred to when the candidate order determination means determines the display order of the candidates.
  • the candidate order determination means preferably determines the display order of the candidates based on the conditions set by the candidate order determination element setting means, According to the above configuration, when there are a plurality of predetermined conditions to be referred to when the candidate order determination means determines the display order of candidates, based on which conditions the display order is determined Is set by the candidate order determining element setting means in accordance with the user's instruction.
  • the candidate is a character (Kanji) or a character string (Idiom) or (2) the ability is a learning character candidate (3) the ability is a paired learning character 4) Force when learned (5)-Generally how often it is used, (6) Phonetic character length entered as current reading matches the candidate phonetic character length (For example, “Technology” has been entered !, and in the case where “Giji (2 letters)” and “Gijutsu (4 letters)” have different phonetic character lengths), (7) Continuity with a string is a force, (8) a relationship with a forward string is a force, (9) how often it is used by the user, and various conditions can be considered. .
  • candidate order determination The display order of the candidates determined by the defining means more accurately reflects the user's wishes.
  • the user can obtain the desired ideographic character input with a smaller number of operations relating to the input of the phonetic characters.
  • the copying machine is provided with the above-described character conversion device and a key corresponding to each phonetic character, and supplies, to the character conversion device, a phonetic character corresponding to a key pressed by the user.
  • the candidate order determination means defines one input part, a second input part for inputting the candidate selected by the user to the character conversion device, and a candidate to be extracted by the character candidate extraction means or the character string candidate extraction means.
  • a display unit configured to display in accordance with the display order, wherein the first input unit and the second input unit, and the display unit are touch panels or tablets integrally formed with each other.
  • the user is provided with the first and second input units for inputting phonetic characters and ideographic characters selected by the user into the character conversion device, and ideographic characters (strings) as candidates to the user.
  • the user can obtain desired ideographic character input with a smaller number of operations relating to the input of phonetic characters. It becomes.
  • the copying machine further includes a writing character input unit for inputting ideographic characters written by the user, recognizes the ideographic characters input to the writing character input unit, and detects information on the ideographic characters. It is preferable to include character recognition means for supplying the above character conversion device.
  • the character input method converts an input phonetic character or phonetic character string into a corresponding ideographic character, and inputs an ideographic character or ideographic character string selected by the user.
  • the ideographic character input by the user is accepted, and the ideographic character associated with the phonogram character corresponds to the phonogram character and the phonogram character string including the phonogram character.
  • ideographic character candidates corresponding to the phonetic character string including the phonetic characters input by the user are extracted, and in the second step, display of the extracted candidates is performed The order is determined.
  • the display order may be determined based on, for example, the predetermined conditions as described above.
  • ideographic character candidates are output based on the determined display order.
  • the character conversion device and the copying machine are realized by a computer
  • the character conversion device may be operated by operating the computer as the respective means.
  • a control program that causes a computer or a copying machine to be realized by a computer, and a computer-readable recording medium recording the same also fall within the scope of the present invention.
  • a character conversion device is a character conversion device for converting an input phonetic character or phonetic character string into a corresponding ideographic character and inputting an ideographic character or ideographic character string selected by the user.
  • a character recording unit for recording a phonogram character and an ideographic character corresponding to a phonogram string including the phonogram character, and receiving the phonogram character input by the user, the phonogram character is selected as the phonogram character.
  • Character candidate extraction means for extracting the associated ideographic character from the character recording unit as an ideographic character candidate, candidate order determination means for determining the display order of the candidates extracted by the candidate extraction means, and the candidate order determination means And display control means for outputting data so as to display candidates based on the display order determined by the ideographic character, the ideographic character or ideographic character string input in the past, and the ideographic character or the ideographic character string Furthermore, in correspondence with the forward character or forward character string input further forward, the degree of association indicating the height of the possibility that the ideographic character or ideographic string is input behind the forward character or forward character string is The candidate order determination means detects the degree of association between each candidate and the forward character or string that has been input ahead of the candidate from the relevance degree recording unit. And the display order is determined such that the candidate with the higher degree of association is displayed with priority.
  • the character conversion device of the present invention converts the input phonetic characters or phonetic character strings into corresponding ideographic characters, and inputs the ideographic characters selected by the user.
  • a character string recording unit that associates and records an ideographic character with an ideographic character string including the ideographic character, and receives an ideographic character selected by the user, and is associated with the ideographic character!
  • the ideographic character A character string candidate extraction unit that extracts a string from the character string recording unit as an ideographic character string candidate, a candidate order determination unit that determines the display order of the candidates extracted by the character string candidate extraction unit, the candidate And a display control means for outputting data to display candidates based on the display order determined by the order determination means, and an ideographic character or ideographic character string input in the past, and the ideographic character or ideographic character string from immediately before Et al in in association with the front character or forward string entered forward, the front character or Or a degree of association indicating the degree to which the ideogram or the ideographic character is likely to be input is stored in the degree-of-relevance recording unit after the forward character string, and the candidate order determining unit
  • the unit detects the degree of association between each candidate and the forward character or forward character string input to the front of the above candidate, and the display order is prioritized so that the candidate with the higher degree of association is displayed with priority. It is characterized by deciding.
  • the user can obtain desired ideographic character input with a smaller number of operations relating to the input of the phonetic characters.
  • the ideographic character or ideographic character string input in the past is associated with the ideographic character or the immediately preceding character or character string immediately prior to the ideographic character string, and the immediately preceding character or the immediately preceding character string is associated.
  • the degree of continuity indicating the height of the possibility that the ideogram or ideographic character string is input is stored in the continuity recording unit immediately after the It is preferable to determine the display order so that candidates with higher relevance and higher continuity with the preceding character or preceding character string are displayed preferentially.
  • the character candidate extraction unit of the character conversion device each time the character candidate extraction unit of the character conversion device receives a phonetic character input by the user, the character candidate extraction unit concatenates the phonetic character to the end of the already-received phonetic character string, Preferably, ideographic characters corresponding to the obtained phonetic character string are extracted as candidates.
  • the learning control means of the character conversion device is the learning character or learning character string as a learning word to be recorded in the learning word recording unit, or the pair learning character or pair learning
  • a group identifier for classifying a character string is generated, the group identifier is associated with each learning word, and is recorded in the learning word recording unit, and the group identifier is assigned in advance to ideographic characters constituting the learning word. Preferably, it is generated based on the character code of a predetermined length.
  • an evaluation indicating the magnitude of the possibility that the candidate is selected by the user is a predetermined condition to be referred to when the candidate order determination means determines the display order of the candidates.
  • a candidate order determining element setting unit configured to set an element designated by the user from a plurality of types of elements that can be used for calculating the value; An element different from the degree of association is stored in the element recording unit, and the candidate order determination unit determines the degree of association, and an element stored in the element storage unit set by the candidate order determination element setting unit. It is preferable to determine the display order of the candidates based on
  • the character input method converts an input phonetic character or phonetic character string into a corresponding ideographic character, and inputs an ideographic character or ideographic character string selected by the user.
  • the character conversion device includes character candidate extraction means, candidate order determination means, and display control means, and the character candidate extraction means uses phonetic characters input by the user.
  • the first step of extracting as a character candidate, the second step of determining the display order of the candidates extracted in the first step by the candidate order determining means, and the second step of the display control means Decision A third step of outputting data to display candidates based on the display order, the second step including an ideographic character or ideographic character string input in the past, and the ideographic character or ideographic character string Relevance indicating the height of the possibility that the ideogram or ideographic character string will be input after the front character or the front character string in association with the front character or the front character string input further to the front than immediately before
  • the degree of association between the preceding character or the front character string input ahead of the above candidate and each candidate is detected from the degree of association recording section that stores the It is characterized as including the step of determining the display order.
  • a character in a character conversion device for converting an input phonetic character or phonetic character string into a corresponding ideographic character and inputting an ideographic character to be selected by the user
  • the character conversion device includes character candidate extraction means, candidate order determination means, and display control means, and the character candidate extraction means receives an ideographic character selected by the user.
  • a character that records the ideographic character string associated with the ideographic character in association with the ideographic character and the ideographic character string including the ideographic character A first step of extracting a candidate for an ideographic character string from the column recording unit; a second step of determining the display order of the candidates extracted in the first step by the candidate order determination means; and the display control means , And the third step of outputting data to display candidates based on the display order determined in the second step, and the second step includes the ideographic character or ideographic character string input in the past.
  • the ideographic character or ideographic character string is associated with the ideographic character or the forward character string that is input ahead of the ideographic character or ideographic character string,
  • the relevance degree recording unit that stores the degree of relevance that indicates the likelihood of being input, detects the degree of relevance between each candidate and the front character or forward character string input to the front of the candidate, Candidate with high degree of association It is characterized by comprising the step of determining the display order to appear in throat priority.
  • the present invention can be applied to various devices for converting readings (phonetic characters), in particular, converting readings into kanji.

Abstract

A character converting device (1) comprises a kanji character dictionary unit (41) containing a kanji character correspondence information (144) including kanji characters and their readings associated with the kanji characters and kanji character indices (145) in which the first characters of pin-ins are associated with address information on the storage locations of the kanji characters the first parts of which are the first characters in the kanji character corresponding information (144), a word dictionary unit (42) containing word correspondence information (162) including words and their readings associated with the words and word indices (161) in which the first characters of the words are associated with the address information on the storage locations of the words the first parts of which are the first characters in the word correspondence information (162), and word acquiring means which refers to the kanji character dictionary unit (41) and the word dictionary unit (42) according to the first character of the first pin-in of an input character string and acquires a word corresponding to the input character string. The time required to search for a word corresponding to the input reading can be shortened.

Description

明 細 書  Specification
文字変換装置、 文字変換装置の制御方法 技術分野  Character conversion device, control method of character conversion device
[0001] 本発明は、ユーザが、キーボードゃタツチパネル、タブレット、マウス等のポインティ ングデバイスを使って文字あるいは文字列を入力するための装置であり、変換指示 を受け付けて、入力された文字あるいは文字列に応じて、変換された文字あるいは 文字列を選択可能に表示し、この表示された文字あるいは文字列をユーザが選択す ることで所望の文字あるいは文字列の入力が行われる文字変換装置に関するもので ある。  The present invention is an apparatus for a user to input characters or a character string using a pointing device such as a keyboard, a touch panel, a tablet, and a mouse, and receives a conversion instruction and receives the character or characters input. A character conversion device related to a character conversion device, which selectively displays a converted character or character string according to a column and the user can select a displayed character or character string to input a desired character or character string. It is a thing.
背景技術  Background art
[0002] 日本語や中国語などの漢字や単語を、ユーザがキーボードを用いて情報機器へ文 字入力し表示させるには、漢字や単語に対する読み (表音文字)を入力し、変換指示 操作を行うことで実現されている。読みを漢字に変換するには、読みと漢字や単語と を関連付けた辞書データが必要であり、辞書データの語彙数の量が情報機器 (入力 システム)の'性能の一面となって 、る。  [0002] In order for the user to input kanji and words such as Japanese and Chinese into the information device using the keyboard and display them, the user can input phonetic characters (phonetic characters) for kanji and words, and input conversion instructions It is realized by doing. In order to convert Yomi to Kanji, dictionary data that associates Yomi with Kanji or words is required, and the amount of vocabulary of dictionary data becomes an aspect of 'performance' of information equipment (input system).
[0003] しかし、語彙数が多くなると、情報機器では、入力された読みに一致する漢字を辞 書データ力 検索する時間が増加する。そのため、ユーザが読みを入力し変換を指 示してから、変換結果を表示するまでのレスポンスが悪くなり、結果的に情報機器の 使いにくさに繋がる。  However, when the number of vocabulary increases, the information device increases the time to search the lexical data ability for the kanji that matches the input reading. Therefore, after the user inputs a reading and instructs conversion, the response from displaying the conversion result is deteriorated, and as a result, it is difficult to use the information device.
[0004] そこで、漢字や単語の検索時間を削減するための工夫が行われている。例えば、 辞書中の単語について、読みの先頭文字毎にその先頭文字の読みで始まる単語の 読みの最大文字数を索引情報中に持たせることで、入力文字列に対する検索対象 の制限を行い、検索に関する処理量の削減を行う変換装置がある(例えば、特許文 献 1参照)。  Therefore, devices have been developed to reduce the search time for kanji and words. For example, for the words in the dictionary, the maximum number of characters of the reading of the word starting with the reading of the first character is added to the index information for each first character of reading to restrict the search target for the input character string. There are conversion devices that reduce the amount of processing (see, for example, Patent Document 1).
特許文献 1 :日本国公開特許公報特開平 6— 149787公報(1994年 5月 31日公開) 発明の開示 [0005] し力しながら、上記従来の装置では、入力された読みの文字数が、索引情報に記 録されている入力読み文字列の先頭文字の読みで始まる単語の読みの最大文字数 以内であれば、その読みの範囲の全辞書データと一致しているか否かをくまなくチヱ ックする必要がある。そのため、変換できる語彙数を増やすために辞書データに収録 する語彙数が増加すると、先頭読み文字毎の漢字、単語数が多くなり、前記条件の 時の処理時間が増加する。結局、表示出力までのレスポンスが低下してしまい、情報 機器は使いにく 、ものとなってしまう。 Patent Document 1: Japanese Patent Laid-Open Publication No. Hei 6-149787 (May 31, 1994 published) Disclosure of Invention On the other hand, in the above-described conventional apparatus, the number of characters of the input reading is within the maximum number of characters of the reading of the word starting with the reading of the first character of the input reading character string recorded in the index information. For example, it is necessary to check whether or not all the dictionary data in the range of the reading match. Therefore, if the number of vocabulary words included in the dictionary data increases to increase the number of vocabulary that can be converted, the number of kanji characters and words per leading reading character increases, and the processing time under the above conditions increases. As a result, the response to the display output is degraded, making the information device useless and expensive.
[0006] 本発明は、上記の問題点に鑑みてなされたものであり、その目的は、入力された表 音文字に対する単語の検索時間を短縮できる文字変換装置、文字入力方法、文字 入力プログラムおよびコンピュータ読取可能な記録媒体を実現することにある。  The present invention has been made in view of the above problems, and an object thereof is to provide a character conversion device, a character input method, a character input program, and a character conversion device capable of shortening a word search time for input phonetic characters. A computer readable recording medium is provided.
[0007] 本発明に係る文字変換装置は、上記課題を解決するために、入力された 1以上の 表音文字よりなる入力文字列を変換文字に変換し、当該変換文字力 なる単語を決 定できる文字変換装置において、  In order to solve the above problems, the character conversion device according to the present invention converts an input character string consisting of one or more input phonetic characters into a converted character, and determines a word that is the converted character. Character conversion device that can
(a- 1)変換文字を識別するための変換文字識別情報と、該変換文字の表音を示 す変換文字表音情報とが対応付けられた変換文字対応情報および、(a— 2)入力さ れる表音文字と、該表音文字を上記変換文字表音情報の先頭として持つ上記変換 文字識別情報の、上記変換文字対応情報での格納先を示す変換文字格納先情報 とが対応付けられた変換文字インデックス、を有する変換文字辞書と、  (a-1) converted character correspondence information in which converted character identification information for identifying converted characters and converted character phonetic information indicating phonetic characters of the converted characters are associated; (a-2) input And the converted character storage destination information indicating the storage destination in the converted character correspondence information of the converted character identification information having the phonetic character as the head of the converted character Converted character dictionary, which has converted character index,
(b— 1)単語を識別するための単語識別情報と、該単語の表音を示す単語表音情 報とが対応付けられた単語対応情報および、(b— 2)単語の先頭となる先頭変換文 字を識別するための先頭変換文字識別情報と、該先頭変換文字識別情報が示す先 頭変換文字を先頭に持つ単語の、上記単語対応情報での格納先を示す単語格納 先情報とが対応付けられた単語インデックス、を有する単語辞書と、  (b-1) word correspondence information in which word identification information for identifying a word is associated with word phonetic information indicating the phonetic sound of the word; and (b-2) a head at the beginning of the word The head conversion character identification information for identifying a conversion character, and the word storage destination information indicating the storage destination in the word correspondence information of the word having the head conversion character indicated by the head conversion character identification information A word dictionary having associated word indexes,
(c)入力された上記入力文字列の先頭の表音文字を基に、上記変換文字辞書を 参照し、該入力文字列の先頭の表音文字を上記変換文字表音情報の先頭として持 つ変換文字識別情報を取得する先頭変換文字取得手段と、  (c) referring to the converted character dictionary based on the first phonetic character of the input character string inputted, and holding the first phonetic character of the input character string as the first character of the converted character phonetic information First converted character acquisition means for acquiring converted character identification information;
(d)取得された上記先頭変換文字識別情報を基に、上記単語辞書を参照し、該先 頭変換文字識別情報が示す変換文字を先頭に持つ単語を取得する単語取得手段 と、を備えていることを特徴としている。 (d) A word acquiring unit which refers to the word dictionary based on the acquired head converted character identification information and acquires a word having the converted character indicated by the head converted character identification information at the head And are characterized.
[0008] 上記構成によると、初めに先頭変換文字取得手段が、変換文字辞書を参照に、入 力文字列の先頭の表音文字から、変換文字を検索し、次に、単語取得手段が、単語 辞書を参照に、先頭変換文字取得手段が取得した変換文字を先頭に持つ単語を取 得することができる。よって、単語の検索処理に係る時間を短縮できる。つまり、入力 文字列と表音 (読み)情報との比較を行う回数が制限できるので、応答速度の速い文 字変換装置を提供することができる。  [0008] According to the above configuration, the first conversion character acquisition means first searches for the conversion character from the first phonetic character of the input character string with reference to the conversion character dictionary, and then the word acquisition means With reference to the word dictionary, it is possible to acquire a word having the conversion character acquired by the start conversion character acquisition means at the beginning. Therefore, the time concerning the search process of a word can be shortened. That is, since the number of comparisons between the input character string and the phonetic (reading) information can be limited, it is possible to provide a character conversion device with high response speed.
[0009] ここで、変換文字とは、変換された後の文字を示して 、る。変換文字が漢字であると すると、表音文字 (読み)の先頭文字は次のようになる。例えば、中国語で、「  [0009] Here, the conversion character indicates the character after conversion. Assuming that the converted character is kanji, the first character of the phonetic character (reading) is as follows. For example, in Chinese
[0010] [表 1] 每年 '  [0010] [Table 1] leap year '
[0011] (以下では「外字 1」とする)」の読みを表記すると「meinian」であり、「m」が読みの先 頭文字にあたる。また、日本語で、「毎年」の読みを表記すると、「まいとし」なので、先 頭文字は「ま」である。  [0011] When the reading of (in the following, "external 1") is written, it is "meinian", and "m" is the first letter of reading. Also, in Japanese, when the reading of "Annual" is written, the first letter is "Ma" because "Maitoshi".
[0012] また、先頭変換文字は、例えば「外字 1」では、「  [0012] Also, for example, in the first conversion character, the first conversion character
[0013] [表 2] 每  [0013] [Table 2] 每
[0014] (以下では「外字 2」とする)」に当たる。変換文字表音情報は、例えば「外字 1」では、 「memian」で to 。  (Hereinafter referred to as “extra character 2”). Converted phonetic information is, for example, “memian” to “for“ 1 ”.
[0015] なお、変換文字識別情報は、変換後の文字である変換文字を識別できる情報であ ればどのようなものでもよぐ変換文字そのものであってもよい。例えば、変換文字が 漢字である場合、漢字そのものであってもよいし、漢字コードであってもよい。これは 、単語識別情報と、先頭変換文字識別情報とについても同様である。  [0015] The converted character identification information may be a converted character itself which is any information that can identify a converted character which is a character after conversion. For example, if the converted character is kanji, it may be kanji itself or kanji code. The same applies to the word identification information and the head conversion character identification information.
[0016] 本発明のさらに他の目的、特徴、および優れた点は、以下に示す記載によって十 分わ力るであろう。また、本発明の利益は、添付図面を参照した次の説明で明白にな るであろう。  [0016] Other objects, features and advantages of the present invention will be made clear by the description given below. Also, the benefits of the present invention will be apparent from the following description with reference to the accompanying drawings.
図面の簡単な説明 [図 1]本発明の一実施形態を示すものであり、文字変換装置の要部構成を示すプロ ック図である。 Brief description of the drawings FIG. 1 shows an embodiment of the present invention, and is a block diagram showing the main configuration of a character conversion device.
[図 2]上記文字変換装置が行う文字の変換についての処理のフローを示す図である  [FIG. 2] It is a figure showing the flow of processing about conversion of the character which the said character conversion device performs.
[図 3]本発明の実施例に係る漢字インデックスを説明する図である。 FIG. 3 is a diagram for explaining a kanji index according to an embodiment of the present invention.
[図 4]本発明の実施例に係る漢字対応情報を説明する図である。  FIG. 4 is a view for explaining kanji correspondence information according to an embodiment of the present invention.
[図 5]本発明の実施例に係る 1次インデックスを説明する図である。  FIG. 5 is a diagram for explaining a primary index according to an embodiment of the present invention.
[図 6]本発明の実施例に係る 2次インデックスを説明する図である。  FIG. 6 is a diagram for explaining a secondary index according to an embodiment of the present invention.
[図 7]本発明の実施例に係る単語対応情報を説明する図である。  FIG. 7 is a diagram for explaining word correspondence information according to an embodiment of the present invention.
[図 8]本発明の実施例に係るピンイン辞書インデックスを説明する図である。  FIG. 8 is a diagram for explaining a Pinyin dictionary index according to an embodiment of the present invention.
[図 9]本発明の実施例に係るピンイン辞書を説明する図である。  FIG. 9 is a diagram for explaining a Pinyin dictionary according to an embodiment of the present invention.
[図 10]本発明の実施例に係るピンイン分割処理例を表した図である。  FIG. 10 is a diagram showing an example of pin-in division processing according to an embodiment of the present invention.
[図 11]本発明の実施例に係るピンイン分割処理において、確定したピンインを除くピ ンインの分割処理例を表した図である。  FIG. 11 is a diagram showing an example of pin-in division processing excluding pin-in determined in the pin-in division processing according to an embodiment of the present invention.
[図 12]本発明の実施形態に係る文字変換装置の要部構成を示すブロック図である。  FIG. 12 is a block diagram showing the main configuration of a character conversion device according to an embodiment of the present invention.
[図 13]本発明の文字変換装置を適用したマルチファンクションプリンタの概略構成を 示すブロック図である。 FIG. 13 is a block diagram showing a schematic configuration of a multifunction printer to which the character conversion device of the present invention is applied.
[図 14]文字変換装置の操作パネルに表示される、文字入力処理時の操作画面の一 例を示す図である。  FIG. 14 is a view showing an example of an operation screen at the time of character input processing, displayed on the operation panel of the character conversion device.
[図 15]文字変換装置の候補抽出部が候補漢字を抽出するときに参照する、漢字辞 書記録部に記録されている漢字辞書の例を示す図である。  FIG. 15 is a view showing an example of a kanji dictionary recorded in the kanji dictionary recording unit, which is referred to when the candidate extraction unit of the character conversion device extracts candidate kanji.
[図 16]本発明の他の実施形態に係る候補熟語を抽出する文字変換装置の要部構成 を示すブロック図である。  FIG. 16 is a block diagram showing a main configuration of a character conversion device for extracting candidate idiom according to another embodiment of the present invention.
[図 17]文字変換装置の操作パネルの表示部に表示される候補熟語の例を示す図で ある。  FIG. 17 is a diagram showing an example of candidate idiom displayed on the display unit of the operation panel of the character conversion device.
[図 18]熟語辞書記録部が記録する熟語辞書の例を示す図である。  [FIG. 18] A diagram showing an example of a idiom dictionary recorded by the idiom dictionary recording unit.
[図 19]本発明の他の実施形態に係る文字変換装置の処理の流れを示すフローチヤ ートである。 圆 20]操作パネル上の候補文字表示領域および確定文字表示領域の表示例を示 す図である。 FIG. 19 is a flowchart showing a process flow of the character conversion device according to another embodiment of the present invention. ] 20] is a diagram showing a display example of a candidate character display area and a confirmed character display area on the operation panel.
[図 21]本発明の他の実施形態に係る学習機能を備えた文字変換装置の要部構成を 示すブロック図である。  FIG. 21 is a block diagram showing the main configuration of a character conversion device provided with a learning function according to another embodiment of the present invention.
[図 22]候補順序決定部 32のさらに詳細な構成を示すブロック図である。  FIG. 22 is a block diagram showing a more detailed configuration of a candidate order determination unit 32.
圆 23]連結度記録部に記録される、漢字 (熟語)ごとの直前 (前方)文字列との連結 度を示した連結度テーブルの例を示す図である。 FIG. 23 is a diagram showing an example of a connectivity degree table indicating the connectivity degree with the immediately preceding (forward) character string for each kanji (idiom) recorded in the connectivity degree recording unit.
圆 24]文字変換装置の候補順序決定部が表示順序を決定するときの、候補の分類 手順を示す図である。 FIG. 24 is a diagram showing a classification procedure of candidates when the candidate order determination unit of the character conversion device determines the display order.
圆 25]候補ごとに算出した表示順序の評価値を記憶するデータ構造の一例を示す 図である。 FIG. 25 is a diagram showing an example of a data structure for storing evaluation values of display order calculated for each candidate.
[図 26]候補順序決定要素記録部に記録される優先度情報の一例を示す図である。 圆 27]操作パネルの文字入力枠に筆記された漢字の例、および、該漢字の座標情 報の例を示す図である。  FIG. 26 is a diagram showing an example of priority information recorded in a candidate order determination element recording part. Fig. 27 is a diagram showing an example of a kanji written in a character input frame of an operation panel, and an example of coordinate information of the kanji.
[図 28]操作パネルの表示例を示す図であり、図 28の(a)は、表示一体型タブレットの 文字入力枠に筆記された漢字の一例を示す図であり、図 28の (b)は、候補文字表示 領域に表示された認識文字候補の例を示す図であり、図 28の (c)は、候補文字表示 領域に表示された候補熟語の例を示す図である。  [FIG. 28] is a view showing a display example of the operation panel, and FIG. 28 (a) is a view showing an example of kanji written in the character input frame of the display integrated type tablet. FIG. 28 is a diagram showing an example of a recognized character candidate displayed in the candidate character display area, and FIG. 28 (c) is a diagram showing an example of a candidate idiom displayed in the candidate character display area.
[図 29]筆記データ「外字 2」の認識結果の例を示す図である。 FIG. 29 is a diagram showing an example of the recognition result of the handwritten data “extra character 2”.
圆 30]本発明の他の実施形態に係る繁体字を入力する文字変換装置の要部構成を 示すブロック図である。 圆 30] FIG. 30 is a block diagram showing an essential configuration of a character conversion device for inputting a traditional Chinese character according to another embodiment of the present invention.
[図 31]本発明の他の実施形態における、繁体字入力のための注音文字入力処理時 の操作画面の一例を示す図である。  FIG. 31 is a view showing an example of an operation screen at the time of a phonetic character input process for traditional character input according to another embodiment of the present invention.
圆 32]本発明の他の実施形態における、文字変換装置の候補抽出部が候補漢字を 抽出するときに参照する、漢字辞書記録部に記録されている漢字辞書の例を示す図 である。 32) A diagram showing an example of a kanji dictionary recorded in a kanji dictionary recording unit, which is referred to when a candidate extraction unit of a character conversion device extracts candidate kanji in another embodiment of the present invention.
[図 33]本発明の他の実施形態における、文字変換装置の熟語辞書記録部が記録す る熟語辞書の例を示す図である。 [図 34]本発明の他の実施形態における、文字変換装置の連結度記録部に記録され る、漢字 (熟語)ごとの直前 (前方)文字列との連結度を示した連結度テーブルの例を 示す図である。 FIG. 33 is a diagram showing an example of a idiom dictionary recorded by the idiom dictionary recording unit of the character conversion device according to another embodiment of the present invention. [FIG. 34] An example of a connectivity table showing connectivity with the immediately preceding (forward) character string for each kanji (words) recorded in the connectivity recording part of the character conversion device in another embodiment of the present invention FIG.
[図 35]操作パネルの表示部に表示される操作画面の一例を示す図である。  FIG. 35 is a diagram showing an example of an operation screen displayed on the display unit of the operation panel.
[図 36]操作パネルの表示部に表示される操作画面の一例を示す図である。  FIG. 36 is a view showing an example of an operation screen displayed on the display unit of the operation panel.
[図 37]操作パネルの表示部に表示される操作画面の一例を示す図である。  FIG. 37 is a diagram showing an example of an operation screen displayed on the display unit of the operation panel.
[図 38]操作パネルの表示部に表示される操作画面の一例を示す図である。  FIG. 38 is a diagram showing an example of an operation screen displayed on the display unit of the operation panel.
発明を実施するための最良の形態  BEST MODE FOR CARRYING OUT THE INVENTION
[0018] 本発明の一実施形態(実施形態 1〜4)について図 12〜38に基づいて説明すると 以下の通りである。本実施形態では、一例として、入力装置と出力装置とが一体にな つているタツチパネルを備えたマルチファンクションプリンタ (複写機)に、本発明の文 字変換装置を適用した場合について説明する。しかし、本実施形態で説明する複写 機は一例であって、本発明の文字変換装置を、携帯電話、携帯端末、パソコン、リモ コンなどに適用することも可能である。  One embodiment (embodiments 1 to 4) of the present invention will be described below with reference to FIGS. 12 to 38. In this embodiment, as an example, the case where the character conversion device of the present invention is applied to a multi-function printer (copier) having a touch panel in which an input device and an output device are integrated will be described. However, the copying machine described in the present embodiment is an example, and the character conversion device of the present invention can be applied to a mobile phone, a portable terminal, a personal computer, a remote control, and the like.
[0019] 〔背景技術と問題点〕  [Background Art and Problems]
従来の文字変換装置において表意文字を得ようとすると、表音文字よりも操作が繁 雑になってしまう。表意文字の文字種は膨大な量であり、それらに 1つ 1つキーを割り 当てることなど現実的ではない。さらに、表意文字は同音異義語が多く存在すること が一般的であり、複数の表音文字の組み合わせによっても、それに対応する表意文 字の候補は膨大な量となる。その結果、膨大な量の候補から適切な表意文字を選び 出す作業は、ユーザに多くの手間を強いることになる。  When trying to obtain ideographic characters in a conventional character conversion device, operations become more complicated than phonetic characters. The ideographic character types are huge, and it is not realistic to assign them one by one key. Furthermore, ideographic characters generally have many homonyms, and combinations of multiple phonetic characters result in enormous amounts of corresponding ideographic characters. As a result, the task of selecting an appropriate ideographic character from a huge amount of candidates will require the user to do a lot of work.
[0020] この問題点は、日本語の文字入力システムに限って生じるものではなぐ同様の表 意文字入力システムであれば、同様に生じるものである。例えば、中国語はピンイン を利用(中国語の特徴である四声は省略)して入力する。漢字を入力するには、次の ような手順で行う。入力した ヽ漢字のピンイン表記 (アルファベットの文字列)をキーボ ードから入力し、そのピンインに対応する漢字への変換操作を行って所望の漢字を 得る。上記問題点を解決するために、所望の表意文字を得るための表意文字の入力 操作を簡素化することが可能な文字変換装置について説明する。 [0021] また、本実施形態では、ピンイン表記 (アルファベット列)から中国語 (簡体字)を入 力可能な文字変換装置について説明するが、本発明の文字変換装置は中国語に 限定されず、種々の表音文字力 所望の表意文字を得られるものである。なお、上記 ピンイン表記について、中国語には本来、 4種の異なる発音を表す四声が存在する 力 説明の簡略ィ匕のために四声の概念を省略して説明する。 [0020] This problem similarly occurs in the case of the same ideographic character input system which is not generated only in the Japanese character input system. For example, Chinese uses Pinyin (the four voices characteristic of Chinese are omitted). Follow the steps below to enter kanji characters. Input the pinyin notation (character string of the alphabet) of the input ヽ kanji from the keyboard and perform conversion operation to the kanji corresponding to the pinyin to obtain the desired kanji. In order to solve the above problems, a character conversion device capable of simplifying the input operation of ideographic characters for obtaining desired ideographic characters will be described. Further, in the present embodiment, a character conversion device capable of inputting Chinese characters (Simplified characters) from Pinyin notation (alphabet string) will be described. However, the character conversion device of the present invention is not limited to Chinese characters. Phonetic writing power of the desired ideographic character can be obtained. The above pinyin notation is described in Chinese, in which there are four voices representing four different pronunciation. The concept of four voices will be omitted for simplicity of explanation.
[0022] 〔実施形態 1〕 図 13は、本発明の文字変換装置 1を適用したマルチファンクションプリンタの概略 構成を示すブロック図である。  Embodiment 1 FIG. 13 is a block diagram showing a schematic configuration of a multi-function printer to which the character conversion device 1 of the present invention is applied.
[0023] マルチファンクションプリンタは、図 13に示すとおり、制御部 10、操作パネル (タブ レット) 2、一時記憶部 3、記録部 40、画像読取ユニット 4、画像転写ユニット 5、紙送り ユニット 6を備えている。なお、ここで説明するマルチファンクションプリンタの構成は、 本発明を実施するための一例として示すものであって、本発明の範囲を限定するも のではない。 As shown in FIG. 13, the multifunction printer includes a control unit 10, an operation panel (tablet) 2, a temporary storage unit 3, a recording unit 40, an image reading unit 4, an image transfer unit 5, and a paper feeding unit 6. Have. The configuration of the multifunction printer described here is shown as an example for implementing the present invention, and does not limit the scope of the present invention.
[0024] 制御部 10は、マルチファンクションプリンタを統括制御するものである。制御部 10 は、記録部 40に記録されている各種プログラムを読み出して、複写機としての機能を 果たす、画像読取ユニット 4、画像転写ユニット 5、および、紙送りユニット 6などを制 御し、複写などの処理を行う。また、本発明に係る文字入力処理を行うための各部も この制御部 10に含まれている。  The control unit 10 centrally controls the multi-function printer. The control unit 10 reads various programs recorded in the recording unit 40 and controls the image reading unit 4, the image transfer unit 5, the paper feeding unit 6 and the like to function as a copying machine, Perform processing such as. Further, each unit for performing character input processing according to the present invention is also included in the control unit 10.
[0025] 記録部 40は、操作パネル 2、複写機として動作するための上述の各種ユニットの制 御プログラム、および、文字変換装置 1の制御プログラムを記録するものであり、 RO M (read only memory)などの不揮発性記憶素子で実現される。  The recording unit 40 records an operation panel 2, a control program of the above-mentioned various units for operating as a copying machine, and a control program of the character conversion device 1, and an ROM (read only memory) Etc.) are realized.
[0026] 一時記憶部 3は、プログラムを処理するための作業領域、読取画像の一時記憶に 利用されるほ力 本実施形態では、文字入力処理時の学習文字 (列)などの一時記 憶に利用される。 RAM (random access memory)などの揮発性メモリで実現される。  The temporary storage unit 3 is a working area for processing a program, and the power used for temporarily storing read images. In this embodiment, temporary storage such as learning characters (strings) at the time of character input processing is used. It is used. It is realized by volatile memory such as random access memory (RAM).
[0027] 操作パネル 2は、ユーザがマルチファンクションプリンタに対して各種処理の実行を 指示するためのものである。例えば、表示装置と入力装置とが一体に形成されたタツ チパネルや、表示一体型タブレットで実現される。さら〖こ、表示一体型タブレットを用 いて筆記された文字を認識する手書き文字認識機能を備えていてもよい。また、ユー ザが表示一体型タブレットの指示した位置に基づいて、その指示された文字、文字 列、その他機能キーなどの情報を検出する機能を備えている。 Operation panel 2 is used by the user to instruct the multi-function printer to execute various processes. For example, it is realized by a touch panel in which a display device and an input device are integrally formed or a display integrated tablet. For display-integrated tablet It may have a handwritten character recognition function that recognizes written characters. It also has a function to detect information such as instructed characters, character strings, and other function keys based on the position indicated by the user on the display integrated tablet.
[0028] ユーザは、操作パネル 2に表示された、アルファベット(表音文字)に対応する各キ 一にペンや指などで軽く触れることにより、所望のアルファベットをマルチファンクショ ンプリンタに入力することが可能となる。  The user can input a desired alphabet to the multi-function printer by lightly touching each key corresponding to the alphabet (phonetic characters) displayed on operation panel 2 with a pen or a finger. Is possible.
[0029] あるいは、上記マルチファンクションプリンタは、操作パネル 2に加えて、キーボード  Alternatively, in addition to the operation panel 2, the multifunction printer may be a keyboard.
(第 1入力部 Z第 3入力部) 7 (例えば、 PCで使用する汎用キーボード)を備えていて もよい。この場合、操作パネル 2の表示されるキー、および、キーボード 7のキーのい ずれを用いてもアルファベットを入力することが可能となる。表音文字を入力するため のキーを備えた入力部(第 1入力部)を、操作パネル 2およびキーボード 7の両方で同 時に実現することもできるし、キーボード 7のみで実現することもできる。あるいは、表 音文字の入力を操作パネル 2で行うか、キーボード 7で行うかを切り替えられる構成と してちよい。  (First Input Unit Z Third Input Unit) 7 (for example, a general-purpose keyboard used in a PC) may be provided. In this case, it is possible to input alphabets using any of the displayed keys of the operation panel 2 and the keys of the keyboard 7. The input unit (first input unit) having a key for inputting phonetic characters can be realized simultaneously by both the operation panel 2 and the keyboard 7 or can be realized only by the keyboard 7. Alternatively, it may be configured to switch whether to input the phonetic characters on the operation panel 2 or to the keyboard 7.
[0030] 以下、上記文字入力処理を行う本発明の文字変換装置 1について、さらに詳細に 説明する。  Hereinafter, the character conversion device 1 of the present invention which performs the above-mentioned character input processing will be described in more detail.
[0031] (文字変換装置の構成)  (Configuration of Character Conversion Device)
図 12は、本実施形態に係る文字変換装置 1の要部構成を示すブロック図である。 文字変換装置 1は、図 12に示すとおり、操作パネル 2、制御部 10、記録部 40、およ び、一時記憶部 3を備えた構成となっている。  FIG. 12 is a block diagram showing the main configuration of the character conversion device 1 according to the present embodiment. The character conversion device 1 is configured to include an operation panel 2, a control unit 10, a recording unit 40, and a temporary storage unit 3 as shown in FIG.
[0032] 操作パネル 2は、上述したように、表示装置と入力装置とが一体に形成されたタツチ パネルで実現されて!、る。  As described above, the operation panel 2 is realized by a touch panel in which the display device and the input device are integrally formed !.
[0033] 表示部 12は、ユーザに各種情報を提供するものである。図 14は、操作パネル 2に 表示される文字入力処理時の操作画面の一例を示す図である。図 14に示すとおり、 表示部 12には、ユーザが文字を入力するためのキーボードを表示する文字入力領 域 121、入力の候補として挙げられている文字 (または、文字列)を表示する候補文 字 (列)表示領域 122、および、すでに入力が完了し確定している入力確定文字を表 示する確定文字 (列)表示領域 123が表示されている。入力部 11は、ユーザが中国 語発音表記のピンイン (表音文字列 z表音情報)を入力するためのものであり、文字 入力領域 121に表示されたアルファベットのキーがユーザにより押下されると、パネ ル上のその押下位置に対応するアルファベットの信号が、文字変換装置 1の入出力 制御部 20に入力される。また、入力部 11は、ユーザが入力を確定させたい文字を選 択するためのものでもある。候補文字表示領域 122に表示されている漢字 (表意文 字 Z変換文字)が押下されると、その押下位置に表示されている漢字の信号が、入 出力制御部 20に入力され、該漢字の入力が確定する。 The display unit 12 provides the user with various information. FIG. 14 is a diagram showing an example of an operation screen displayed on the operation panel 2 during character input processing. As shown in FIG. 14, the display unit 12 has a character input area 121 for displaying a keyboard for the user to input characters, and a candidate sentence for displaying characters (or character strings) listed as input candidates. A character (column) display area 122 and a confirmed character (column) display area 123 displaying an input confirmed character that has already been input and confirmed are displayed. The input unit 11 is a user in China It is for inputting pinyin (phonetic character string z phonetic information) of word pronunciation notation, and when the key of the alphabet displayed in the character input area 121 is pressed by the user, the pressed position on the panel is displayed. The signal of the alphabet corresponding to is input to the input / output control unit 20 of the character conversion device 1. Further, the input unit 11 is also for the user to select a character that the user wants to confirm the input. When a kanji (ideographic character Z converted character) displayed in the candidate character display area 122 is pressed, a signal of the kanji displayed in the pressed position is input to the input / output control unit 20, and the kanji character is displayed. The input is confirmed.
[0034] 制御部 10は、入出力制御部 20および文字入力制御部 30を制御するものである。  The control unit 10 controls the input / output control unit 20 and the character input control unit 30.
入出力制御部 20は、操作パネル 2と制御部 10との間でやりとりされる信号の入出力 を制御するものであり、入力受付部 21、表示制御部(表示制御手段) 22、および、候 補選択受付部 23を含んでいる。文字入力制御部 30は、文字入力処理を実行するも のであり、候補抽出部 (文字候補抽出手段 Z文字列候補抽出手段) 31、候補順序 決定部 (候補順序決定手段) 32、および、文字確定部 (文字入力確定手段) 33を含 んでいる。なお、制御部 10内の各部は、コンピュータの中央演算装置 (CPU)および CPUによって実行されるソフトウェアにより実現される。このソフトウェアも、上述の記 録部 40に格納され、一時記憶部 3に読み出されて CPUにより実行される。  The input / output control unit 20 controls input / output of signals exchanged between the operation panel 2 and the control unit 10, and includes an input receiving unit 21, a display control unit (display control means) 22, and A supplementary selection acceptance unit 23 is included. The character input control unit 30 executes character input processing, and the candidate extraction unit (character candidate extraction unit Z character string candidate extraction unit) 31, candidate order determination unit (candidate order determination unit) 32, and character confirmation Part 33 (character input confirmation means) is included. Note that each unit in the control unit 10 is realized by a central processing unit (CPU) of a computer and software executed by the CPU. This software is also stored in the above recording unit 40, read out to the temporary storage unit 3, and executed by the CPU.
[0035] 入出力制御部 20の入力受付部 21は、入力部 11から入力された文字 (アルファべ ットなど)の信号を受け付けて、候補抽出部 31に供給するものである。表示制御部 2 2は、表示部 12の文字入力領域 121、候補文字表示領域 122、および、確定文字表 示領域 123の各々に表示すべき情報が表示されるよう表示部 12を制御するものであ る。候補選択受付部 23は、入力部 11から入力された、候補文字表示領域 122内の 選択された候補漢字の信号、および、該選択された候補漢字の入力を確定する指示 信号を受け付けて、文字確定部 33に供給するものである。  The input receiving unit 21 of the input / output control unit 20 receives signals of characters (such as alphabets) input from the input unit 11 and supplies the signals to the candidate extracting unit 31. The display control unit 22 controls the display unit 12 so that information to be displayed is displayed in each of the character input area 121, the candidate character display area 122, and the confirmed character display area 123 of the display unit 12. is there. Candidate selection accepting portion 23 accepts the signal of the selected candidate kanji in candidate character display area 122 input from input portion 11 and the instruction signal for determining the input of the selected candidate kanji, It is supplied to the determination unit 33.
[0036] 文字入力制御部 30の候補抽出部 31は、入力された文字に基づいて、候補となる 漢字 (または)単語を記録部 40に記憶されて 、る各辞書部から取得するものである。 本実施形態では、候補抽出部 31は、入力されたアルファベット(列)に基づいて、候 補となる漢字を記録部 40の漢字辞書部 (文字記録部) 41から抽出する。  The candidate extraction unit 31 of the character input control unit 30 stores candidate kanji (or) words from the respective dictionary units stored in the recording unit 40 based on the input characters. . In the present embodiment, the candidate extraction unit 31 extracts a kanji as a candidate from the kanji dictionary unit (character recording unit) 41 of the recording unit 40 based on the input alphabet (string).
[0037] より詳細には、まず、入力されたアルファベット (列)に対応するピンインを特定して、 特定した各ピンインに対応する漢字を、候補漢字として抽出する。候補抽出部 31は 、抽出した候補漢字を、内部バッファ (不図示)に記憶し、候補順序決定部 32および 表示制御部 22に候補漢字の表示を指示する。 [0037] More specifically, first, pinyins corresponding to input alphabets (rows) are identified, The kanji corresponding to each pinyin identified is extracted as a candidate kanji. The candidate extraction unit 31 stores the extracted candidate kanji in an internal buffer (not shown), and instructs the candidate order determination unit 32 and the display control unit 22 to display the candidate kanji.
[0038] 候補抽出部 31が参照する漢字辞書部 41には、アルファベット (列)—ピンイン—漠 字が対応付けて記録されている。漢字辞書部 41の詳細は後述する。  In the kanji dictionary unit 41 to which the candidate extraction unit 31 refers, alphabet (row) -pinyin-vague characters are recorded in association with one another. Details of the kanji dictionary unit 41 will be described later.
[0039] 候補順序決定部 32は、候補抽出部 31が抽出した候補漢字の表示順序を、所定の 条件に基づいて決定するものである。表示順序とは、図 14に示すように、候補文字 表示領域 122に候補漢字を表示する順序のことである。抽出されたすベての候補漠 字が候補文字表示領域 122内に一度に表示できる場合、順序が上の候補漢字ほど 、候補文字表示領域 122の上部に表示されるようにし、ユーザが選択しやすいように すればよい。また、候補漢字が一度に表示できる数を上回る場合は、下位に続く順 序の候補漢字を表示するようページを切り換えるようにしてもよい。この場合でも、順 序が上の候補漢字ほど初めのページに表示されるようにして、上位の候補漢字ほど ユーザに選ばれやすいようにしておく。したがって、候補順序決定部 32は、抽出され た候補漢字を、所定の条件に基づいてユーザに選択されやすいものから順に表示さ れるよう順序を決定する。どのような条件に基づ 、て候補漢字の順序を決定するのか につ 、ては後に詳しく説明する。  The candidate order determination unit 32 determines the display order of the candidate kanji extracted by the candidate extraction unit 31 based on a predetermined condition. The display order is an order of displaying candidate kanji in the candidate character display area 122 as shown in FIG. If all extracted candidate vowels can be displayed at once in the candidate character display area 122, the order is higher at the upper part of the candidate character display area 122 as the upper candidate kanji is selected, and the user selects It should be easy. In addition, when the number of candidate kanji exceeds the number that can be displayed at one time, the page may be switched to display candidate kanji in lower order. Even in this case, the upper candidate kanji is displayed on the first page as the upper candidate kanji is displayed, and the upper candidate kanji is selected more easily by the user. Therefore, the candidate order determination unit 32 determines the order such that the extracted candidate kanji are displayed in order from those likely to be selected by the user based on a predetermined condition. Based on what conditions the order of candidate kanji will be determined, and will be described in detail later.
[0040] 文字確定部 33は、入力部 11より入力されたユーザが選択した候補漢字の信号、 および、該選択された候補漢字の入力を確定する指示信号を、候補選択受付部 23 より受け取り、ユーザにより選択された上記候補漢字の入力を確定させるものである。 まず、選択された候補漢字は、確定前の選択されている状態を示す下線とともに、確 定文字表示領域 123に表示させる。さらに、入力確定の指示信号に応じて文字確定 部 33が確定した候補漢字は、入力確定文字として下線が付されない状態で、表示 制御部 22を介して確定文字表示領域 123に表示される。  The character determination unit 33 receives from the candidate selection reception unit 23 the signal of the candidate kanji selected by the user input from the input unit 11 and the instruction signal for determining the input of the selected candidate kanji, The input of the candidate kanji selected by the user is determined. First, the selected candidate kanji is displayed in the definite character display area 123 together with the underline indicating the selected state before confirmation. Furthermore, the candidate kanji determined by the character determination unit 33 according to the instruction signal for input determination is displayed in the determined character display area 123 via the display control unit 22 without being underlined as an input determined character.
[0041] (漢字辞書)  (Kanji dictionary)
次に、漢字辞書部 41に記録される漢字辞書について説明する。図 15は、候補抽 出部 31が候補漢字を抽出するときに参照する、漢字辞書部 41に記録されている漠 字辞書の例を示す図である。 [0042] 図 15に示すとおり、本実施形態における漢字辞書 140は、入力受付部 21が受け 付けたアルファベット(列)とピンインとの対応付けを記録した、アルファベット ピンィ ン対応表 (AZP対応表 141)、および、ピンインと漢字との対応付けを記録した、ピン イン—漢字対応表 (PZC対応表 142)とを含んでいる。これにより、入力されたアルフ ァベット (列)に対応する候補漢字を直接抽出することが可能となる。 Next, the kanji dictionary recorded in the kanji dictionary unit 41 will be described. FIG. 15 is a diagram showing an example of a vague dictionary recorded in the kanji dictionary unit 41 which is referred to when the candidate extraction unit 31 extracts candidate kanji. As shown in FIG. 15, the kanji dictionary 140 according to the present embodiment is an alphabet piny correspondence table (AZP correspondence table 141) in which correspondence between the alphabets (rows) received by the input reception unit 21 and pinyins is recorded. And a pinyin-kanji correspondence table (PZC correspondence table 142) in which correspondence between pinyins and kanji is recorded. This makes it possible to directly extract candidate kanji corresponding to the input alphabet (column).
[0043] 具体的には、例えば、ユーザが「外字 2」(ピンインは「mei」)という漢字を入力した いために、入力部 11を用いて、まず「m」のキーを押下したとする。入力受付部 21より 「m」の信号が供給されると、候補抽出部 31は、まず、漢字辞書 140の AZP対応表 1 41から、「m」と前方一致するピンイン(図 15の例では、「ma」Wmei」)を特定し、該 ピンインに対応する漢字が記録されている、 PZC対応表 142のアドレスを検出する( 「cp― ml」と「cp― mi」 J。  Specifically, for example, it is assumed that the user first presses the key “m” using the input unit 11 in order to input the kanji “extra character 2” (pinyin is “mei”). When a signal of “m” is supplied from the input reception unit 21, the candidate extraction unit 31 first identifies pinyin that matches “m” ahead (in the example of FIG. 15, according to AZP correspondence table 141 of the kanji dictionary 140). “Ma” Wmei ”) is identified, and the address of PZC correspondence table 142 in which the kanji corresponding to the pinyin is recorded is detected (“ cp−ml ”and“ cp−mi ”J).
[0044] 続いて、 PZC対応表 142の「cp— ml」と「cp— mi」とに基づいて、候補抽出部 31 は、「m」と前方一致のピンインに対応する漢字(  Subsequently, based on “cp−ml” and “cp−mi” in the PZC correspondence table 142, the candidate extraction unit 31 selects the kanji corresponding to pinyin of “m” and the forward match.
[0045] [表 3] 妈  [Table 3] 妈
[0046] 、麻、没、 · · ·「外字 2」、  [0046], hemp, death,
[0047] [表 4] 浼  [Table 4] 浼
[0048] · · を抽出する。抽出された漢字は、上述したとおり、候補漢字として候補順序決定 部 32に供給される。ここで、ユーザによりさらに「mei」と入力された場合には、候補抽 出部 31は、「mei」と前方一致する漢字に絞り込んで、漢字辞書部 41から候補漢字 を抽出する。  [0048] · · Extract the. The extracted kanji is supplied to the candidate order determination unit 32 as a candidate kanji as described above. Here, when “mei” is further input by the user, the candidate extraction unit 31 narrows down on the kanji that is in front agreement with “mei”, and extracts candidate kanji from the kanji dictionary unit 41.
[0049] 上記構成によれば、候補抽出部 31は、ユーザが入力したアルファベットを受け付け て、そのアルファベットを含む、ピンインを特定する。候補抽出部 31は、特定したピン インに基づいて、ユーザが入力したいと所望する漢字の候補を漢字辞書部 41から抽 出することができる。  According to the above configuration, the candidate extraction unit 31 receives an alphabet input by the user, and specifies pinyin including the alphabet. The candidate extraction unit 31 can extract, from the kanji dictionary unit 41, candidates for kanji that the user desires to input based on the identified pin-ins.
[0050] 候補順序決定部 32は、上記候補抽出部 31が抽出した漢字の候補を所定の条件 に基づいて並べ替え、候補の表示順序を決定する。表示制御部 22は、漢字の候補 力 上記候補順序決定部 32が決定した表示順序で表示されるよう表示部 12を制御 する。 The candidate order determination unit 32 determines the kanji candidate extracted by the candidate extraction unit 31 under a predetermined condition. Sort based on and determine the display order of the candidates. The display control unit 22 controls the display unit 12 so that the display order is displayed in the display order determined by the candidate order determination unit 32 of the Chinese character candidate.
[0051] 最後に、ユーザが表示された入力したい候補漢字の中から、所望の漢字を指定す ると、文字確定部 33は、指定された漢字の入力を確定させる。  Finally, when a desired kanji is designated from among the displayed candidate kanji that the user desires to input, the character determination unit 33 determines the input of the designated kanji.
[0052] これにより、所望の漢字を得るための入力操作を簡素化することが可能となる。  [0052] This makes it possible to simplify the input operation for obtaining the desired kanji.
[0053] また、候補抽出部 31が抽出する候補となる漢字は、ユーザのアルファベット入力操 作により、外部記憶装置(図示せず)への記憶および外部記憶装置からの読込みが 行えるようになつていてもよぐ読み込まれた漢字を、候補を表示する際に利用するよ うにしてもよい。  Also, kanji characters that are candidates to be extracted by the candidate extraction unit 31 can be stored in an external storage device (not shown) and read from an external storage device by user's alphabet input operation. Even if the kanji is read, it may be used when displaying the candidates.
[0054] (変形例 1 候補単語の出力)  (Modification 1 Output of Candidate Words)
さらに、本発明の文字変換装置 1は、某漢字が入力確定文字として選択されたのち 、該漢字力 始まる 1つの意味を持つ 1以上の漢字力もなる漢字列 (以下、単語 (表 意文字列)と称する)を候補単語として、候補文字表示領域 122 (図 14)に表示する ことが好ましい。これにより、ユーザは、上記某漢字とともに単語を形成し得る漢字を 候補文字表示領域 122において得ることができるので、続く漢字のピンインを別途入 力しなくても、所望の漢字を得ることが可能となる。したがって、所望の漢字を得るた めの入力操作を簡素化することが可能となる。  Furthermore, the character conversion device 1 according to the present invention is a kanji string (hereinafter referred to as a word (ideographic character string) in which one or more kanji powers having one meaning beginning with the kanji powers are selected after the 某 kanji is selected as the input determined characters Preferably, the candidate character display area 122 (FIG. 14) is displayed as a candidate word. As a result, the user can obtain a kanji that can form a word with the 某 kanji in the candidate character display area 122, and thus can obtain a desired kanji without separately inputting the pinyin of the following kanji. It becomes. Therefore, it is possible to simplify the input operation for obtaining the desired kanji.
[0055] 図 16は、候補単語を出力する文字変換装置 1の要部構成を示すブロック図である 。なお、図 16の各構成要素に付された符号は、図 12の各構成要素に付された符号 に対応しており、同じ符号は、同じ構成要素を示している。したがって、上述の各実 施形態ですでに説明した構成要素についての説明は繰り返さない。図 16に示す文 字変換装置 1において、図 12と異なる点は、文字確定部 33から候補抽出部 31に制 御信号が伝達される点と、記録部 40に単語辞書部(文字列記録部) 42が含まれて 、 る^;である。  FIG. 16 is a block diagram showing the main configuration of the character conversion device 1 that outputs candidate words. The reference numerals attached to the respective components in FIG. 16 correspond to the reference numerals attached to the respective components in FIG. 12, and the same reference numerals indicate the same components. Therefore, the description of the components already described in the above embodiments will not be repeated. The character conversion device 1 shown in FIG. 16 differs from that shown in FIG. 12 in that the control signal is transmitted from the character determination unit 33 to the candidate extraction unit 31, and the word dictionary unit (character string recording unit 42) is included ^ ^.
[0056] 文字確定部 33は、ユーザにより選択された漢字 (または、入力確定文字)が候補選 択受付部 23を介して入力されると、続いて、候補抽出部 31に対して、当該選択され たから始まる単語を抽出する指示を送る。 [0057] 候補抽出部 31は、上記の指示に基づいて、上記選択された漢字から始まる単語を 、候補単語として単語辞書部 42から抽出する。図 17は、操作パネル 2の表示部 12 に表示される候補単語の例を示す図である。図 17に示す例では、漢字「外字 2」が選 択されているときに、「外字 2」力も始まる単語が、候補単語として表示部 12の候補文 字表示領域 122に表示されている。 When the kanji character (or the input finalized character) selected by the user is input through candidate selection reception unit 23, character determination unit 33 continues to select candidate extraction unit 31 for the selection. Send instructions to extract words that start with The candidate extraction unit 31 extracts a word starting from the selected kanji as a candidate word from the word dictionary unit 42 based on the above instruction. FIG. 17 is a diagram showing an example of candidate words displayed on the display unit 12 of the operation panel 2. In the example shown in FIG. 17, when the kanji character “extra character 2” is selected, a word that starts with “extra character 2” is displayed as a candidate word in the candidate character display area 122 of the display unit 12.
[0058] 候補抽出部 31が参照する単語辞書部 42には、漢字 単語が対応付けて記録さ れている。図 18は、単語辞書部 42が記録する単語辞書の例を示す図である。図 18 に示すとおり、本実施形態における単語辞書は、漢字辞書部 41に記録されている漠 字と、該漢字から始まる単語との対応付けを記録した、漢字 単語対応表 (CZI対 応表 143)を含んでいる。これにより、選択された漢字に対応する候補単語をさら〖こ 抽出することが可能となる。  Kanji words are recorded in association with each other in the word dictionary unit 42 to which the candidate extraction unit 31 refers. FIG. 18 is a diagram showing an example of a word dictionary recorded by the word dictionary unit 42. As shown in FIG. As shown in FIG. 18, the word dictionary in this embodiment is a kanji word correspondence table (CZI correspondence table 143) in which correspondences between vague characters recorded in the kanji dictionary unit 41 and words starting from the kanji are recorded. Contains). This makes it possible to further extract candidate words corresponding to the selected kanji.
[0059] 具体的には、例えば、ユーザにより、漢字「外字 2」が選択されると、候補抽出部 31 は、 PZC対応表 142 (図 15)の「外字 2」のアドレスを検出する(「wp— mi+j」)。続 いて、単語辞書部 42を参照し、アドレス「wp—mi+j」に基づいて、「外字 2」から始ま る単語(「外字 2」月、「外字 2」天' · を抽出する。抽出された単語は、上述の構成と 同様に、候補単語として候補順序決定部 32に供給され、候補文字表示領域 122〖こ 表示される(図 17)。  Specifically, for example, when the user selects the kanji “extra character 2”, the candidate extraction unit 31 detects the address of “extra character 2” in the PZC correspondence table 142 (FIG. 15) (“ wp—mi + j ”). Subsequently, referring to the word dictionary unit 42, the words starting with "extra character 2" ("extra character 2" month, "extra character 2" heaven '· · · are extracted based on the address "wp-mi + j". The selected word is supplied as a candidate word to the candidate order determination unit 32 as in the configuration described above, and displayed in the candidate character display area 122 (FIG. 17).
[0060] ある漢字が入力されると、その漢字の次に続く文字 (ユーザが所望する漢字)は、何 の脈絡もない漢字よりも、その漢字力も始まる単語を構成する漢字である可能性が高 い。したがって、入力が確定された漢字から始まる単語を候補として表示すれば、ュ 一ザは、その単語の候補の中から所望の単語を選択するのみで、別途ピンインを入 力することなぐ中国語を入力することが可能となる。これにより、ユーザは、ピンイン の入力に係る操作数について、さらに少ない操作数で、所望の漢字の入力を得るこ とが可能となる。  [0060] When a kanji is input, the character following the kanji (the kanji desired by the user) may be a kanji that constitutes a word that also starts its kanji power than a kanji without any context. high. Therefore, if a word starting from a kanji whose input has been confirmed is displayed as a candidate, the user only selects the desired word from among the word candidates and does not separately input Chinese Pinyin. It becomes possible to input. As a result, the user can obtain a desired kanji input with a smaller number of operations related to the input of pinyin.
[0061] (フローチャート)  [0061] (Flowchart)
次に、本発明の文字変換装置 1 (図 16)の処理の流れを、図 19に示すフローチヤ ートと、図 20に示す操作パネル 2上の候補文字表示領域 122および確定文字表示 領域 123の表示例に基づき説明する。以下では、ユーザが、「「外字 2」天到郊外散 歩」を入力しょうとした場合について説明する。 Next, the process flow of the character conversion device 1 (FIG. 16) of the present invention is shown in the flowchart of FIG. 19 and the candidate character display area 122 and the determined character display area 123 on the operation panel 2 shown in FIG. It demonstrates based on the example of a display. In the following, the user says "" Explain the case where it is intended to input "step".
[0062] ユーザが操作パネル 2の入力部 11からアルファベット(漢字「外字 2」 (mei)のピン インの先頭アルファベット「m」)を入力すると(S201の YES)、入力受付部 21は、入 力されたアルファベット「m」の信号を、候補抽出部 31に供給する。次に、候補抽出 部 31は、入力されたアルファベットに対応するピンインを特定し、特定したピンインに 対応する漢字を、候補漢字として漢字辞書部 41から抽出する (S202)。  When the user inputs an alphabet (leading alphabet “m” of pinyin of Kanji “extra character 2” (mei)) from input unit 11 of operation panel 2 (YES in S201), input reception unit 21 receives an input The signal of the alphabet “m” is supplied to the candidate extraction unit 31. Next, the candidate extraction unit 31 specifies pinyin corresponding to the input alphabet, and extracts a kanji corresponding to the specified pinyin from the kanji dictionary unit 41 as a candidate kanji (S202).
[0063] 候補順序決定部 32は、所定の条件に基づ!/、て上記抽出された候補漢字の表示順 序を決定する(詳細は後述)(S203)。続いて、表示制御部 22は、候補順序決定部 3 2が決定した表示順序で n個の候補漢字を表示するよう表示位置を定め、上記候補 漢字を表示部 12に出力し、ユーザに提示する(S204)。図 20の(a)に、 n= 5として、 漢字候補を 5個づっ表示する場合の表示例を示す。  The candidate order determination unit 32 determines the display order of the above extracted candidate kanji based on predetermined conditions (details will be described later) (S203). Subsequently, the display control unit 22 determines the display position so as to display n candidate kanji in the display order determined by the candidate order determination unit 32, and outputs the candidate kanji to the display unit 12 to present it to the user. (S204). FIG. 20 (a) shows a display example in the case where five kanji candidates are displayed with n = 5.
[0064] ここで、ユーザにより、 1度目に候補文字表示領域 122に表示された漢字以外の次 候補漢字を表示する指示(図 20の (a)に所望の漢字「外字 2」がないので、例えば、「 次候補表示」のボタンを押すなどする)が入力されると(S205の NO、 S206の YES) 、 S204に戻り、表示制御部 22は、次の候補漢字のリストを表示する(図 20の(b) )。 さらに、次候補漢字の表示指示も、候補漢字の選択指示も入力されず (S205、 S20 6において NO)、新たにアルファベットが入力された場合には、 S202に戻り、候補抽 出部 31は、これまでに入力されたアルファベット列に対応するピンインをさらに絞り込 んで特定し、特定したピンインに対応する漢字を、候補漢字として抽出する。  Here, an instruction to display the next candidate kanji other than the kanji displayed at the first time in the candidate character display area 122 by the user (because there is no desired kanji “extra character 2” in (a) of FIG. 20) For example, when the button "next candidate display" is pressed (NO in S205, YES in S206), the process returns to S204, and the display control unit 22 displays a list of the next candidate kanji (see FIG. 20 (b)). Furthermore, neither the display instruction for the next candidate kanji nor the selection instruction for the candidate kanji is input (S205, NO in S206), and when an alphabet is newly input, the process returns to S202, and the candidate extraction unit 31 The pinyins corresponding to the alphabet string input so far are further narrowed down and specified, and the kanji corresponding to the specified pinyin is extracted as a candidate kanji.
[0065] 一方、ユーザが、入力部 11を介して、候補文字表示領域 122に表示された所望の 漢字「外字 2」(図 20の (b)の" 3011")を選択すると(S205の YES)、候補選択受付 部 23は、選択された候補漢字「外字 2」の信号を受け付けて、文字確定部 33に供給 する。  On the other hand, when the user selects the desired kanji “extra character 2” (“3011” in (b) of FIG. 20) displayed in candidate character display area 122 via input unit 11, (YES in S205) And the candidate selection acceptance unit 23 accepts the signal of the selected candidate kanji “extra character 2”, and supplies it to the character determination unit 33.
[0066] 文字確定部 33は、選択された「外字 2」を表示するよう表示制御部 22に指示すると ともに、候補抽出部 31に対して、次に「外字 2」から始まる単語を抽出するよう指示す る。候補抽出部 31は、続いて、「外字 2」から始まる単語を候補単語として単語辞書 部 42から抽出する(S207) (図 20の(c) )。  The character determination unit 33 instructs the display control unit 22 to display the selected “extra character 2”, and, for the candidate extraction unit 31, extracts a word starting with “extra character 2” next. Point. The candidate extraction unit 31 subsequently extracts a word beginning with “extra character 2” as a candidate word from the word dictionary unit 42 (S207) ((c) in FIG. 20).
[0067] 候補順序決定部 32は、所定の条件に基づ!/、て上記抽出された候補単語の表示順 序を決定する(詳細は後述)(S208)。続いて、表示制御部 22は、候補順序決定部 3 2が決定した表示順序にて上記候補単語を表示部 12に出力し、ユーザに提示する( S209)。 [0067] The candidate order determination unit 32 determines the display order of the extracted candidate words based on predetermined conditions. The order is determined (details will be described later) (S208). Subsequently, the display control unit 22 outputs the candidate words to the display unit 12 in the display order determined by the candidate order determination unit 32 and presents them to the user (S209).
[0068] ここで、ユーザにより、次候補単語を表示する指示が入力されると(S210の NO、 S 211の YES)、 S209に戻り、表示制御部 22は、次の候補単語のリストを表示する。さ らに、次候補単語の表示指示も、候補単語の選択指示も入力されず (S210、 S211 において NO)、新たにアルファベットが入力された場合には、 S202に戻り、候補抽 出部 31は、これまでに入力されたアルファベットに対応するピンインを特定し、特定し たピンインに対応する漢字を、候補漢字として抽出する。  Here, when the user inputs an instruction to display the next candidate word (NO in S210, YES in S211), the process returns to S209, and the display control unit 22 displays a list of the next candidate words. Do. Furthermore, neither the instruction to display the next candidate word nor the instruction to select the candidate word is input (S210, S211: NO), and if an alphabet is newly input, the process returns to S202 and the candidate extraction unit 31 The pinyin corresponding to the alphabet input so far is specified, and the kanji corresponding to the specified pinyin is extracted as a candidate kanji.
[0069] 一方、ユーザが、入力部 11を介して、候補文字表示領域 122に表示された所望の 単語「「外字 2」天」(図 20の(c)の" 020")を選択すると(S210の YES)、候補選択受 付部 23は、選択された候補単語「「外字 2」天」の信号を受け付けて、文字確定部 33 に供給する。  On the other hand, when the user selects the desired word ““ extra character 2 ”sky” (“020” in FIG. 20 (c)) displayed in candidate character display area 122 via input unit 11 (“020” in FIG. 20) The candidate selection reception unit 23 receives the signal of the selected candidate word ““ extra digit 2 ”heaven” and supplies the signal to the character determination unit 33.
[0070] 文字確定部 33は、選択された候補単語を表示するよう表示制御部 22に指示し、表 示制御部 22は、選択された候補単語を表示部 12に表示する(S 212) (図 20の(d) ) 。同時に、候補抽出部 31に対して、次に該候補単語から始まる単語を抽出するよう 指示する。文字の入力を終了する指示が入力されなければ (S213の NO)、候補抽 出部 31は、 S207に戻り、上記選択された候補単語から始まる単語を候補単語として 単語辞書部 42から抽出する。  The character determination unit 33 instructs the display control unit 22 to display the selected candidate word, and the display control unit 22 displays the selected candidate word on the display unit 12 (S 212) (S 212) (S 212) Figure 20 (d)). At the same time, it instructs the candidate extraction unit 31 to extract a word starting from the candidate word. If an instruction to end the input of characters is not input (NO in S213), the candidate extraction unit 31 returns to S207 and extracts a word starting from the selected candidate word as a candidate word from the word dictionary unit 42.
[0071] ユーザの次の所望の漢字「到」が候補漢字として表示されない場合、ユーザは、「 到」(dao)の先頭ピンイン「d」を入力することができる(S211の NO、 S201の YES) ( 図 20の(e) )。以降同様の処理を繰り返し、所望の漢字を入力することが可能となる。  [0071] When the user's next desired kanji "Iru" is not displayed as a candidate kanji, the user can input the leading pinyin "d" of "Iru" (dao) (NO in S211, YES in S201) ) (Fig. 20 (e)). Thereafter, the same process is repeated, and it becomes possible to input a desired kanji.
[0072] 選択された候補単語が選択されたのち(「散歩」まで入力が確定されたのち)、文字 の入力を終了する指示が入力されれば (S213の YES)、処理を終了する。  After the selected candidate word has been selected (after the input has been confirmed up to “walk”), if an instruction to end the input of characters is input (YES in S 213), the process is ended.
[0073] (変形例 2—学習機能)  (Modification 2—Learning Function)
さらに、本発明の文字変換装置 1は、ユーザの選択により入力が確定された漢字( 単語)を記録する構成を備えていることが好ましい。これにより、文字変換装置 1は、 過去に入力した漢字、つまり、ユーザが使用したことのある漢字の記録に基づいて、 候補漢字を表示することができる。過去に使用された漢字は、ユーザによって再度使 用される可能性は大きい。したがって、そのような使用履歴が反映された候補漢字の 一覧を用いることにより、ユーザは少ない操作数で所望の漢字を得ることが可能とな る。 Furthermore, it is preferable that the character conversion apparatus 1 of this invention is provided with the structure which records the kanji (word) with which the input was decided by user's selection. Thereby, the character conversion device 1 is based on the kanji input in the past, that is, based on the kanji recorded by the user. Candidate kanji can be displayed. Kanji used in the past are likely to be used again by the user. Therefore, by using the list of candidate kanji in which such usage history is reflected, the user can obtain a desired kanji with a small number of operations.
[0074] 図 21は、学習機能を備えた文字変換装置 1の要部構成を示すブロック図である。  FIG. 21 is a block diagram showing the main configuration of the character conversion device 1 having a learning function.
なお、図 21の各構成要素に付された符号は、図 12の各構成要素に付された符号に 対応しており、同じ符号は、同じ構成要素を示している。したがって、上述の各実施 形態ですでに説明した構成要素についての説明は繰り返さない。図 21に示す文字 変換装置 1において、図 12と異なる点は、文字入力制御部 30が、さらに、学習制御 部 (学習制御手段) 34を有して ヽる点と、記録部 40に学習辞書記録部 (学習語記録 部) 43およびペア学習辞書記録部(学習語記録部) 44が含まれて ヽる点である。  The reference numerals attached to the respective components in FIG. 21 correspond to the reference numerals attached to the respective components in FIG. 12, and the same reference numerals indicate the same components. Therefore, the description of the components already described in the above embodiments will not be repeated. The character conversion device 1 shown in FIG. 21 differs from that in FIG. 12 in that the character input control unit 30 further includes a learning control unit (learning control means) 34 and a learning dictionary in the recording unit 40. A recording unit (learning word recording unit) 43 and a pair learning dictionary recording unit (learning word recording unit) 44 are included.
[0075] 学習制御部 34は、ユーザの指示により、文字変換装置 1に入力 (使用)された漢字 の記録 (学習)を制御するものである。具体的には、ユーザにより使用された漢字を学 習漢字として、学習辞書記録部 43に格納したり(以下、単独学習 (機能)と称す)、ま たは、ユーザにより使用された漢字を、直前に入力されている漢字と関連付けてペア 学習辞書記録部 44に格納したり(以下、ペア学習 (機能)と称す)する。ユーザにより 過去に入力されたことのある漢字は、ユーザによって再度使用される可能性が大きい ので、そのような学習漢字を候補漢字として利用することにより、ユーザがより少ない 操作数で所望の漢字を得る確率が高まる。  The learning control unit 34 controls the recording (learning) of the kanji input (used) to the character conversion device 1 according to the instruction of the user. Specifically, the kanji used by the user is stored as learning kanji in the learning dictionary recording unit 43 (hereinafter referred to as single learning (function)), or the kanji used by the user is The pair is stored in the pair learning dictionary recording unit 44 in association with the kanji input immediately before (hereinafter referred to as pair learning (function)). Since kanji characters that have been input by the user in the past are likely to be used again by the user, by using such learned kanji characters as candidate kanji characters, the user can input desired kanji characters with fewer operations. The probability of getting is increased.
[0076] 学習辞書記録部 43は、ユーザにより過去に使用された漢字 (入力確定文字)を単 独学習漢字として記録するものである。学習辞書記録部 43に記録される学習辞書に は、その単独学習漢字が記録された日時の情報や、使用頻度の情報 (過去に何回 使用されたか、所定期間内に何回使用された力など)が、漢字ごとに対応付けて記 録されていてもよい。  The learning dictionary recording unit 43 records kanji (input determined characters) used in the past by the user as single learning kanji. The learning dictionary recorded in the learning dictionary recording unit 43 contains information on the date and time when the single learning kanji was recorded, information on the frequency of use (how many times it has been used in the past, and how many times it has been used within a predetermined period. Etc.) may be recorded in association with each kanji.
[0077] 学習制御部 34は、文字確定部 33から、ユーザにより選択された候補漢字の信号を 受け取ると、一時記憶部 3に該選択された候補漢字を書き込む。学習制御部 34は、 文字確定部 33が確定指示信号に応じて入力を確定させたとき、文字確定部 33より 当該入力確定文字の単独学習処理の指示を受け付ける。学習制御部 34は、一時記 憶部 3から、上記候補漢字を内部バッファに読み出し、一時記憶部 3から読み出され た候補漢字の削除および内容の更新を行う。そして、内部バッファに記憶した候補漠 字を単独学習漢字として学習辞書記録部 43に記憶する。 When receiving the signal of the candidate kanji selected by the user from the character determination unit 33, the learning control unit 34 writes the selected candidate kanji in the temporary storage unit 3. When the character determination unit 33 determines the input in accordance with the determination instruction signal, the learning control unit 34 receives an instruction of the single learning process of the input determined character from the character determination unit 33. The learning control unit 34 temporarily writes The candidate kanji is read from the memory unit 3 into the internal buffer, and deletion of the candidate kanji read from the temporary storage unit 3 and updating of the contents are performed. Then, the candidate vowels stored in the internal buffer are stored in the learning dictionary recording unit 43 as single learning kanji.
[0078] ペア学習辞書記録部 44は、ユーザにより過去に使用された漢字を、当該漢字がど の漢字に続 、て確定されたのかを示す情報 (直前漢字)と関連付けて、ペア学習漠 字として記録するものである。直前漢字は、すでに学習辞書記録部 43において単独 学習漢字として記録されている。ペア学習辞書記録部 44に記録される学習辞書には 、そのペア学習漢字ごとに、上記直前漢字が関連付けて記録される。この漢字のぺ ァごとに、記録された日時の情報や、使用頻度の情報を対応付けて記録してもよい。  Pair learning dictionary recording unit 44 associates a kanji used in the past by the user with information (preceding kanji) indicating which kanji follows the kanji and the pair learning vague Is recorded as The immediately preceding kanji has already been recorded as a single learning kanji in the learning dictionary recording unit 43. In the learning dictionary recorded in the pair learning dictionary recording unit 44, the preceding kanji is associated and recorded for each pair learning kanji. Information of the recorded date and time, and information of frequency of use may be associated with each kanji character and recorded.
[0079] 学習制御部 34は、文字確定部 33より入力確定文字のペア学習処理の指示を受け 付ける。学習制御部 34は、一時記憶部 3から、上記入力確定文字と、その直前漢字 とを内部バッファに読み出し、一時記憶部 3から読み出された文字列の削除および内 容の更新を行う。そして、内部バッファに記憶した入力確定文字を直前漢字と関連付 けて、ペア学習漢字としてペア学習辞書記録部 44に記憶する。  The learning control unit 34 receives from the character determination unit 33 an instruction of a pair learning process of the input determined character. The learning control unit 34 reads the input determined character and its immediately preceding kanji from the temporary storage unit 3 to the internal buffer, and deletes the character string read from the temporary storage unit 3 and updates the contents. Then, the input determined character stored in the internal buffer is associated with the immediately preceding kanji, and is stored in the pair learning dictionary recording unit 44 as a pair learning kanji.
[0080] なお、上記では、学習辞書記録部 43およびペア学習辞書記録部 44にお 、て、各 学習辞書が漢字ごとに記録される構成について説明したが、これに限定されない。 上記構成に加えて、単語を辞書に記録する構成としてもよい。すなわち、学習制御 部 34の単独学習機能により、使用された単語が、学習辞書記録部 43に記録される ようにしてもよい。例えば、「給水」という単語が使用された場合、「給」という漢字に加 え、「給水」という単語が、単独学習漢字 (単語)として学習辞書記録部 43に記録され る。  In the above, the configuration in which each learning dictionary is recorded for each kanji in the learning dictionary recording unit 43 and the pair learning dictionary recording unit 44 has been described, but the present invention is not limited to this. In addition to the above configuration, a word may be recorded in a dictionary. That is, the word used may be recorded in the learning dictionary recording unit 43 by the single learning function of the learning control unit 34. For example, when the word "water supply" is used, the word "water supply" is recorded in the learning dictionary recording unit 43 as a single learning kanji (word) in addition to the kanji "supply".
[0081] また、学習制御部 34のペア学習機能によれば、「大学」「教授」という単語が続けて 入力された場合、漢字「教」を、「学」および Zまたは「大学」の直前漢字 (単語)と関 連付けて、ペア学習漢字として記録することもできる。さらに、単語「教授」を、「学」お よび Zまたは「大学」の直前漢字 (単語)と関連付けて、ペア学習単語として記録する ことちでさる。  Further, according to the pair learning function of the learning control unit 34, when the words “college” and “professor” are continuously input, the kanji “teaching” is It can also be recorded as paired learning kanji in association with kanji (word). Furthermore, the word “professor” is associated with “学” and “Z” or “University” 's last kanji (word) and recorded as a paired learning word.
[0082] (候補順序決定要素)  (Candidate Order Determining Element)
次に、文字変換装置 1の候補順序決定部 32 (図 12、 16、 21)が候補 (候補漢字 Z 候補単語)の表示順序を決定するための所定の条件 (以下、候補順序決定要素と称 する)について説明する。 Next, the candidate order determination unit 32 (FIGS. 12, 16, and 21) of the character conversion device 1 is a candidate (candidate kanji Z A predetermined condition (hereinafter referred to as a candidate order determining element) for determining the display order of the candidate words will be described.
[0083] 上述したとおり、操作パネル 2に設けられた表示部 12において、一度により多くの 候補を表示できる方が、ユーザがより早く所望の漢字を得る可能性が高くなるというメ リットがある。しかも、操作パネル 2をタツチパネルなどの表示 Z入力一体型の入出力 デバイスで実現した場合、ユーザが所望の漢字を得るための操作数を格段に減らす ことが可能である。しかし、一方で、多くの候補を大きな表示画面に一覧表示しても、 その中力 所望の候補を探す負担が逆に大きくなるという問題が生じる。そこで、限ら れた表示領域に限られた数の候補を表示する際、ユーザが所望する (選択する)可 能性の大き!/、漢字 (単語)力 順に表示されることが望まれる。  As described above, there is a merit that it is more likely that the user can obtain a desired kanji more quickly if the display unit 12 provided on the operation panel 2 can display more candidates at one time. Moreover, when the operation panel 2 is realized by a display Z input integrated input / output device such as a touch panel, it is possible to significantly reduce the number of operations for the user to obtain a desired kanji. However, on the other hand, even if many candidates are displayed on a large display screen, there is a problem that the burden of searching for a desired candidate is increased. Therefore, when displaying a limited number of candidates in a limited display area, it is desirable that they be displayed in the order of the level of possibility (selecting) that the user desires (selects) kanji (word) power.
[0084] 本発明では、ユーザの嗜好に合致した、より精度の高 、表示順序で候補が表示さ れるよう、表示する候補の順序を決定する。これにより、所望の漢字を得るための入 力操作を簡素化することが可能な文字変換装置を実現することができる。  In the present invention, the order of candidates to be displayed is determined so that the candidates are displayed in the display order with higher accuracy and in accordance with the preference of the user. As a result, it is possible to realize a character conversion device capable of simplifying the input operation for obtaining a desired kanji.
[0085] 以下では、候補単語の表示順序を決定する方法につ!、て説明するが、候補漢字 [0085] In the following, a method of determining the display order of candidate words will be described!
Z候補単語の 、ずれの表示順序も同じ方法にて決定される。 The display order of deviations of the Z candidate words is also determined in the same manner.
[0086] 本実施形態では、候補順序決定部 32は、以下の 7つの候補順序決定要素に基づ いて、候補の表示順序を決定する。  In the present embodiment, the candidate order determination unit 32 determines the display order of candidates based on the following seven candidate order determination factors.
[0087] (要素 A)出現頻度 (Element A) Frequency of occurrence
出現頻度とは、ある言語の文字 (列)において、どの程度出現するのかの指標となる 情報である。出現頻度が高ければ高いほど、その言語使用環境下においてよく使用 されているということになるので、出現頻度の高い候補ほど表示順序を上位にする。 出現頻度は、既知の統計データを用いた固定的な情報を用いればよい。  The appearance frequency is information that is an index of how often it appears in a character (string) of a certain language. The higher the frequency of occurrence, the more often it is used in the language usage environment. Therefore, the higher the frequency of appearance, the higher the display order. The appearance frequency may be fixed information using known statistical data.
[0088] (要素 B)ピンイン長の一致 Z不一致 (Element B) Pin-in length match Z mismatch
入力されたアルファベット (列)の文字数と、対応する候補のピンインの文字数が一 致するか否かに基づいて表示順序を決定する。ピンイン長が、入力されたピンイン文 字数と一致する候補の表示順序を上位にする。  The display order is determined based on whether or not the number of characters of the input alphabet (column) matches the number of characters of the corresponding candidate pinyin. Pinyin length ranks the display order of candidates whose number matches the number of input Pinyin characters.
[0089] (要素 C)直前文字列との連続性 [0089] (Element C) Continuity with the immediately preceding string
その時点までにすでに入力が確定している入力確定文字および Zまたは現時点で 選択中の文字 (列)の直後に続く可能性 (直前文字列との連続性)の大き 、候補ほど 表示順序を上位にする。あるいは、過去の入力履歴より、上記直前文字列に連続し て入力されたことの有無を判定して、連続して入力された履歴が残っている候補の表 示順序を上位にする。 The input confirmation character that has already confirmed input up to that point and Z or at this time Increase the display order of the possibility (continuity with the previous character string) immediately after the selected character (string) and the candidate. Alternatively, based on the past input history, it is determined whether or not the immediately preceding character string has been continuously input, and the display order of the candidates for which the continuously input history remains is ranked high.
[0090] (要素 D)前方文字列との関連性  [0090] (Element D) Relevance to Forward String
その時点までにすでに入力が確定している入力確定文字 (列)に関連し、当該文字 (列)の後方に入力される可能性 (前方文字列との関連性)の大き 、候補ほど表示順 序を上位にする。  Related to the input determined character (string) whose input has already been determined up to that point, the size of the possibility of being input behind the character (string) (Relationship with the forward character string) Make the order higher.
[0091] (要素 E)単独学習漢字 Z単語  (Element E) Independent Learning Kanji Z Word
学習辞書記録部 43に記録されている単独学習漢字 (単語)であるか否かによって、 表示順序を決定する。過去に入力(使用)されたことのある漢字 (単語)であれば、再 び、使用される可能性は大きい。したがって、単独学習漢字 (単語)として記録されて V、る候補の表示順序を上位にする。  The display order is determined depending on whether it is a single learning kanji (word) recorded in the learning dictionary recording unit 43 or not. If it is a kanji (word) that has been input (used) in the past, it is likely to be used again. Therefore, the display order of the candidate V recorded as a single learning kanji (word) is ranked high.
[0092] (要素 F)ペア学習漢字 Z単語  (Element F) Pair Learning Kanji Z Word
ペア学習辞書記録部 44に記録されて 、るペア学習漢字 (単語)であるか否かによ つて、表示順序を決定する。過去に入力(使用)されたことのある漢字 (単語)の組み 合わせであれば、再び、その組み合わせで使用される可能性は大きい。したがって、 ペア学習漢字 (単語)として記録されて 、る候補の表示順序を上位にする。  The display order is determined according to whether or not the pair learning kanji (word) is recorded in the pair learning dictionary recording unit 44 and whether or not it is a pair learning kanji (word). If it is a combination of kanji (word) that has been input (used) in the past, the possibility of being used in the combination is high again. Therefore, as a pair learning kanji (word), the display order of candidates is ranked high.
[0093] (要素 G)学習時期  (Element G) Learning period
候補漢字 (単語)に一致する学習漢字 (単語)が、学習辞書記録部 43および Zまた はペア学習辞書記録部 44に記録された時期に基づいて表示順序を決定する。記録 された時期が新しいものほど、ユーザにより選択される可能性が大きいので、最近記 録された漢字 (単語)の候補ほど表示順序を上位にする。  The display order is determined based on the time when the learning kanji (word) matching the candidate kanji (word) is recorded in the learning dictionary recording unit 43 and Z or the pair learning dictionary recording unit 44. The newer the recorded time is, the more likely it is to be selected by the user. Therefore, the more recently recorded kanji (word) candidates, the higher the display order.
[0094] 以上の各要素を条件にすることにより、ユーザに選択される可能性の大きい候補ほ ど、優先して候補文字表示領域 122 (図 14)に表示させることができるので、ユーザ は、少ないキー操作数で、所望の漢字を得ることが可能となる。  By setting each of the above elements as a condition, the candidate character display area 122 (FIG. 14) can be displayed preferentially as a candidate with a high possibility of being selected by the user. It is possible to obtain a desired kanji with a small number of key operations.
[0095] 上述したとおり、候補順序決定要素とは、候補順序決定部 32が、候補漢字 Z候補 単語の表示順序を決定するために必要な情報である。より具体的には、候補順序決 定部 32が、候補漢字 Z候補単語がユーザに選択される可能性の大きさを示す評価 値を算出するために使用する情報である。候補 (候補漢字 Z候補単語)ごとに候補 順序決定要素の情報が対応づけられて記録部 40の各 (記録)部に記憶されており、 候補順序決定部 32は、記録部 40の各部を参照して、候補ごとに候補順序決定要素 を特定することが可能となる。さらに、記録部 40には、候補順序決定部 32が特定した 候補順序決定要素からどのようにして上記評価値を算出するの力を規定した評価値 算出ルールが記憶されていてもよい。これにより、候補順序決定部 32は、特定した候 補順序決定要素と評価値算出ルールとにしたがって、候補ごとに評価値を求め、候 補の表示順序を決定することが可能となる。候補順序決定部 32の評価値算出方法 は、後述する。 As described above, the candidate order determining element is information necessary for the candidate order determining unit 32 to determine the display order of candidate kanji Z candidate words. More specifically, candidate order decision The determination unit 32 is information used to calculate an evaluation value indicating the degree of possibility that the candidate kanji character Z candidate word is selected by the user. Information on candidate order determination elements is associated with each candidate (candidate kanji Z candidate word) and stored in each (recording) unit of the recording unit 40. The candidate order determining unit 32 refers to each unit of the recording unit 40 It is then possible to identify candidate order decision factors for each candidate. Furthermore, the recording unit 40 may store an evaluation value calculation rule that defines how to calculate the evaluation value from the candidate order determination element specified by the candidate order determination unit 32. As a result, the candidate order determination unit 32 can obtain an evaluation value for each candidate according to the specified candidate order determination element and the evaluation value calculation rule, and can determine the candidate display order. The evaluation value calculation method of the candidate order determination unit 32 will be described later.
[0096] (候補順序決定部の構成)  (Configuration of Candidate Order Determination Unit)
図 22は、候補順序決定部 32のさらに詳細な構成を示すブロック図である。候補順 序決定部 32は、学習漢字判定部 131、ピンイン長判定部 132、連結度判定部 133、 出現頻度判定部 134、および、学習時期判定部 135を備えている。記録部 40 (図 12 、 16、 21)は、上述の各記憶部に加えて、出現頻度記録部 45および連結度記録部( 関連度記録部) 46を含んでいる。一時記憶部 3は、内部に、抽出候補一時記憶部 5 1、評価値一時記憶部 52、および、表示順序一時記憶部 53を含んでいる。  FIG. 22 is a block diagram showing a more detailed configuration of the candidate order determination unit 32. As shown in FIG. The candidate order determination unit 32 includes a learning kanji determination unit 131, a pinyin length determination unit 132, a connectivity degree determination unit 133, an appearance frequency determination unit 134, and a learning time determination unit 135. The recording unit 40 (FIGS. 12, 16, and 21) includes an appearance frequency recording unit 45 and a connectivity degree recording unit (association degree recording unit) 46 in addition to the above-described storage units. Temporary storage unit 3 internally includes extraction candidate temporary storage unit 51, evaluation value temporary storage unit 52, and display order temporary storage unit 53.
[0097] 一時記憶部 3内の抽出候補一時記憶部 51は、候補抽出部 31が抽出した候補 (以 下、抽出候補と称する)としての漢字および Zまたは単語を一時的に保持するもので ある。候補順序決定部 32は、抽出候補一時記憶部 51から抽出候補を読み出して、 各抽出候補の表示順序を決定する処理を行う。  An extraction candidate temporary storage unit 51 in the temporary storage unit 3 temporarily holds kanji characters and Z or words as candidates extracted by the candidate extraction unit 31 (hereinafter referred to as extraction candidates). . The candidate order determination unit 32 reads out the extraction candidates from the extraction candidate temporary storage unit 51, and performs processing of determining the display order of each extraction candidate.
[0098] 評価値一時記憶部 52は、候補順序決定要素と評価値算出ルールとに基づいて算 出された、上記各抽出候補に付与される評価値を一時的に保持するものである。表 示順序は、最終的に算出された評価値を基に決定されてもよい。  The evaluation value temporary storage unit 52 temporarily holds the evaluation value given to each of the extraction candidates calculated based on the candidate order determination element and the evaluation value calculation rule. The display order may be determined based on the finally calculated evaluation value.
[0099] 表示順序一時記憶部 53は、上記各抽出候補に対して決定された表示順序の情報 を一時的に保持するものである。候補順序決定部 32は、決定した表示順序を表示 順序一時記憶部 53に格納する。表示順序一時記憶部 53の表示順序の情報は、抽 出候補を表示部 12に表示する際に表示制御部 22によって参照される。 [0100] 候補順序決定部 32の学習漢字判定部 131は、抽出候補が、学習漢字 (単語)であ るか否か、および、それが単独学習であるかペア学習であるかを判定するものである 。具体的には、抽出された候補と一致する漢字 (単語)が学習辞書記録部 43内に記 録されている場合は、単独学習漢字 (単語)であると判定し、ペア学習辞書記録部 44 内に記録されている場合は、ペア漢字 (単語)であると判定する。学習漢字判定部 13 1は、判定結果を評価値一時記憶部 52の評価値または表示順序一時記憶部 53の 表示順序に反映させる。これにより、要素 Eおよび Zまたは要素 Fに基づく表示順序 を決定することができる。 The display order temporary storage unit 53 temporarily holds information of the display order determined for each of the extraction candidates. The candidate order determination unit 32 stores the determined display order in the display order temporary storage unit 53. Information on the display order of the display order temporary storage unit 53 is referred to by the display control unit 22 when displaying the extraction candidates on the display unit 12. The learning kanji judging unit 131 of the candidate order determining unit 32 judges whether the extracted candidate is a learning kanji (word) and whether it is single learning or pair learning. It is. Specifically, when a kanji (word) that matches the extracted candidate is recorded in the learning dictionary recording unit 43, it is determined to be a single learning kanji (word), and the pair learning dictionary recording unit 44 If it is recorded inside, it is judged as a paired kanji (word). The learning kanji judgment unit 131 reflects the judgment result on the evaluation value of the evaluation value temporary storage unit 52 or the display order of the display order temporary storage unit 53. This makes it possible to determine the display order based on elements E and Z or element F.
[0101] ピンイン長判定部 132は、入力受付部 21 (図 12、 16、 21)に入力されたアルファべ ット (列)の文字数と、対応する候補のピンインの文字数が一致するか否かを判定する ものである。例えば、漢字「「外字 2」(ピンイン: mei)」を得るために、「me」まで入力し ていたとする。この時点では、入力文字数は 2で、対応する候補漢字「「外字 2」 (mei ;)」のピンイン長は 3であるため、ピンイン長判定部 132は、ピンイン長不一致と判定す る。一方、ユーザによって「mei」まで入力された時点では、ピンイン長が一致すると 判定し、候補漢字「「外字 2」」の表示順序を上がるよう、評価値一時記憶部 52の評価 値または表示順序一時記憶部 53の表示順序を変更する。これにより、要素 Bに基づ く表示順序を決定することができる。  The pin-in length determination unit 132 determines whether or not the number of characters of the alphabet (row) input to the input reception unit 21 (FIGS. 12, 16, 21) matches the number of characters of the corresponding candidate pinyin. To determine the For example, suppose that you have entered up to "me" in order to obtain the kanji "" 外 外 2 "(pinyin: mei). At this point in time, the number of input characters is 2, and the pinyin length of the corresponding candidate kanji character "" 2 "(mei;)" is 3, so the pinyin length determination unit 132 determines that the pinyin lengths do not match. On the other hand, when “mei” is input by the user, it is determined that the pinyin length matches, and the evaluation value temporary storage unit 52 evaluation value or display order temporary is determined to raise the display order of the candidate kanji ““ extra character 2 ”. Change the display order of the storage unit 53. This makes it possible to determine the display order based on element B.
[0102] 連結度判定部 133は、その時点までにすでに入力が確定している入力確定文字 および Zまたは現時点で選択中の文字 (漢字 Z単語)とのつながり具合、つまり、連 結度に応じて次にユーザが所望する文字を予測して候補に挙げる判定を行うもので ある。連結度判定部 133は、連結度記録部 46に記録されている連結度テーブル 15 0を参照して、確定あるいは選択された前方文字列および Zまたは直前文字列との 連結度を判定する。前方文字列および Zまたは直前文字列との連結度が高 、もの ほど、ユーザが所望する文字である確率が高いとして、その表示順序を上位にする。 これにより、要素 Cおよび Zまたは要素 Dに基づく表示順序を決定することができる。 連結度記録部 46の連結度テーブル 150および連結度判定部 133の詳細につ ヽて は後述する。  [0102] Connectivity determination section 133 determines the degree of connection with the input determined character and the character Z or the character currently selected (kanji Z word) whose input has already been determined by that time, that is, according to the degree of connection. Next, the character desired by the user is predicted to be judged as a candidate. The connectivity determination unit 133 refers to the connectivity table 150 recorded in the connectivity recording unit 46 to determine the connectivity with the determined or selected front character string and Z or the previous character string. The higher the degree of connectivity with the forward character string and Z or the immediately preceding character string, the higher the probability that the character is the character desired by the user, and the display order is made higher. Thereby, the display order based on element C and Z or element D can be determined. Details of the connectivity degree table 150 and the connectivity degree determination unit 133 of the connectivity degree recording unit 46 will be described later.
[0103] 出現頻度判定部 134は、出現頻度記録部 45に記録される、既知の統計データに 基づいて、候補漢字 (単語)が出現する (使用される)頻度を判定するものである。出 現頻度が高いものほど、その文字がよく使用されていることになるので、ユーザにより 選択される可能性も大きい。したがって、出現頻度の高い候補ほど表示順序を上位 にする。これにより、要素 Aに基づく表示順序を決定することができる。 The appearance frequency determination unit 134 uses known statistical data to be recorded in the appearance frequency recording unit 45. Based on it, the frequency with which candidate kanji (words) appear (is used) is determined. The more frequently they occur, the more likely they will be used by the user, since the character will be used more frequently. Therefore, the higher the frequency of appearance, the higher the display order. Thereby, the display order based on element A can be determined.
[0104] 学習時期判定部 135は、候補漢字 (単語)に一致する学習漢字 (単語)が、学習辞 書記録部 43および Zまたはペア学習辞書記録部 44に記録された時期を判定するも のである。学習漢字 (単語)が記録された時期が新しいものほど、ユーザにより選択さ れる可能性も大きい。したがって、学習時期が最近の漢字 (単語)ほど上位になるよう 候補の表示順序を決定する。これにより、要素 Gに基づく表示順序を決定することが できる。 The learning time determination unit 135 determines the time when the learning kanji (word) matching the candidate kanji (word) is recorded in the learning dictionary recording unit 43 and Z or the pair learning dictionary recording unit 44. is there. The newer the time when the learned kanji (word) is recorded, the more likely it is to be selected by the user. Therefore, the display order of the candidates is determined so that the learning time is higher for the latest kanji (word). This makes it possible to determine the display order based on the element G.
[0105] 上記構成によれば、候補順序決定部 32は、上述のさまざまな条件に基づ 、て、候 補の表示順序を決定することができる。  According to the above configuration, the candidate order determination unit 32 can determine the candidate display order based on the various conditions described above.
[0106] これにより、ユーザの希望をより正確に反映した表示順序で候補を表示することが 可能となり、結果として、表音文字の入力に係る操作数について、より少ない操作数 で、所望の表意文字の入力を得ることが可能となる。  [0106] This makes it possible to display the candidates in a display order that more accurately reflects the user's wishes, and as a result, the number of operations related to the input of the phonetic characters can be reduced to the desired number of operations. It becomes possible to obtain input of characters.
[0107] (連結度の判定)  (Determination of connectivity)
次に、連結度判定部 133の動作について、さらに詳細に説明する。  Next, the operation of the connection degree determination unit 133 will be described in more detail.
[0108] 図 23は、連結度記録部 46に記録される、漢字 (単語)ごとの直前 (前方)文字列と の連結度を示した連結度テーブル 150の例を示す図である。図 23に示すとおり、連 結度テーブル 150は、漢字 (単語)ごとに格納されて 、る直前文字列との連続性情報 のアドレスおよび前方文字列との関連性情報のアドレスを検出するための連結度対 応表 151と、漢字 (単語)ごとに直前文字列との連続性を格納する連続性テーブル 1 52と、漢字 (単語)ごとに前方文字列との関連性を格納する前方関連性テーブル 15 3とを含んでいる。  FIG. 23 is a diagram showing an example of the connectivity degree table 150 recorded in the connectivity degree recording unit 46 and showing the connectivity degree with the immediately preceding (forward) character string for each kanji (word). As shown in FIG. 23, the continuity degree table 150 is stored for each kanji (word), and is used to detect the address of continuity information with the immediately preceding character string and the address of relevance information with the forward character string. Connectivity degree correspondence table 151 and continuity table 1 52 that stores continuity with immediately preceding character string for each kanji (word), and forward relevance that stores association with forward character string for each kanji (word) Table 15 contains three.
[0109] 例えば、「交通」という単語がユーザにより選択されると、連結度判定部 133は、直 前文字列としての「交通」のあとに連続する確率が高 、漢字 (単語)を連結度テープ ル 150から検索する。連結度判定部 133は、連結度対応表 151を参照し、直前文字 列「交通」との連続性情報力 連続性テーブル 152のアドレス「npi+l」に格納されて いることを検知する。 For example, when the word “traffic” is selected by the user, the connection degree determination unit 133 has a high probability of continuing after “traffic” as the immediately preceding character string, and the kanji (word) has a degree of connectivity Search from Table 150. The connectivity determination unit 133 refers to the connectivity correspondence table 151 and stores the continuity information strength with the immediately preceding character string “traffic” in the address “npi + l” of the continuity table 152 Detect that
[0110] 次に、連結度判定部 133は、連続性テーブル 152のアドレス「npi+l」を参照し、「 規則 (連続性 21)」、「大学 (16)」、「工具 (15)」· · ·の順に、連続性が高いことを検出 する。最後に、上述の各候補単語の表示順序を、「規則」「大学」「工具」 · · ·の順にす ることを決定する。  Next, the connectivity determination unit 133 refers to the address “npi + 1” of the continuity table 152, and “rule (continuity 21)”, “college (16)”, “tool (15)” · · · · · · · · Detects that the continuity is high. Finally, it is decided that the display order of each candidate word mentioned above is to be in the order of "rule" "college" "tool" · · · ·.
[0111] また、単語「上海」の入力が確定されたのちに、「交通」が選択されると、連結度判定 部 133は、前方文字列としての「上海」の後ろに関連して入力される確率が高 ヽ漢字 (単語)を連結度テーブル 150から検索する。連結度判定部 133は、連結度対応表 1 51を参照し、前方文字列「上海」との関連性情報が、前方関連性テーブル 153のァ ドレス「ppi+ s」に格納されて 、ることを検知する。  In addition, when “traffic” is selected after the entry of the word “Shanghai” is confirmed, the connectivity determination unit 133 is input in connection with “Shanghai” as a forward character string. Search the connectivity table 150 for ヽ kanji (words). The connectivity determination unit 133 refers to the connectivity correspondence table 151, and stores the relevancy information with the forward character string “Shanghai” in the address “ppi + s” of the forward relevancy table 153, Detect
[0112] 次に、連結度判定部 133は、前方関連性テーブル 153のアドレス「ppi+s」を参照 し、「大学 (関連性 10)」、「規則 (5)」、「警察 (3)」· · ·の順に、関連性が高いことを検 出する。最後に、上述の各候補単語の表示順序を、「大学」「規則」「警察」 · · ·の順に することを決定する。  Next, the connectivity determination unit 133 refers to the address “ppi + s” of the forward relevancy table 153 and selects “University (Relevancy 10)”, “Rule (5)”, “Police (3) · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · Detect that the relevance is high. Finally, it is decided that the display order of each candidate word mentioned above should be in the order of "college" "rule" "police" · · ·.
[0113] これにより、ユーザがよく入力する単語の並びにしたがって、候補漢字 (単語)の表 示順序を決定することができるので、ユーザの嗜好を反映した表示順序でもって候 補漢字 '単語を表示させることができる。結果として、ユーザのキー操作回数を減らし 、所望の漢字を得るためのユーザの入力操作を簡素化することが可能となる。  Thus, since the display order of candidate kanji (words) can be determined as well as the words that the user often inputs, the candidate kanji 'words are displayed in the display order that reflects the user's preference. It can be done. As a result, it is possible to reduce the number of key operations of the user and simplify the user's input operation for obtaining a desired kanji.
[0114] (連結度の判定一変形例)  (Modification of Judgment of Connectivity)
表示順序の決定を、要素 C (直前文字列との連続性)および要素 D (前方文字列と の関連性)の両方を用いて実行する場合、要素 Dに基づく判定結果を優先させても よい。この場合、上述の例で言えば、「交通」が選択された場合に、候補単語「規則」 を常に優先して表示するのではなぐ前方に「上海」が入力されている場合には、候 補単語「大学」を優先して表示することができる。  If the determination of display order is performed using both element C (continuity with immediately preceding character string) and element D (relevancy with forward character string), the judgment result based on element D may be prioritized . In this case, in the above example, if "traffic" is selected, the candidate word "rule" is not always prioritized and displayed, but if "Shanghai" is input ahead, it is The supplementary word "University" can be displayed preferentially.
[0115] ユーザが「上海交通大学」を頻繁に入力して 、る文字変換装置 1にお 、て、「上海 交通大学」を入力するときの文字変換装置 1の動作について説明する。ユーザが、「 上」(shang)の入力のために、ピンインの先頭文字「s」を入力すると、上述の手順で 、「上」を候補漢字として表示する。ユーザによって「上」が選択されると、「上」で始ま る単語を候補単語として表示する。 An operation of the character conversion device 1 when the user frequently inputs “Shanghai traffic university” and inputs “Shanghai traffic university” to the character conversion device 1 will be described. When the user inputs a pinyin leading character "s" for "shang" input, "upper" is displayed as a candidate kanji in the above-mentioned procedure. Starts with "up" when "up" is selected by the user Displayed as candidate words.
[0116] ユーザによって、「上海」が選択されると、確定文字表示領域 123 (図 14)上に単語 「上海」が表示する。同時に、文字変換装置 1の候補抽出部 31は、入力確定文字「 上海」に続いて入力される候補単語を抽出する。候補順序決定部 32は、「上海」に 続く確率の高 ヽ単語を上位に表示するために、図 22の連結度記録部 46を参照し、 候補単語の表示順序を決定する。  When “Shanghai” is selected by the user, the word “Shanghai” is displayed on the determined character display area 123 (FIG. 14). At the same time, the candidate extraction unit 31 of the character conversion device 1 extracts candidate words input following the input determined character "Shanghai". The candidate order determination unit 32 refers to the connectivity recording unit 46 in FIG. 22 to determine the display order of candidate words in order to display the high probability word following “Shanghai” in the upper order.
[0117] まず、連結度判定部 133は、連結度対応表 151 (図 23)に基づいて、連続性テー ブル 152のアドレス「npi+ s」を参照し、単語候補を、単語「上海」に直接連続する確 率 (連続度)の高い順に並べる。すなわち、図 23に示す例では、「大学(15)」「交通( 13)」「警察(12)」「規則 (8)」. . .の順に候補が求まる。続いて、連結度判定部 133 は、入力確定文字「上海」の前方に位置する単語との関連性に基づいて、「上海」の あとに続く候補単語の表示順序を並べ替える。しかし、ここでは、「上海」の前方に位 置する漢字,単語がないため、前方文字列との関連性による表示順序の変更は実行 されない。  First, the connectivity determination unit 133 refers to the address “npi + s” of the continuity table 152 based on the connectivity correspondence table 151 (FIG. 23), and directly transmits the word candidate to the word “Shanghai”. Arrange in descending order of continuity probability. That is, in the example shown in FIG. 23, candidates are obtained in the order of “University (15)”, “Transportation (13)”, “Police (12)”, “Rule (8)”,. Subsequently, the connectivity determination unit 133 rearranges the display order of candidate words following “Shanghai” based on the relevancy to the word located in front of the input determined character “Shanghai”. However, here, since there are no kanji characters or words located in front of “Shanghai”, the change in display order due to the relationship with the front character string is not executed.
[0118] 上記手順で表示された候補単語の中から、ユーザによって「交通」が選択されると、 文字変換装置 1は、続いて、入力確定文字「交通」に直接連続する候補単語を上位 に表示するための表示順序を決定する。  When “traffic” is selected by the user from among the candidate words displayed in the above procedure, the character conversion device 1 subsequently ranks candidate words that are directly consecutive to the input determined character “traffic”. Determine the display order to display.
[0119] 連結度判定部 133は、連続性テーブル 152のアドレス「npi+l」を参照し、連続度 に基づいて「規則 (21)」、「大学 (16)」、「工具 (15)」、「警察(14)」. . .の順に候補 を求める。次に、前方関連性テーブル 153のアドレス「ppi+ s」を参照して、「交通」の 前方に位置する漢字 ·単語 (ここでは、「上海」)との関連性 (関連度)に基づいて、「交 通」のあとに続く候補単語の表示順序を並べ替える。より具体的には、図 23に示す 例では、前方関連性テーブル 153によれば、候補単語の「上海」との関連度は、順に 、「大学 (10)」、「規則 (5)」、「警察 (3)」、「工具 (2)」. . .となっている。  The connectivity determination unit 133 refers to the address “npi + l” of the continuity table 152, and based on the continuity, “rule (21)”, “college (16)”, “tool (15)” , "Police (14)"... Next, referring to the address “ppi + s” of the forward relevance table 153, based on the relevance (degree of relevance) with the kanji and words (here, “Shanghai”) located ahead of “traffic” Rearrange the display order of candidate words that follow “Cross”. More specifically, in the example shown in FIG. 23, according to the forward relevance table 153, the relevance of the candidate word to "Shanghai" is, in order, "University (10)", "Rule (5)", "Police (3)" and "Tool (2)"..
[0120] そこで、連結度判定部 133は、上記候補単語の関連度を優先し、関連度の順に表 示順序を変更する。この場合、「上海交通」に続ぐ候補単語の表示順序は、「規則」 、「大学」、「工具」、「警察」. . .の順から、「大学」、「規則」、「警察」、「工具」. . .の 順に変更される。あるいは、連結度および関連度の合計に基づいて、表示順序を変 更してもよい。また、連結度および関連度の平均値、最大値に基づいて表示順序を 変更してもよい。例えば、連結度と関連度との合計を評価値として算出することを規 定した評価値算出ルールが、記録部 40に記憶されており、候補順序決定部 32は、 上記評価値算出ルールにしたがって、連結度および関連度の合計を評価値として算 出し、その評価値に基づ 、て表示順序を決定することができる。 Therefore, the degree-of-connection determining unit 133 gives priority to the degree of association of the candidate word, and changes the display order in the order of the degree of association. In this case, the display order of candidate words following “Shanghai traffic” is “University”, “Rule”, “Police” in the order of “Rule”, “University”, “Tool”, “Police”... , "Tool" ... is changed in order. Alternatively, change the display order based on the sum of connectivity and relevancy. You may change it. Also, the display order may be changed based on the average value and the maximum value of connectivity and relevance. For example, an evaluation value calculation rule that specifies that the sum of the degree of connectivity and the degree of association is calculated as an evaluation value is stored in the recording unit 40, and the candidate order determination unit 32 determines in accordance with the evaluation value calculation rule. The sum of connectivity and relevance can be calculated as the evaluation value, and the display order can be determined based on the evaluation value.
[0121] 以上のことから、ユーザの嗜好に沿った、より精度の高い表示順序でもって候補漠 字 ·単語を表示させることができるので、ユーザのキー操作回数を減らすことができる 。結果として、所望の漢字を得るためのユーザの入力操作を簡素化することが可能と なる。 From the above, it is possible to display candidate vegs and words in a display order with higher accuracy in line with the preference of the user, so it is possible to reduce the number of key operations of the user. As a result, it is possible to simplify the user's input operation to obtain the desired kanji.
[0122] (表示順序決定手順)  (Display Order Determination Procedure)
本実施形態では、上述の各候補順序決定要素を組み合わせて、抽出された候補 の表示順序を決定する。以下にその一例について説明するが、本発明は、以下で説 明する組み合わせに限定されず、上述の各要素を適宜組み合わせることより表示順 序を決定することが可能である。  In this embodiment, the above-mentioned candidate order determining elements are combined to determine the display order of the extracted candidates. Although an example will be described below, the present invention is not limited to the combination described below, and the display order can be determined by appropriately combining the above-described respective elements.
[0123] 図 24は、候補順序決定部 32が表示順序を決定するときの、候補の分類手順を示 す図である。候補順序決定部 32は、まず、所定の条件(上記要素 A〜Gおよびその 組み合わせ)によって、表示順序ごとにいくつかのグループに候補を分類し、そのグ ループに内において、最終的な表示順序を決定する。  FIG. 24 is a diagram showing a classification procedure of candidates when the candidate order determination unit 32 determines the display order. The candidate order determination unit 32 first classifies the candidates into several groups for each display order according to a predetermined condition (the above-mentioned elements A to G and their combination), and the final display order in that group. Decide.
[0124] 候補抽出部 31によって抽出された候補単語 Gは、まず、一時記憶部 3の抽出候補 一時記憶部 51 (図 22)に格納される。格納された候補単語 Gに対し、学習漢字判定 部 131 (図 22)は、学習単語であるカゝ否かの判定を行う(S301)。また、ピンイン長判 定部 132は、候補単語 Gが抽出された時点で入力されているピンイン長(アルファべ ットの文字数)と、候補単語 Gの各ピンイン長との比較を行う(S302)。  The candidate word G extracted by the candidate extraction unit 31 is first stored in the extraction candidate temporary storage unit 51 (FIG. 22) of the temporary storage unit 3. For the stored candidate word G, the learning kanji judgment unit 131 (FIG. 22) judges whether or not the learning word is Ka (step S301). Also, the pinyin length determination unit 132 compares the pinyin length (number of characters of the alphabet) inputted when the candidate word G is extracted with each pinyin length of the candidate word G (S302). .
[0125] ここで、候補単語 Gのうち、学習単語であり、かつ、ピンイン長も一致すると判定され た候補単語は、表示順序が最上位のグループ Aに分類される。学習単語であるが、 ピンイン長は一致しな 、と判定された候補単語は、グループ Aに次 、で上位のダル ープ Bに分類される。学習単語ではないが、ピンイン長は一致すると判定された候補 単語は、グループ B下位のグループ Cに分類される。学習単語でもなぐピンイン長も 一致しな ヽと判定された候補単語は、最下位のグループ Dに分類される。 Here, among candidate words G, candidate words that are learning words and that are determined to have a matching pinyin length are classified into group A in the highest display order. Candidate words that are determined to be learning words but do not match the pinyin length are classified into group A in the upper layer dullup B next. Candidate words that are not learning words but whose pinyin length is determined to match are classified into group C under group B. Even Pinyin long as learning word The candidate words judged to be unmatched ヽ are classified into the lowest group D.
[0126] グループ A、 Bに分類された、学習単語である候補単語に対し、学習漢字判定部 1 31は、さら〖こ、単独学習単語であるかペア学習単語であるかの判定を行う(S303)。 また、連結度判定部 133は、直前文字列との連続性の有無についての判定を行う(S 304)。 For candidate words classified as learning words classified into groups A and B, learning kanji judgment unit 131 further judges whether it is a single learning word or a pair learning word ( S303). Further, the connectivity degree determination unit 133 determines whether or not there is continuity with the immediately preceding character string (S 304).
[0127] ここで、グループ Aの候補単語のうち、ペア学習単語であって、直前文字列との連 続性ありと判定された候補単語は、最上位のグループ A1に分類される。ペア学習単 語であるが、直前文字列との連続性なしと判定された候補単語は、上位のグループ A2に分類される。ペア学習単語でないが、直前文字列との連続性ありと判定された 候補単語は、下位のグループ A3に分類される。ペア学習単語でなぐ直前文字列と の連続性もないと判定された候補単語は、最下位のグループ A4に分類される。ダル ープ Bの候補単語にっ 、ても同様に分類される。  Here, among the candidate words of group A, candidate words that are paired learning words and determined to have continuity with the immediately preceding character string are classified into the topmost group A1. Candidate words that are judged as pair learning words but have no continuity with the immediately preceding character string are classified into the upper group A2. Candidate words that are judged not to be paired learning words but have continuity with the immediately preceding character string are classified into lower group A3. Candidate words judged not to be consecutive with the immediately preceding character string that is not a paired learning word are classified into the lowest group A4. Duppe B candidate words are classified similarly.
[0128] 一方、グループ C、 Dに分類された候補単語に対しては、連結度判定部 133が直 前文字列との連続性の有無を判定して(S305)、上位のグループ C (D) 1と下位のグ ループ C (D) 2とに分類する。  On the other hand, for candidate words classified into groups C and D, connectivity degree determining section 133 determines the presence or absence of continuity with the immediately preceding character string (S 305), and upper group C (D 1) and the lower group C (D) 2
[0129] 最終的に、全候補単語 (G)は、上位のグループ力も順に、グループ Al、グループ A2、グノレープ A3、グノレープ A4、グループ Bl、グループ B2、グループ B3、グルー プ B4、グループ Cl、グループ C2、グループ Dl、グループ D2、の小グループに分 類される。グループ A、 Bの各小グループは、さらに学習時期(要素 G)によって順序 が決定され (最近学習された漢字ほど上位)、グループ C、 Dの各小グループは、さら に出現頻度 (要素 A)によって順序が決定される(一般によく使われる単語ほど上位)  [0129] Finally, all candidate words (G) are, in order of upper group strength, group Al, group A2, groupo A3, groupe A4, group Bl, group B2, group B2, group B3, group B4, group Cl, group It is divided into small groups of C2, group Dl and group D2. The order of each small group in groups A and B is further determined by the learning time (element G) (the higher the kanji learned recently, the higher rank), and the small groups in groups C and D have their frequency of occurrence (element A). Determines the order (most commonly used words are higher)
[0130] なお、上述の実施形態では、学習された候補漢字'単語を、所定の条件で (すなわ ち、新しく学習された順に)並べ替えると説明したが、候補として抽出された候補漢字 の後に、過去に学習された候補単語として並べてもよい。あるいは、過去に学習され た単語が候補漢字より優先して表示されるように並べてもよい。また、候補漢字を含 む過去に使用された単語が複数ある場合は、文字数が少ない順に並べてもよい。あ るいは、時間的な経過として、学習された順もしくはその逆順に並べてもよい。 [0131] (変形例 3—四声の入力) In the above embodiment, it is explained that the learned candidate kanji 'words are rearranged under a predetermined condition (that is, in the order of newly learned), but the candidate kanji extracted as a candidate is You may arrange later as the candidate word learned in the past. Alternatively, the words learned in the past may be arranged so that they are displayed prior to candidate kanji. If there are multiple words used in the past, including candidate kanji, they may be arranged in ascending order of the number of characters. Alternatively, it may be arranged in the order of learning or in the reverse order as time lapse. (Modification 3—Input of Four Voices)
本実施形態では、四声の区別を無視してアルファベット (列)とピンインとを対応させ る構成について説明したが、本発明の文字変換装置 1は上記構成に限定されない。 四声の区別を行って、アルファベット (列)とピンインとを対応させる構成も可能である 。例えば、四声コード(例えば、 PCでの入力では一声から四声を 1から 4、軽声を 5で 表現している)を含めて、図 15に示す AZP対応表 141を作成すればよい。より具体 的には、図 15に示す AZP対応表 141および PZC対応表 142において、ピンイン「 shang」は、漢字「上」と関連付けられていることがわかる。ここで、「上」の四声ありの ピンインは「三声の shang」であるため、 AZP対応表 141の「cp— si」に対応する「sh ang」を「511&1183」と記述することで、四声ありのピンイン「shang3」と「上」とを関連付 けることができる。 In the present embodiment, the configuration in which the alphabet (string) and the pinyin are made to correspond to each other by ignoring the distinction of four voices has been described, but the character conversion device 1 of the present invention is not limited to the above configuration. A configuration is also possible in which four voices are distinguished and alphabets (rows) correspond to pinyins. For example, an AZP correspondence table 141 shown in FIG. 15 may be created including a four-voice code (for example, one to four voices are represented by one to four voices and five light voices for PC input). More specifically, in the AZP correspondence table 141 and the PZC correspondence table 142 shown in FIG. 15, it can be seen that pinyin "shang" is associated with the Chinese character "upper". Here, since the pin-in with the four voices of “upper” is “three-voice shang”, “sh ang” corresponding to “cp— si” in the AZP correspondence table 141 should be described as “511 & 11 8 3”. Then, it is possible to associate pinyin "shang3" with "four voices" with "on".
[0132] 〔実施形態 2〕  Second Embodiment
本発明の文字変換装置 1は、さらに、上述の候補順序決定要素に優先度の情報を 付与し、候補漢字 (単語)の表示順序を決定するときの条件 (評価値算出ルール)を、 ユーザが任意に設定できるように構成してもよい。表示順序を決定するときの条件を ユーザが任意に設定できるので、文字変換装置 1を備えたマルチファンクションプリ ンタにお 、て、ユーザの嗜好に合致した表示順序で候補を表示することが可能とな る。結果として、所望の漢字を得るための入力操作を簡素化することができる。  The character conversion device 1 of the present invention further adds priority information to the above-described candidate order determination element, and the user uses the condition (evaluation value calculation rule) for determining the display order of candidate kanji (words). It may be configured to be set arbitrarily. Since the conditions for determining the display order can be arbitrarily set by the user, it is possible to display the candidates in the display order that matches the user's preference on the multifunction printer equipped with the character conversion device 1. Become. As a result, the input operation for obtaining the desired kanji can be simplified.
[0133] (文字変換装置の構成)  (Configuration of Character Conversion Device)
本実施形態における文字変換装置 1は、上述の実施形態 1の構成に加えて、文字 入力制御部 30の内部に、候補順序決定要素設定部 (候補順序決定要素設定手段) 35を備えており、記録部 40の内部に、さらに候補順序決定要素記録部 47を含んで いる(図 12)。  The character conversion device 1 in the present embodiment includes a candidate order determination element setting unit (candidate order determination element setting unit) 35 in the character input control unit 30 in addition to the configuration of the above-described embodiment 1, The candidate order determination factor recording unit 47 is further included in the recording unit 40 (FIG. 12).
[0134] 本実施形態では、図 22に示す評価値一時記憶部 52に、候補ごとに算出した評価 値を一時的に記憶し、その評価値に基づいて表示順序を決定する。すなわち、候補 順序決定要素設定部 35によって設定された、優先度の高い候補順序決定要素ほど 、上記評価値の算出により大きな影響を与える。候補順序決定要素設定部 35が設 定した候補順序決定要素およびその優先度は、候補順序決定要素記録部 47に記 録される。 In the present embodiment, evaluation values calculated for each candidate are temporarily stored in the evaluation value temporary storage unit 52 shown in FIG. 22, and the display order is determined based on the evaluation values. That is, the higher the priority order candidate determination element set by the candidate order determination element setting unit 35, the larger the influence on the calculation of the evaluation value. The candidate order determining element set by the candidate order determining element setting unit 35 and the priority thereof are recorded in the candidate order determining element recording unit 47. Will be recorded.
[0135] まず、評価値一時記憶部 52に記憶される候補ごとの評価値のデータ構造について 説明する。  First, the data structure of the evaluation value for each candidate stored in the evaluation value temporary storage unit 52 will be described.
[0136] 図 25は、候補ごとに算出した表示順序の評価値を記憶するデータ構造の一例を示 す図である。本実施形態では、候補ごとの評価値を 16ビットで表現する。すなわち、 図 25に示す 16の領域に、ある候補漢字 (単語)の特性 (要素 A〜G)を示す「0」また は「1」の値を記憶する。本実施形態では、ビット位置は、評価値の高低に関連があり 、上位ビットになるにつれて、評価値の上位桁を表すようになっている。よって、優先 度の高い候補順序決定要素に関する判定結果は、上位ビットに反映させる。すなわ ち、評価値算出ルールは、候補漢字 (単語)の候補順序決定要素要素から「0」また は「1」の値をどのように求めるかを規定するとともに、そのように求まった値を、評価 値のどのビット位置に反映させるかを規定している。  FIG. 25 is a diagram showing an example of a data structure for storing evaluation values of display order calculated for each candidate. In this embodiment, the evaluation value for each candidate is expressed in 16 bits. That is, values of “0” or “1” indicating characteristics (elements A to G) of a candidate kanji (word) are stored in 16 areas shown in FIG. In the present embodiment, the bit position is associated with the high and low of the evaluation value, and as it becomes the upper bit, it represents the high-order digit of the evaluation value. Therefore, the judgment result on the candidate order determination element with high priority is reflected in the upper bits. That is, the evaluation value calculation rule specifies how to calculate the value of “0” or “1” from the candidate order determining element element of the candidate kanji (word), and also determines the value thus calculated. It defines which bit position of the evaluation value is to be reflected.
[0137] このようなデータ構造によれば、文字変換装置 1が候補ごとの評価値を算出すると きの、評価値に与える要素の重要度をビット位置によって調節することが可能となる。 つまり、重要な要素ほど上位位置に割り当てているので、ビットの ON「1」ZOFF「0」 操作を行うことのみで、候補の表示順序の評価値を容易に適切に数値化することが できる。また、各要素に対して個別に変数を割り当てると、表示順序を決定するため に確保すべき評価値のデータサイズが大きくなる。よって、各要素による判定結果を 1ビットに割り当てていることにより、データサイズの削減を実現することができる。  According to such a data structure, it is possible to adjust the degree of importance of the element given to the evaluation value by the bit position when the character conversion device 1 calculates the evaluation value for each candidate. That is, since important elements are assigned to higher positions, the evaluation value of the display order of candidates can be easily quantified appropriately by only performing the bit ON "1" ZOFF "0" operation. Also, if variables are individually assigned to each element, the data size of the evaluation value to be secured to determine the display order will be large. Therefore, the data size can be reduced by allocating the determination result of each element to one bit.
[0138] 候補順序決定要素記録部 47は、候補の表示順序を決定するための候補順序決定 要素の優先度情報を記録するためのものである。図 26の(a)および図 26の(b)は、 候補順序決定要素記録部 47に記録される優先度情報の一例を示す図である。  The candidate order determination element recording section 47 is for recording priority information of candidate order determination elements for determining the display order of candidates. (A) of FIG. 26 and (b) of FIG. 26 are diagrams showing an example of priority information recorded in the candidate order determining element recording section 47.
[0139] 優先度情報は、候補順序決定要素ごとに、その候補順序決定要素に関する判定 結果を、評価値のデータ構造のどのビット位置に反映させるかを示すビット位置情報 を含んでいる。図 26の(a)に示す例では、要素 Bの判定結果を図 25に示すビット位 置 12の領域に格納するよう設定されている。すなわち、ピンイン長が一致した場合は 、ビット位置 12の領域に「1」が、一致しなければ「0」が評価値として記憶される。高い ビット位置を割り当てられている候補順序決定要素ほど、優先度が高ぐ候補の評価 値を定めるのに与える影響が大きい。 The priority information includes, for each candidate order determination element, bit position information indicating which bit position of the data structure of the evaluation value the determination result on the candidate order determination element is to be reflected. In the example shown in (a) of FIG. 26, the determination result of the element B is set to be stored in the area of the bit position 12 shown in FIG. That is, when the pin-in length matches, "1" is stored in the area of bit position 12; otherwise, "0" is stored as the evaluation value. Evaluation of candidate with higher priority for candidate order determinant assigned higher bit position The impact on determining the value is large.
[0140] なお、本実施形態では、評価値を 16ビットで表現し、ビット位置 0〜 12の領域を用 いて評価値を算出しているが、本発明はこの例に限定されない。候補順序決定要素 数を減らしたり、新たな要素の追加または各要素の評価値としてのレンジ幅の変更を したりする場合は、それらが十分に納まるビット長にすることも可能である。例えば、図 26の(a)および図 26の(b)に示す例において、出現頻度を 8ビットで表現しているが 、出現頻度の差を明確にするために 16ビット確保する、あるいは、出現頻度の差を重 視しな 、ために 4ビットに縮小することなどが可能である。また、空きのビット位置 13〜 15を用いてもよい。  In the present embodiment, the evaluation value is expressed in 16 bits and the evaluation value is calculated using the area of bit positions 0 to 12, but the present invention is not limited to this example. When reducing the number of candidate order determining elements or adding new elements or changing the range width as the evaluation value of each element, it is also possible to make the bit length sufficient to accommodate them. For example, in the example shown in (a) of FIG. 26 and (b) of FIG. 26, although the appearance frequency is expressed by 8 bits, 16 bits are secured to clarify the difference in the appearance frequency or It is possible to reduce to 4 bits, etc., in order not to emphasize the difference in frequency. Also, empty bit positions 13 to 15 may be used.
[0141] 候補順序決定要素設定部 35は、入力部 11 (図 12)を介してユーザが指定する候 補順序決定要素の優先度を設定し、優先度情報を候補順序決定要素記録部 47に 格納するものである。  The candidate order determination element setting unit 35 sets the priority of the candidate order determination element specified by the user via the input unit 11 (FIG. 12), and the priority information is stored in the candidate order determination element recording unit 47. It is something to store.
[0142] より具体的には、例えば、ユーザが候補順序決定要素の優先度設定メニューを、操 作パネル 2を用いて文字変換装置 1に指示すると、操作パネル 2の表示部 12は、候 補順序決定要素の優先順位の指定を促す操作画面を表示する。例えば、「(各要素 (A〜G)を)優先したい順に指示してください。」などのメッセージとともに、優先度を 設定できる候補順序決定要素一覧を表示すればよい。ユーザが、優先したい順に表 示されている各要素を指示すると、決定する力否かを問うメッセージの表示を行う。例 えば、「変更を決定する場合は、決定キーを押してください。」などのメッセージを表 示すればよい。  More specifically, for example, when the user instructs the character conversion device 1 using the operation panel 2 to set the priority setting menu of candidate order determination elements, the display unit 12 of the operation panel 2 displays the candidate Display an operation screen prompting to specify the priority of the order determining element. For example, along with a message such as “Indicate in the order you want each element (A to G) to be prioritized”, a list of candidate order determining elements for which priority can be set may be displayed. When the user designates each element displayed in the order desired to be prioritized, a message asking whether to determine the power is displayed. For example, you may display a message such as "Please press the Enter key to make a change."
[0143] ユーザが操作パネル 2の決定キーを押すと、候補順序決定要素設定部 35は、入力 された指示信号に応じて、低いビット位置から、ユーザが指定した優先度が低い候補 順序決定要素の領域として順に割り当て、各候補順序決定要素の判定結果を格納 するビット位置の情報を変更する。  When the user presses the determination key of operation panel 2, candidate order determination element setting unit 35 determines a candidate order determination element having a low priority designated by the user from the low bit position according to the input instruction signal. Allocating in order as the area of, change the information of the bit position storing the judgment result of each candidate order decision element.
[0144] 例えば、ユーザが、図 26の(a)に示した例の状態から、要素 Fのペア学習漢字 Z単 語を最も優先度の高い候補順序決定要素として、順に、要素 E、要素 F、要素 B、要 素 C、要素 D、要素 AZGを指定したとすると、優先度情報は、図 26の (b)に示すとお りに変更される。図 26の (b)に示す例では、要素 Fの判定結果力 ビット位置 12の領 域に格納されることになる。 For example, from the state of the example shown in (a) of FIG. 26, the user determines element E, element F in order with the pair learning kanji Z word of element F as the candidate order determining element with the highest priority. If element B, element C, element D, and element AZG are specified, the priority information is changed as shown in (b) of FIG. In the example shown in (b) of FIG. 26, the judgment result of element F is the range of bit position 12 Will be stored in the area.
[0145] 上記構成によれば、候補順序決定部 32の各部(図 22)は、候補順序決定要素記 録部 47が指定する候補順序決定要素の優先度情報に基づ 、て、判定結果に係る 評価値を指定された評価値一時記憶部 52のビット位置に格納する。  According to the above configuration, each part (FIG. 22) of candidate order determination unit 32 determines the determination result based on the priority information of the candidate order determination elements specified by candidate order determination element storage unit 47. The evaluation value is stored in the bit position of the designated evaluation value temporary storage unit 52.
[0146] これにより、表示順序を決定するための評価値を、ユーザが優先させたい候補順序 決定要素をもとに算出することが可能となるので、よりユーザの嗜好に合致した表示 順序で候補を表示させることが可能となる。  Thus, the evaluation value for determining the display order can be calculated based on the candidate order determination factor that the user wants to prioritize, so the candidates can be displayed in a display order that more closely matches the user's preference. Can be displayed.
[0147] なお、この候補順序決定要素の優先度情報は、候補漢字と候補単語とで別に設定 できる構成としてもよいし、候補漢字'候補単語の両方の表示順序に同様に適用する 1つの優先度情報を設定できる構成としてもよい。候補漢字と候補単語とで別に設定 できる構成とした場合、優先度の変更操作を、 1回の優先度設定メニュー呼び出し操 作によって、候補漢字、候補単語の順に候補順序決定要素の優先度変更操作を行 えるようにしてもよいし、候補漢字用、候補単語用それぞれの優先度設定メニュー呼 び出し操作および優先度変更操作を行えるようにしてもよ!ヽ。  The priority information of the candidate order determining element may be set separately for the candidate kanji and the candidate word, or may be similarly applied to the display order of both candidate kanji 'candidate words. The configuration may be such that the degree information can be set. If the configuration is such that the candidate kanji and the candidate word can be set separately, the priority change operation can be changed by changing the priority of the candidate order determination element in order of the candidate kanji and the candidate word by one priority setting menu call operation. You may be able to select the priority setting menu for candidate kanji and candidate words, and change the priority.
[0148] また、設定終了キーを設け、設定終了キーを押した時点までの内容に関して、所望 の優先度情報の変更を行うようにしてもよい。例えば、候補順序決定要素の優先度 の変更途中で設定終了キーを押した場合には、優先度の変更内容を反映しないよう にし、候補順序決定要素の変更だけを反映さえるなどすることが可能である。  In addition, a setting end key may be provided, and desired priority information may be changed with respect to the contents up to when the setting end key is pressed. For example, when the setting end key is pressed in the middle of changing the priority of the candidate order determination element, it is possible not to reflect the content of the change in priority but to reflect only the change of the candidate order determination element. is there.
[0149] 次に、図 22に示す候補順序決定部 32の各部の判定結果に基づいて、図 25に示 す評価値を算出する方法についてより詳細に説明する。なお、以下で説明する例で は、優先度は、図 26の(a)に示すとおりに設定されているものとする。  Next, the method of calculating the evaluation value shown in FIG. 25 will be described in more detail based on the determination results of each part of the candidate order determination unit 32 shown in FIG. In the example described below, the priority is set as shown in (a) of FIG.
[0150] まず、候補抽出部 31が抽出した候補は、一旦抽出候補一時記憶部 51に格納され る。評価値一時記憶部 52において、格納された候補ごとに、その評価値を表す 16ビ ットの領域(図 25)が確保される。  First, the candidate extracted by the candidate extraction unit 31 is temporarily stored in the extraction candidate temporary storage unit 51. In the evaluation value temporary storage unit 52, for each candidate stored, a 16-bit area (FIG. 25) representing the evaluation value is secured.
[0151] 次に、候補が学習漢字 (単語)である場合には、学習時期判定部 135が、そうでな い場合には、出現頻度判定部 134が、学習時期または出現頻度をそれぞれ算出し、 評価値算出ルールに基づいて、算出結果をビット位置 0〜7 (図 25)に格納する。  Next, when the candidate is a learned kanji (word), the learning time determination unit 135 calculates the learning time or the appearance frequency, respectively, otherwise the appearance frequency determination unit 134 calculates the learning time or the appearance frequency. , Store the calculation results in bit positions 0 to 7 (Fig. 25) based on the evaluation value calculation rule.
[0152] 出現頻度は、あら力じめ大量の新聞や雑誌などの様々な文書文字データについて 統計的に文字および単語ごとに求めた使用頻度であり、頻度が大きいほど一般的に よく使われる文字 '単語であるとする。本実施形態では、出現頻度の領域を 8ビット確 保しているので、出現頻度に基づいて、評価値算出ルールにしたカ^、、候補の評価 値を 0〜255の間で定めることができる。 The appearance frequency is about various document character data such as a large amount of newspapers and magazines. It is the frequency of use statistically determined for each letter and word, and the larger the frequency, the more commonly used letter 'word' is assumed. In this embodiment, since the area of appearance frequency is secured at 8 bits, it is possible to set the evaluation value of the candidate between 0 and 255 based on the appearance frequency. .
[0153] 学習時期についても同様に、最近学習されたものほど評価値が高くなるよう 0〜25 5の間で定めることができる。学習時期とは、候補漢字 (単語)がいつ学習されたかを 表す。評価値算出ルールについて具体的に説明すれば以下のとおりである。例えば 、学習時期を X、学習できる最大個数を Nとし、学習できる漢字,単語の学習した時点 を n、現時点の学習できる時点を mとすると任意の漢字 ·単語の学習時期 Xは、以下 の式で求まる。 Similarly, the learning time can be determined between 0 and 255 so that the more recently learned one has a higher evaluation value. The learning time indicates when the candidate kanji (word) was learned. It will be as follows if an evaluation value calculation rule is explained concretely. For example, assuming that the learning time is X, the maximum number of learning is N, the learning kanji and the word learning time are n, and the current learning time is m, the learning time for any kanji and word is X It is determined by
m— n≥0のとき  m-when n 0 0
X=N+m— n— 1…(式 1 1)  X = N + m-n-1 ... (Equation 1 1)
m— n< 0のとき  When m-n <0
X=n—m—l…(式 1— 2)  X = n-m-l ... (Equation 1-2)
ここで、学習時期 Xにおいて、 Xの値が小さいほど過去に学習されたものとする。  Here, it is assumed that the learning time X has been learned in the past as the value of X is smaller.
[0154] なお、上記式の Xの値域が、学習時期の評価値として確保されているビット長の範 囲 (本実施形態では 8ビット)を越える場合は、 Xの値をビット長の範囲への数値変換 や、ビット長の変更を行えばよい。 If the value range of X in the above equation exceeds the bit length range (8 bits in this embodiment) secured as the evaluation value of the learning time, the value of X is set to the bit length range. You can do numerical conversion of or change the bit length.
[0155] ビット位置 8〜12には、それぞれ、要素 D、 C、 F、 E、 Bについての判定結果が格納 される。学習漢字判定部 131、ピンイン長判定部 132、および、連結度判定部 133はBit positions 8 to 12 store the determination results for elements D, C, F, E, and B, respectively. The learning kanji determination unit 131, the pinyin length determination unit 132, and the connection degree determination unit 133
、候補の特性が条件を満足していると判定した場合には「1」、そうでない場合には「0, “1” if it is determined that the characteristic of the candidate satisfies the condition, “0” otherwise.
」を指定されたビット位置にセットする。例えば、連結度判定部 133は、候補がすでに 入力されている前方文字列と関連性ありと判定した場合には、ビット位置 8に「1」をセ ッ卜する。 Is set to the designated bit position. For example, if it is determined that the candidate is related to the forward character string to which the candidate has already been input, “1” is set in bit position 8.
[0156] すべての候補について表示順序の評価値を算出すると、候補順序決定部 32は、 候補を評価値一時記憶部 52に記憶される評価値の大き 、順に並べ替え、最終的な 表示順序を表示順序一時記憶部 53に格納する。最後に、表示制御部 22 (図 12)は 、表示順序一時記憶部 53に記憶されている上記表示順序に基づいて、候補を表示 部 12に表示する。 When the evaluation value of the display order is calculated for all the candidates, the candidate order determination unit 32 rearranges the candidates in order of the evaluation value magnitude stored in the evaluation value temporary storage unit 52, and displays the final display order Store in display order temporary storage unit 53. Finally, the display control unit 22 (FIG. 12) displays the candidates based on the display order stored in the display order temporary storage unit 53. Display on section 12.
[0157] 〔実施形態 3〕 Third Embodiment
上述の各実施形態では、ピンインを構成するアルファベットキーの入力による中国 語入力の説明を行った。しかし、本発明の文字変換装置 1はこれに限定されない。操 作パネル 2の入力部 11 (図 12)が、手書き文字の入力機能を備え、入出力制御部 20 が手書き文字を認識することにより、中国語を入力する文字変換装置 1を構成するこ とも可能である。  In each of the above-described embodiments, the Chinese language input by the input of the alphabet key constituting the pin-in has been described. However, the character conversion device 1 of the present invention is not limited to this. The input unit 11 (FIG. 12) of the operation panel 2 also has a function of inputting handwritten characters, and the input / output control unit 20 recognizes the handwritten characters to constitute the character conversion device 1 for inputting Chinese characters. It is possible.
[0158] また、手書き文字を入力するための入力部 (筆記文字入力部) 11は、例えば、表示 一体型タブレットとしての操作パネル 2の表示領域に手書き文字入力のための枠を 表示すること〖こより実現することができる。  Further, the input unit (written character input unit) 11 for inputting handwritten characters may display a frame for inputting handwritten characters on the display area of the operation panel 2 as a display integrated type, for example. It can be realized from this.
[0159] 文字変換装置 1は、入出力制御部 20内部に、上記文字入力枠にユーザが筆記し た文字の筆記情報 (座標データ)を認識するための文字認識部 (文字認識手段)(図 示せず)を備える。また、記録部 40に、上記文字認識部で認識された入力文字バタ ーンとマッチングするために用いる認識辞書(図示せず)をさらに含む。そして、表示 制御部 22が、入力文字の認識結果を出力するように構成されて!、る。  The character conversion device 1 is a character recognition unit (character recognition means) for recognizing writing information (coordinate data) of characters written by the user in the character input box in the input / output control unit 20. Not shown). The recording unit 40 further includes a recognition dictionary (not shown) used to match the input character pattern recognized by the character recognition unit. Then, the display control unit 22 is configured to output the recognition result of the input character! .
[0160] ユーザは、操作パネル 2の文字入力枠に所望の文字を筆記すると、操作パネル 2 は筆記された文字の座標情報 60 (図 27)を検出し、入出力制御部 20の内部バッファ に記憶する。文字認識部は、内部バッファに記憶している座標情報と認識辞書の辞 書パターンとマッチング(例えば、 DPマッチング)を行い、入力パターンと辞書パター ンの類似度 (例えば距離)を求め、辞書パターンに対応する文字コードと類似度を関 連付けて内部バッファに記憶する。文字認識部は、入力パターンと認識辞書の各パ ターンとのマッチングを行うと、内部バッファの認識結果を類似度の高い順に並べ換 える。そして、認識結果を候補漢字 (単語)として候補抽出部 31に供給する。  When the user writes a desired character in the character input frame of operation panel 2, operation panel 2 detects the coordinate information 60 (FIG. 27) of the written character, and the input / output control unit 20 internal buffer Remember. The character recognition unit matches the coordinate information stored in the internal buffer with the dictionary pattern of the recognition dictionary (for example, DP matching), and obtains the similarity (for example, distance) between the input pattern and the dictionary pattern. The character code corresponding to and the similarity are associated and stored in the internal buffer. When the character recognition unit matches the input pattern with each pattern of the recognition dictionary, the character recognition unit rearranges the recognition results of the internal buffer in descending order of similarity. Then, the recognition result is supplied to the candidate extraction unit 31 as a candidate kanji (word).
[0161] 以下、手書き文字認識機能を備えた文字変換装置 1の動作を具体的に説明する。  The operation of the character conversion device 1 having the handwritten character recognition function will be specifically described below.
[0162] ユーザが、表示一体型タブレットの文字入力枠に「外字 2」を筆記すると(図 28の(a ) )、操作パネル 2は「外字 2」の座標データを検出し、内部バッファに記憶する(図 27 )。文字認識部は、内部バッファの「外字 2」の座標データと認識辞書の各辞書パター ンと DPマッチングを行い、距離を計算し、距離の小さい順に並べ替える(図 29)。入 出力制御部 20は、認識結果を表示制御部 22に転送する。 When the user writes “extra character 2” in the character input frame of the display integrated tablet ((a) in FIG. 28), operation panel 2 detects the coordinate data of “extra character 2” and stores it in the internal buffer. (Fig. 27). The character recognition unit performs DP matching between the coordinate data of “external character 2” in the internal buffer and each dictionary pattern of the recognition dictionary, calculates distances, and rearranges them in ascending order of distance (FIG. 29). On The output control unit 20 transfers the recognition result to the display control unit 22.
[0163] 表示制御部 22は、認識結果に基づ!/、て、認識文字候補となる n個の文字 (ここでは 、漢字)の表示位置を求め、表示位置の情報および認識文字候補 n個を操作パネル 2の表示部 12に表示する(図 28の(b) )。操作パネル 2は、表示された認識文字候補 と表示位置を関連付けて内部バッファに記憶する。  Display control unit 22 determines the display position of n characters (here, kanji) as recognition character candidates based on the recognition result, and displays information on display positions and n recognition character candidates. Is displayed on the display unit 12 of the operation panel 2 ((b) in FIG. 28). Operation panel 2 associates the displayed recognized character candidate with the display position and stores it in the internal buffer.
[0164] ユーザは、「外字 2」が表示されて ヽることを確認すると、操作パネル 2上の「外字 2」 を指定する。操作パネル 2は指示された位置を検出し、入出力制御部 20の候補選択 受付部 23に位置情報を転送する。そして、候補選択受付部 23で「外字 2」が指示さ れたことを検出し、「外字 2」力 始まる単語 (「「外字 2」天」 · · ·)が表示される(図 28の (c) )。このときの動作は、すでに説明しているので、ここでは繰り返さない。  When the user confirms that “External character 2” is displayed and specifies, “External character 2” on operation panel 2 is designated. The operation panel 2 detects the instructed position, and transfers the position information to the candidate selection receiving unit 23 of the input / output control unit 20. Then, the candidate selection reception unit 23 detects that “extra character 2” is instructed, and a word beginning with “foreign character 2” (““ foreign character 2 ”heaven” ···) is displayed (see FIG. c)). Since the operation at this time has already been described, it will not be repeated here.
[0165] これにより、ユーザが所望の漢字に対応するピンインを把握していない場合でも、 所望の漢字を入力することが可能となり、所望の漢字を得るための入力操作を簡素 ィ匕することがでさる。  [0165] Thereby, even when the user does not know the pinyin corresponding to the desired kanji, it becomes possible to input the desired kanji, and the input operation for obtaining the desired kanji can be simplified. You
[0166] なお、手書き文字を認識するための、文字変換装置 1の上記文字認識部の機能に ついて、マルチファンクションプリンタを例に挙げて説明した力 これに限定されない 。表示一体型のタツチパネルを備えた機器であればいずれも、上記文字認識部を適 用することが可能である。したがって、筆記文字を入力することにより所望の漢字 '単 語を入力することが可能な本発明の文字変換装置を、上記マルチファンクションプリ ンタ以外の機器にぉ 、ても実現することが可能である。  Note that the function of the character recognition unit of the character conversion device 1 for recognizing handwritten characters is not limited to this, with the multifunction printer taken as an example. The above-described character recognition unit can be applied to any device provided with a touch panel integrated with a display. Therefore, it is possible to realize the character conversion device of the present invention capable of inputting a desired kanji 'word by inputting a written character, on any device other than the above-mentioned multifunction printer. .
[0167] 〔実施形態 4〕  Embodiment 4
(繁体字の入力)  (Traditional character input)
上述の実施形態では、ピンイン表記から中国語 (簡体字)を入力可能な文字変換 装置について説明した。し力しながら、本発明の文字変換装置 1は上記構成に限定 されない。  In the above-described embodiment, the character conversion device capable of inputting Chinese (Simplified) from Pinyin notation has been described. However, the character conversion device 1 of the present invention is not limited to the above configuration.
[0168] 例えば、本発明の文字変換装置 1を、主に台湾で使用される繁体字を入力するた めの文字変換装置として実現することも可能である。  For example, it is possible to realize the character conversion device 1 of the present invention as a character conversion device for inputting the traditional characters mainly used in Taiwan.
[0169] 本実施形態では、以下、表音文字としての注音文字 (注音の字母)の入力を受け付 けて、表意文字としての繁体字を、候補漢字 (単語)としてユーザに提示する文字変 換装置 1について説明する。 In the present embodiment, hereinafter, the input of a vowel character (jutsu character's letter) as a phonetic character is accepted, and a character variation is presented to the user as a candidate kanji character (word) as a traditional kanji character as an ideographic character. The replacement device 1 will be described.
[0170] 図 30は、本実施形態に係る文字変換装置 1の要部構成を示すブロック図である。  FIG. 30 is a block diagram showing the main configuration of the character conversion device 1 according to the present embodiment.
なお、図 30の各構成要素に付された符号は、図 12·図 22の各構成要素に付された 符号に対応しており、同じ符号は、同じ構成要素を示している。したがって、上述の 各実施形態ですでに説明した構成要素についての説明は繰り返さない。  The reference numerals attached to the respective components in FIG. 30 correspond to the reference numerals attached to the respective components in FIG. 12 and FIG. 22, and the same reference numerals indicate the same components. Therefore, the description of the components already described in the above embodiments will not be repeated.
[0171] 図 30に示す文字変換装置 1において、図 12·図 22と異なる点は、入力部 11が、注 音文字を入力するための、各注音文字に対応付けられたキーを備えている点と、入 力受付部 21が、上記入力された注音文字を受け付けて候補抽出部 31に供給する ための注音文字入力受付部 24を備えている点と、漢字辞書部 41において、注音文 字 注音、および、注音 繁体字の対応関係を記録した対応表が含まれている点で ある。  The character conversion device 1 shown in FIG. 30 differs from those in FIG. 12 and FIG. 22 in that the input unit 11 is provided with a key for inputting a note character, which is associated with each note character. And the kanji dictionary unit 41 includes a phononic character input acceptance unit 24 for accepting the input phononic character and supplying it to the candidate extraction unit 31. It is a point that contains a correspondence table that records the correspondences between the ring tones and the traditional characters of the ring tones.
[0172] 図 31は、本実施形態における、繁体字入力のための注音文字入力処理時の操作 画面の一例を示す図である。本実施形態における表示一体型の操作パネル 2では、 アルファベットを表示するほかに、注音文字を表示してもよい(文字入力領域 121)。 繁体字の入力では、アルファベットではなく注音文字を入力することが一般的である 。図 31に示すとおり、キーボードのキーに注音文字が割り当てられており、これにより 、繁体字に対応した注音を入力することができる。  FIG. 31 is a diagram showing an example of the operation screen at the time of the phonetic character input process for traditional character input in the present embodiment. The display integrated type operation panel 2 in the present embodiment may display the vulgar character in addition to displaying the alphabet (character input area 121). It is common to enter vowels instead of the alphabet for traditional Chinese input. As shown in FIG. 31, the phonetic characters are assigned to the keys of the keyboard, so that it is possible to input a vocal sound corresponding to the traditional characters.
[0173] 図 32は、本実施形態における、漢字辞書部 41 (図 30)に記録される、漢字辞書( 繁体字)の例を示す図である。  FIG. 32 is a diagram showing an example of a kanji dictionary (in Japanese) recorded in the kanji dictionary unit 41 (FIG. 30) according to the present embodiment.
[0174] 図 33は、本実施形態における、単語辞書部 42に記録される、単語辞書の例を示 す図である。  FIG. 33 is a diagram showing an example of a word dictionary recorded in the word dictionary unit 42 in the present embodiment.
[0175] 図 34は、本実施形態における、連結度記録部 46に記録される、連結度テーブル 1 50aの例を示す図である。  FIG. 34 is a diagram showing an example of the connectivity degree table 150a recorded in the connectivity degree recording unit 46 in the present embodiment.
[0176] 上記構成によれば、中国語 (簡体字)を入力する場合と同様に、所望の表意文字( 台湾で主に用いられる繁体字)を得るための、表音文字 (注音の字母)を入力する操 作を簡素化することが可能な文字変換装置を実現することが可能となる。  [0176] According to the above configuration, as in the case of inputting Chinese (Simplified), it is necessary to use phonetic characters (jingee's alphabet) for obtaining desired ideographic characters (the traditional characters mainly used in Taiwan). It becomes possible to realize a character conversion device that can simplify the input operation.
[0177] 〔学習機能について〕  [0177] [About the learning function]
(学習漢字'単語における表示順序決定方法) 学習漢字 ·単語の表示順序は、例えば以下のような計算によって、学習された順序(How to determine display order for learning kanji 'words') Learning kanji · Display order of words is, for example, the order learned by the following calculation
(学習順位 Yと称する)を算出することにより、決定することもできる。学習順位 Yの算 出は、図 22に示す学習時期判定部 135が実行すればょ 、。 It can also be determined by calculating (Learning order Y). The calculation of the learning order Y may be performed by the learning time determination unit 135 shown in FIG.
Y=学習番号一学習カウンタ 1 · · · (式 2)  Y = learning number 1 learning counter 1 · · · · (Expression 2)
学習番号は、新たに学習される漢字に割り当てるべきユニークな番号を表しており 、この番号が、学習カウンタとして学習漢字とともに記録されることにより、その漢字が いつ学習されたかを知ることができるものである。図 21の学習辞書記録部 43 (または 、ペア学習辞書記録部 44)には、 500個の学習漢字 (単語)を記録することが可能で あるとすると、学習番号は、 0〜499までを用意すればよい。  The learning number represents a unique number to be assigned to the newly learned kanji, and this number can be recorded together with the learning kanji as a learning counter to know when the kanji has been learned. It is. Assuming that it is possible to record 500 learning kanji (words) in the learning dictionary recording unit 43 (or the pair learning dictionary recording unit 44) of FIG. 21, the learning numbers are prepared from 0 to 499. do it.
[0178] 上記式 2の計算により、 Υがマイナスになる場合は、学習番号の上限値 (ここでは、 5 00程度)を加算する。 When 計算 becomes negative by the calculation of the above equation 2, the upper limit value of the learning number (here, about 500) is added.
[0179] さらに、学習時期判定部 135は、候補の並び替えの手間を省くために、候補ごとに 求まった学習順位 Υをまるめる処理を行ってもよい。例えば、以下の手順にしたがつ て、学習順位をまるめることができる。  Furthermore, the learning timing determination unit 135 may perform processing of rounding the learning order 求 obtained for each candidate, in order to save time and effort of rearranging the candidates. For example, the learning order can be rounded according to the following procedure.
(ィ) 0≤Υ≤9のとき、 Υ Υ (そのまま)  (I) When 0 ≤Υ≤ 9, Υ そ の ま ま (As is)
(口) 10≤Υ≤29のとき、 Υ (Υ— 10) Ζ2+ 10  (口) When 10≤Υ≤29, Υ (Υ-10) Ζ 2 + 10
(ハ) 30≤Υ≤109のとき、 Υ (Υ— 30) Ζ8 + 20  (C) When 30 ≤Υ≤ 109, Υ (Υ 30) Ζ 8 + 20
(二) Υ≥110のとき、 Υ^ (Υ— 110) Ζ32 + 30、さらに、 Υ> 39のとき、 Υ 39 以上のように、上記式 2を用いれば、学習漢字 (単語)を学習された順、あるいは、 学習された時期が新しい順に効率よく並べ替えることができるので、学習時期に応じ て候補としての表示順序を決定することが可能となる。  (2) If Υ 110, Υ ^ (Υ-110) Ζ 32 + 30, and Υ> 39, Υ 39 or more, if using the above equation 2, learning kanji (words) can be learned Since the learning order can be efficiently rearranged in the order of newness, it is possible to determine the display order as a candidate according to the learning time.
[0180] (学習カウンタ付与方法) (Method for attaching learning counter)
次に、図 21に示す学習制御部 34が制御する学習番号 (初期値 =0)および学習力 ゥンタ (初期値 =0)について、より具体的に説明する。学習辞書記録部 43に 500個 の学習漢字を記録できるものとし、学習辞書記録部 43における学習漢字の記録位 置(アドレス)の初期値を ID = 0とする。  Next, the learning number (initial value = 0) and the learning ability (initial value = 0) controlled by the learning control unit 34 shown in FIG. 21 will be described more specifically. It is assumed that 500 learning kanji can be recorded in the learning dictionary recording unit 43, and the initial value of the recording position (address) of the learning kanji in the learning dictionary recording unit 43 is ID = 0.
[0181] 学習番号 =0のときに漢字「学」を学習すると、学習制御部 34は、学習辞書記録部When learning the kanji character “student” when the learning number = 0, the learning control unit 34
43に学習されている学習漢字のうち、現在の学習番号に等しい学習カウンタを持つ 学習漢字を検索する。検索された学習漢字のアドレス IDの漢字を削除し、新規の学 習漢字をセットする。現在の学習番号に等 、学習カウンタを持つ学習漢字が無 、 場合は、 IDの位置に学習漢字と学習カウンタを共に記録する。そして、学習番号に 1 を加算する。ここでは、最初の学習であるため、アドレス「ID = 0」の位置に、学習漠 字「学」を学習カウンタ =0とともに記録し、学習番号に 1加算して、学習番号 = 1とす る。 Of the learning kanji learned in 43, it has a learning counter equal to the current learning number Search for learning kanji. Delete the kanji at the address ID of the retrieved learning kanji and set a new learning kanji. If there is no learning kanji with a learning counter, etc., etc., for the current learning number, record both the learning kanji and the learning counter at the ID position. Then, add 1 to the learning number. Here, since it is the first learning, the learning vague “g” is recorded at the position of the address “ID = 0” with the learning counter = 0, 1 is added to the learning number, and the learning number = 1. .
[0182] この動作を繰り返して、学習番号 =499のときに「文」を学習すると、学習制御部 34 は、学習辞書記録部 43のアドレス「k」の位置に、学習漢字「文」を、学習カウンタ 499 とともに記録し、学習番号 =0に更新する (学習番号 499に 1を加算した結果、学習 番号が 500以上になる場合は、 500を減算して学習番号を 0に戻す)。  Repeating this operation, when learning a “sentence” when learning number = 499, learning control unit 34 causes learning kanji “sentence” to be located at address “k” of learning dictionary recording unit 43, Record along with the learning counter 499 and update it to the learning number = 0 (If the learning number is 500 or more as a result of adding 1 to the learning number 499, 500 is subtracted and the learning number is returned to 0).
[0183] さらに、新しく漢字 (例えば「大」)が学習されると、学習制御部 34は、新たな学習漠 字「大」を、学習カウンタ 0とともに記録されている「学」に上書きする形で記録する。こ のとき「大」には、学習カウンタ 0が関連付けて記録される。続いて、上記と同様学習 番号 = 2に更新する。  Furthermore, when a new kanji (for example, “large”) is learned, the learning control unit 34 overwrites the new learning vague “large” with “learning” recorded along with the learning counter 0. Record with At this time, learning counter 0 is recorded in association with "large". Subsequently, the learning number is updated to 2 as described above.
[0184] (学習辞書のデータ構造)  (Data structure of learning dictionary)
次に、学習辞書記録部 43 (および、ペア学習辞書記録部 44)に記録される学習単 語のデータ構造にっ 、て説明する。  Next, a data structure of learning words recorded in the learning dictionary recording unit 43 (and the pair learning dictionary recording unit 44) will be described.
[0185] 学習辞書記録部 43に記録される学習単語には、その単語を構成する漢字の文字 コードから生成されるグループ識別子 (GId値)が割り当てられており、学習単語は GI d値と関連付けて記録される。この GId値を用いることにより、学習辞書記録部 43から 目的の学習単語を検索するための処理効率が向上する。  A group identifier (GId value) generated from the character code of the kanji constituting the word is assigned to the learning word recorded in the learning dictionary recording unit 43, and the learning word is associated with the GId value. Are recorded. By using this GId value, the processing efficiency for searching for a target learning word from the learning dictionary recording unit 43 is improved.
[0186] GId値の生成方法にっ 、て、具体例を挙げて説明する。なお、以下の説明では、 漢字を一意に特定するための漢字コードおよび GId値をともに 16ビットで表すものと する。本実施形態では、一例として、 GId値は、当該単語の文字数を示す上位 4ビッ トと、単語を構成する各漢字の漢字コードの演算により求まった数値を示す下位 12ビ ットとで構成されるものとするが、本発明の文字変換装置 1で利用される GId値のデ ータ構造はこれに限定されない。  The method of generating the GId value will be described by taking a specific example. In the following description, both the kanji code and the GId value for uniquely identifying a kanji are represented by 16 bits. In the present embodiment, as an example, the GId value is composed of the upper 4 bits indicating the number of characters of the word and the lower 12 bits indicating the numerical value obtained by calculation of the kanji code of each kanji constituting the word. However, the data structure of the GId value used in the character conversion device 1 of the present invention is not limited to this.
[0187] 例えば、「上海」という単語の入力が確定されたとする。このとき、学習制御部 34 (図 21)は、まず「上海」という単語の GId値を生成する。 GId値を生成するために、単語「 上海」を構成する各漢字の漢字コードの下位 8ビットを、各漢字を表す数値(「上」 =「 009F (16進数)」、「海」=「00八3 (16進数)」)として用ぃるものとする。例えば、 GId 値の下位 12ビットを、上記漢字コード 16ビットの下位 8ビット同士の加算により求める とすると、学習制御部 34は、下位 12ビットとして、 9F+A3=「141 (16進数)」を算出 する。 For example, it is assumed that the input of the word “Shanghai” is confirmed. At this time, the learning control unit 34 (see FIG. 21) First, generate the GId value of the word "Shanghai". In order to generate a GId value, the lower 8 bits of the kanji code of each kanji constituting the word "Shanghai" is a numerical value representing each kanji ("upper" = "009F (hexadecimal number)", "sea" = "00 It shall be used as H.3 (hexadecimal number). For example, assuming that the lower 12 bits of the GId value are obtained by adding the lower 8 bits of the 16-bit Kanji code, the learning control unit 34 sets 9F + A3 = “141 (hexadecimal number)” as the lower 12 bits. calculate.
[0188] 次に、「上海」は 2文字であるので、この単語が 2文字で構成されていることを示す「 2 (16進数)」を上位 4ビットとして算出する。以上のようにして、単語「上海」の GId値 を「2141 (16進数)」として算出する。  [0188] Next, since "Shanghai" has 2 characters, "2 (hexadecimal number)" indicating that this word is composed of 2 characters is calculated as the upper 4 bits. As described above, the GId value of the word "Shanghai" is calculated as "2141 (hexadecimal number)".
[0189] 上述のようにして生成された GId値は、どのような文字数力もなる単語であっても(た だし、上述の例のように文字数を 4ビットで表す場合、最大 8文字までを単語として学 習するものとする)、固定長(例えば、 16ビット)に要約されることになる。したがって、 固定長で構成される GId値によって、目的の単語を絞り込んでから、検索処理を実行 することができるので、学習単語の検索効率を向上させることが可能となる。  [0189] The GId value generated as described above is a word having any number of characters (if the number of characters is represented by 4 bits as in the above example, a maximum of 8 characters can be used as a word). It will be summarized as a fixed length (eg, 16 bits). Therefore, since it is possible to execute the search process after narrowing down the target word by the GId value configured with the fixed length, it becomes possible to improve the search efficiency of the learning word.
[0190] 〔操作パネルについて〕  [About Operation Panel]
操作パネル 2に表示される文字入力処理時の操作画面は、図 14や図 17で示す例 に限定されな ヽ。操作パネル 2に表示される操作画面の他の例にっ 、て説明する。  The operation screen for character input processing displayed on operation panel 2 is not limited to the example shown in FIG. 14 or FIG. Another example of the operation screen displayed on the operation panel 2 will be described.
[0191] 操作パネル 2に表示される、ピンインを入力するためのアルファベットキーは、図 35 に示すように、 PCのキーボードを模したキー配列としてもよいし、図 36に示すように、 入力文字をアルファベット順に m X nのキー配列に当てはめるようにしてもょ 、。  The alphabet keys for inputting pinyin displayed on operation panel 2 may be a key arrangement imitating a PC keyboard as shown in FIG. 35, or as shown in FIG. 36, an input character Let's fit in alphabetical order to m X n key array.
[0192] また、候補漢字を表示するための候補文字表示領域 122を確定文字表示領域 12 3に重畳させて表示してもよ 、(図 37の(a)〜(c) )。  Further, a candidate character display area 122 for displaying candidate kanji may be superimposed on the determined character display area 123 and displayed ((a) to (c) in FIG. 37).
[0193] さらに、候補漢字を選択後、選択した漢字から始まる候補単語を表示するための第 2の候補文字表示領域 122'を、候補文字表示領域 122および確定文字表示領域 1 23に重畳させて表示してもよいし(図 38の(a) )、候補文字表示領域 122と並べて表 示してもよい(図 38の(b) )。  Furthermore, after candidate kanji is selected, second candidate character display area 122 ′ for displaying candidate words starting from the selected kanji is superimposed on candidate character display area 122 and determined character display area 123. It may be displayed (FIG. 38 (a)) or may be displayed side by side with the candidate character display area 122 (FIG. 38 (b)).
[0194] 〔実施形態 5〕  Fifth Embodiment
本発明の一実施形態について図 1〜図 11に基づいて説明すると以下の通りである 。ここでは、漢字 (変換文字、変換された後の文字)の読み (表音文字)を入力して漠 字を含む単語を決定する装置とするが、これに限定されない。つまり、以下で説明す る実施の形態は本発明を実施するための一例であって、本発明を実現する上で制 限されるものではない。 One embodiment of the present invention will be described below with reference to FIGS. 1 to 11. . Here, a device for inputting a reading (phonetic characters) of kanji characters (converted characters, characters after converted) to determine words including vague characters is used, but the present invention is not limited thereto. That is, the embodiments described below are merely examples for implementing the present invention, and are not limited to the implementation of the present invention.
[0195] 図 1は、本実施の形態に係る文字変換装置 1の構成を示すブロック図である。文字 変換装置 1は、入力部 11、辞書検索部 310、表示制御部 22、漢字辞書部 (変換文 字辞書) 41、単語辞書部 (単語辞書) 42、一時記憶部 3、制御部 10、表示部 12を備 えている。なお、図 1の各構成要素に付された符号は、図 12·図 16 ·図 21 ·図 22·図 30の各構成要素に付された符号に対応しており、同じ符号は、同じ構成要素を示し ている。なお、図 1には示されていない図 12·図 16 ·図 21 ·図 22·図 30の各部を、図 1の文字変換装置 1が備えていてもよい。  FIG. 1 is a block diagram showing a configuration of character conversion device 1 according to the present embodiment. The character conversion device 1 includes an input unit 11, a dictionary search unit 310, a display control unit 22, a kanji dictionary unit (conversion character dictionary) 41, a word dictionary unit (word dictionary) 42, a temporary storage unit 3, a control unit 10, a display It has part 12. The reference numerals attached to the respective constituent elements in FIG. 1 correspond to the reference numerals attached to the respective constituent elements in FIG. 12, FIG. 16, FIG. 21, FIG. 22 and FIG. It shows the elements. Note that the character conversion device 1 shown in FIG. 1 may be equipped with the components shown in FIG. 12, FIG. 16, FIG. 21, and FIG. 22, FIG. 30, which are not shown in FIG.
[0196] 入力部 11は、キーボード等からなり、ユーザが所望の入力文字列の読み (表音文 字)、後段で説明する選択した漢字、選択した単語等を、入力することができるように 構成されている。また、入力部 11は、ユーザが文字変換装置 1への各種指示を入力 できるように構成されている。なお、入力部 11と表示部 12とを兼用して、操作パネル 2 (タツチパネル)として設けられて 、てもよ 、。  [0196] The input unit 11 includes a keyboard and the like so that the user can input a desired input character string (phonetic character), selected kanji described later, selected words, and the like. It is configured. Further, the input unit 11 is configured to allow the user to input various instructions to the character conversion device 1. The input unit 11 and the display unit 12 may be used as the operation panel 2 (touch panel).
[0197] 表示制御部 22は、入力部 11から入力された読み及び読みを基に、辞書検索部 31 0が検索した漢字、単語及びそれらの候補をユーザが選択可能に表示部 12に表示 する。表示部 12は、表示制御部 22の制御の下、各種データの表示を行う。  Display control unit 22 displays the kanji, words, and their candidates searched by dictionary search unit 310 on display unit 12 in a selectable manner based on the readings and readings input from input unit 11. . The display unit 12 displays various data under the control of the display control unit 22.
[0198] 本実施形態では、候補抽出部 31は、辞書検索部 310を備えている。辞書検索部 3 10は、記録部 40に記憶されている各辞書部(漢字辞書部 41および単語辞書部 42) を検索し、候補となる漢字 (または単語)を検出する。本実施形態では、辞書検索部 3 10は、さらに、漢字取得部 (先頭変換文字取得手段) 311と単語取得部 (単語取得 手段) 312とを備えている。  In the present embodiment, the candidate extraction unit 31 includes a dictionary search unit 310. The dictionary search unit 310 searches each dictionary unit (Kanji dictionary unit 41 and the word dictionary unit 42) stored in the storage unit 40, and detects candidate kanji (or words). In the present embodiment, the dictionary search unit 310 further includes a kanji acquisition unit (head conversion character acquisition unit) 311 and a word acquisition unit (word acquisition unit) 312.
[0199] 漢字取得部 311は、入力受付部 (受付手段) 21が受け付けた読みの文字列を入力 とし、漢字辞書部 41を検索して所望の条件に一致した漢字 (あるいは、以下で説明 する漢字情報)を検索結果として一時記憶部 3に記憶する。さらに単語取得部 312は 、漢字取得部 311の検索結果及び入力受付部 21が受け付けた読みの文字列を入 力とし、単語辞書部 42を検索して所望の条件に一致した単語 (あるいは、以下で説 明する単語情報)を検索結果として一時記憶部 3に記憶する。漢字取得および単語 取得の詳細は後述する。 The kanji acquisition unit 311 receives the character string of the reading accepted by the input acceptance unit (acceptance means) 21 as an input, and searches the kanji dictionary unit 41 to find the kanji that matches the desired conditions (or, described below. The kanji information is stored in the temporary storage unit 3 as a search result. Furthermore, the word acquisition unit 312 receives the search result of the kanji acquisition unit 311 and the character string of the reading accepted by the input accepting unit 21. The word dictionary unit 42 is searched as a force, and a word matching the desired condition (or word information described below) is stored in the temporary storage unit 3 as a search result. Details of kanji acquisition and word acquisition will be described later.
[0200] 漢字辞書部 41は、読みと漢字とを対応付けた漢字対応情報 144を保持している。 [0200] The kanji dictionary unit 41 holds kanji correspondence information 144 in which the yomi and the kanji are associated.
さらに、読みの先頭文字と該先頭文字を先頭とする漢字の、漢字対応情報 144での 格納先のアドレス情報を対応付けた漢字インデックス (漢字インデックステーブル) 14 5を保持している。ここでは、漢字インデックス 145は、漢字の読みの順に並べられて いるものとする。  Furthermore, a kanji index (kanji index table) 145 in which the first character of reading and the address information of the storage destination in the kanji correspondence information 144 of the kanji whose head is the first character is associated is held. Here, the kanji index 145 is arranged in the order of kanji reading.
[0201] 単語辞書部 42は、単語と該単語の読みとが対応付けられた単語対応情報 162を 保持している。さらに、単語の先頭となりうる先頭文字と、該先頭文字を先頭に持つ 単語の、単語対応情報 162での格納先のアドレス情報とが対応付けられた単語イン デッタス(単語インデックステーブル) 161を保持している。単語対応情報 162と単語 インデックス 161とは、単語の先頭文字毎にグループィ匕されて!、る。  The word dictionary unit 42 holds word correspondence information 162 in which a word and a reading of the word are associated with each other. Furthermore, a word index (word index table) 161 in which the head character which can be the head of the word and the address information of the storage destination in the word correspondence information 162 of the word having the head character at the head are associated is held. ing. The word correspondence information 162 and the word index 161 are grouped by the first letter of the word! .
[0202] また、漢字対応情報 144は、漢字に対応付けて補助情報を格納して 、る。また、単 語対応情報 162は、単語に対応付けて補助情報を格納している。これら、補助情報 は、例えば、読みの文字数 (例えば、上述の各実施形態におけるピンイン長(要素 B) ) (検索補助情報)や出現頻度 (上述の要素 A) (表示補助情報)等が挙げられる。上 記補助情報は、読みの文字数及び出現頻度を所定の値域、例えば、 1から 100の範 囲に変換した数値とする。以下では、漢字情報とは、漢字とその漢字に対応付けられ た情報 (読み、補助情報)とを指すものとする。また、単語情報とは、単語とその単語 に対応付けられた情報 (読み、補助情報)とを指すものとする。しかし、補助情報は、 もちろん、上記に限定されず、これ以外の情報であってもよい。例えば、上述の各実 施形態で述べた候補順序決定要素 (要素 C〜要素 G)を補助情報として、候補 (漢字 Z単語)に対応付けて記憶しておいてもよい。  Also, the kanji correspondence information 144 stores auxiliary information in association with kanji. The word correspondence information 162 stores auxiliary information in association with a word. The auxiliary information includes, for example, the number of characters of reading (for example, the pin-in length (element B) in each of the embodiments described above) (search auxiliary information), the frequency of occurrence (element A described above) (display auxiliary information), etc. . The above auxiliary information is a numerical value obtained by converting the number of characters and the frequency of occurrence of reading into a predetermined range, for example, 1 to 100. In the following, kanji information refers to kanji and information (reading and supplementary information) associated with the kanji. In addition, word information refers to a word and information (reading and supplementary information) associated with the word. However, the auxiliary information is, of course, not limited to the above, and may be other information. For example, the candidate order determining elements (element C to element G) described in each of the above-described embodiments may be stored as auxiliary information in association with the candidate (kanji Z word).
[0203] 制御部 10は、文字変換装置 1における各種構成の動作を統括的に制御する。例え ば、入力部 11から入力された文字列の文字数をカウントする。また、辞書検索部 310 、表示制御部 22の制御を行う。また、入力受付部 21を備えおり、ユーザが入力部 11 力 入力した文字列を受け付けるようになって!/、る。 [0204] 次に、文字変換装置 1の処理動作について、図 2に示すフローチャートを用いて説 明する。 The control unit 10 centrally controls the operation of various components in the character conversion device 1. For example, the number of characters of the character string input from the input unit 11 is counted. Also, the dictionary search unit 310 controls the display control unit 22. In addition, an input receiving unit 21 is provided so that the user can receive a character string input by the input unit 11! Next, the processing operation of the character conversion device 1 will be described using the flowchart shown in FIG.
[0205] ユーザが入力部 11から入力したい文字列の先頭文字の読みとして、その読みの先 頭文字を含む 1以上の読み文字列を入力部 11から入力すると、入力受付部 21は入 力された読み文字列を制御部 10に転送する。  [0205] When the user inputs one or more reading character strings including the first character of the reading from the input unit 11 as the reading of the first character of the character string desired to be input from the input unit 11, the input reception unit 21 is input. Transfer the read string to the control unit 10.
[0206] 制御部 10は入力部 11から転送された読み文字列の文字数をカウントし、読み文字 列とその文字数を一時記憶部 3に記憶する。そして、制御部 10は辞書検索部 310に 漢字検索を指示する (S101)。  Control unit 10 counts the number of characters of the read character string transferred from input unit 11, and stores the read character string and the number of characters in temporary storage unit 3. Then, the control unit 10 instructs the dictionary search unit 310 to search for kanji (S101).
[0207] 次に、辞書検索部 310の漢字取得部 311は、一時記憶部 3から読み文字列と読み 文字列の文字数を辞書検索部 310の内部バッファ(図示せず)に読み込む。漢字取 得部 311は、漢字インデックス 145を参照し、読み文字列の先頭文字を基に、漢字 対応情報 144内での検索すべき検索対象範囲の終了位置のアドレス (iNum)を求 めて内部バッファに保持する(S102)。さらに、漢字対応情報 144内での検索すべき 検索対象範囲の開始位置のアドレス (iStart)を求めて内部バッファに保持する(S1 03)。  Next, the Chinese character acquisition unit 311 of the dictionary search unit 310 reads the read character string and the number of characters of the read character string from the temporary storage unit 3 into an internal buffer (not shown) of the dictionary search unit 310. The kanji acquisition unit 311 refers to the kanji index 145, determines the address (iNum) of the end position of the search target range to be searched in the kanji correspondence information 144 based on the first character of the read character string, and internally It holds in the buffer (S102). Further, the address (iStart) of the start position of the search target range to be searched in the kanji correspondence information 144 is obtained and held in the internal buffer (S103).
[0208] そして、漢字取得部 311は、辞書検索部 310の内部バッファに保持したアドレスを 基に、漢字対応情報 144を参照して漢字情報を読み込み、漢字対応情報 144の読 み情報と入力文字列の読みとを比較する(S104)。入力文字列の読みと、漢字対応 情報 144での読みが一致している場合は(S104で YES)、漢字取得部 311は、読み がー致の条件を満足した漢字情報を一時記憶部 3に記憶する(S105)。  [0208] Then, the kanji acquisition unit 311 reads kanji information by referring to the kanji correspondence information 144 based on the address held in the internal buffer of the dictionary search unit 310, and reads the kanji correspondence information 144 and the input characters. The column reading is compared (S104). If the reading of the input character string matches the reading in the kanji correspondence information 144 (YES in S104), the kanji acquisition unit 311 stores the kanji information satisfying the reading condition in the temporary storage unit 3. It memorizes (S105).
[0209] そして、 iStartを更新する(S106)。その後、漢字取得部 311は内部バッファに保 持している iNumと S 106で更新された iS tartとを比較する(S107)。比較した結果、 検索する漢字にっ 、て、 iStartが iNum以上になって!/、ると(S 107にお!/、て NO)、 制御部 10に漢字検索が終了したことを知らせる。  Then, iStart is updated (S106). Thereafter, the kanji acquisition unit 311 compares the iNum stored in the internal buffer with the iS tart updated in S 106 (S 107). As a result of comparison, in the kanji to be searched, iStart becomes iNum or more! /, And (in S 107,! /, NO), notifies the control unit 10 that the kanji search is finished.
[0210] なお、漢字検索の具体的な説明は、実施例を用いて後述する。  A specific description of the kanji search will be described later using an example.
[0211] 制御部 10は、辞書検索部 310から漢字検索が終了したことの通知を受けると、一 時記憶部 3に記憶されている漢字検索結果を所定の順に並べ替え(S 108)、表示位 置の情報を付加して一時記憶部 3に記憶する。力!]えて、表示制御部 22は、表示部 1 2に漢字検索結果の表示を行う(S109)。なお、上記所定の順に並べ替える際、補 助情報の出現頻度の順に並べ替えてもよいし、上述の各実施形態で説明したとおり 、候補順序決定要素 (および評価値算出ルール)に基づいて、候補順序決定部 32 が評価値を算出し、それに基づ!、て並べ替えてもよ!、。 When control unit 10 receives the notification that the kanji search is completed from dictionary search unit 310, control unit 10 rearranges the kanji search results stored in temporary storage unit 3 in a predetermined order (S 108), Information on the position is added and stored in temporary storage unit 3. Power! ], The display control unit 22 is a display unit 1 The kanji search result is displayed at step 2 (S109). When sorting in the predetermined order, it may be sorted in the order of appearance frequency of the auxiliary information, or as described in the above embodiments, based on the candidate order determination factor (and the evaluation value calculation rule). The candidate order determination unit 32 calculates the evaluation value, and based on it, it may be rearranged!
[0212] 次に、ユーザの指示入力があるかを確認する(S 110)。ユーザは、表示部 12に表 示された漢字検索結果を目視して、所望の漢字を選択し、入力部 11を用いてその漠 字が表示されている位置を指示入力する。このようなユーザの指示入力があると(S1 10にて YES)、入力部 11は指示位置の位置情報を制御部 10の入力受付部 21に送 信する。 Next, it is confirmed whether there is a user's instruction input (S 110). The user visually checks the kanji search result displayed on the display unit 12, selects a desired kanji, and uses the input unit 11 to designate and input the position at which the vague character is displayed. When such an instruction input by the user is received (YES in S110), the input unit 11 transmits position information of the indicated position to the input reception unit 21 of the control unit 10.
[0213] 制御部 10は、入力受付部 21で受け付けた位置情報と、一時記憶部 3に記憶して いる漢字検索結果の表示位置の情報とを比較し、入力された漢字を検出する (S11 D oさらに、制御部 10は、ユーザにより指定された漢字を辞書検索部 310に転送し、 単語検索を行うことを指示する。なお、このとき、候補選択受付部 23が、ユーザにより 選択された漢字を検出し、学習制御手段が、それを学習漢字として、学習辞書記録 部 43またはペア学習辞書記録部 44に記憶してもよい。  The control unit 10 compares the position information received by the input receiving unit 21 with the display position information of the kanji search result stored in the temporary storage unit 3 and detects the input kanji (S 11). In addition, the control unit 10 transfers the kanji designated by the user to the dictionary search unit 310, and instructs to perform a word search Note that, at this time, the candidate selection reception unit 23 is selected by the user. The kanji may be detected, and the learning control means may store it as a learning kanji in the learning dictionary recording unit 43 or the pair learning dictionary recording unit 44.
[0214] 辞書検索部 310の単語取得部 312は、単語検索の指示を受けると、単語インデッ タス 161を参照し、取得した漢字の識別情報 (文字コード)を基に、単語対応情報 16 2内での検索すべき検索対象範囲の終了位置のアドレス(iNum)を求めて内部バッ ファに保持する (S112)。さらに、単語対応情報 162内で検索すべき検索対象範囲 の開始位置のアドレス (iStart)を求めて内部バッファに保持する(S113)。  When the word acquisition unit 312 of the dictionary search unit 310 receives a word search instruction, the word acquisition unit 312 refers to the word index 161 and, based on the acquired identification information (character code) of the kanji, in the word correspondence information 162. The address (iNum) of the end position of the search target range to be searched in is determined and stored in the internal buffer (S112). Further, the address (iStart) of the start position of the search target range to be searched in the word correspondence information 162 is obtained and held in the internal buffer (S113).
[0215] そして、単語取得部 312は、辞書検索部 310の内部バッファに保持したアドレスを 基に、単語対応情報 162を参照して単語情報を読み込み、所定の条件に一致する 単語情報を一時記憶部 3に記憶する(S 114)。  Then, based on the address held in the internal buffer of the dictionary search unit 310, the word acquisition unit 312 reads the word information with reference to the word correspondence information 162, and temporarily stores the word information matching the predetermined condition. It stores in the part 3 (S 114).
[0216] その後、単語取得部 312は、検索対象の単語についての iStartを更新し(S114a) 、内部バッファに保持して 、る iNumと S 114aで更新した iStartとを比較する(S 115 ) o比較した結果、検索対象の単語について、 iStartが iNum以上になっていると(S 115にお 、て NO)、制御部 10に単語検索が終了したことを知らせる。  Thereafter, the word acquiring unit 312 updates iStart for the word to be searched (S 114 a), holds it in an internal buffer, and compares iNum with iStart updated in S 114 a (S 115) o As a result of comparison, when iStart is iNum or more for the search target word (NO in S115), the control unit 10 is notified that the word search has ended.
[0217] なお、単語検索の具体的な説明は実施例を用いて後述する。 [0218] そして、制御部 10は、辞書検索部 310から単語検索が終了したことの通知を受け ると、一時記憶部 3に記憶されている単語検索結果を所定の順に並べ換え(S116) 、表示位置の情報を付加して一時記憶部 3に記憶する。力!]えて、表示制御部 22は、 表示部 12にて表示部 12に単語検索結果の表示を行う(S 117)。なお、上記所定の 順に並べ替える際、補助情報の出現頻度の順に並べ替えてもよいし、上述の各実施 形態で説明したとおり、候補順序決定要素 (および評価値算出ルール)に基づいて、 候補順序決定部 32が評価値を算出し、それに基づいて並べ替えてもよい。 A specific description of the word search will be described later using an embodiment. When the control unit 10 receives the notification that the word search is completed from the dictionary search unit 310, the control unit 10 arranges the word search results stored in the temporary storage unit 3 in a predetermined order (S116), The position information is added and stored in the temporary storage unit 3. The display control unit 22 causes the display unit 12 to display the word search result on the display unit 12 (S117). Note that when sorting in the above-mentioned predetermined order, it may be sorted in the order of appearance frequency of the auxiliary information, or, as described in each of the above-mentioned embodiments, candidates are determined based on candidate order determination elements (and evaluation value calculation rules). The order determination unit 32 may calculate the evaluation value and rearrange it based on it.
[0219] さらに、ユーザが表示部 12に表示された単語検索結果力も所望の単語を目視して 、所望の単語を選択し、その単語が表示されている位置を指示入力する。指示入力 があると、入力部 11は入力された位置の位置情報を制御部 10の入力受付部 21に 送信する。なお、このとき、候補選択受付部 23が、ユーザにより選択された漢字を検 出し、学習制御手段が、それを学習漢字として、学習辞書記録部 43またはペア学習 辞書記録部 44に記憶してもよ ヽ。  Furthermore, the user also visually checks a desired word as the word search result displayed on the display unit 12, selects the desired word, and designates and inputs the position where the word is displayed. When there is an instruction input, the input unit 11 transmits position information of the input position to the input reception unit 21 of the control unit 10. At this time, even if the candidate selection receiving unit 23 detects the kanji selected by the user and the learning control means stores it as a learning kanji in the learning dictionary recording unit 43 or the pair learning dictionary recording unit 44. Yo.
[0220] 制御部 10は、入力受付部 21で受け付けた位置情報と、一時記憶部 3に記憶して いる単語検索結果の表示位置の情報と比較し、ユーザにより指定された単語を検索 する。そして表示制御部 22は表示部 12にて検索された単語を表示する。  The control unit 10 compares the position information received by the input receiving unit 21 with the information on the display position of the word search result stored in the temporary storage unit 3 to search for a word specified by the user. Then, the display control unit 22 displays the word retrieved by the display unit 12.
[0221] つまり、本発明に係る文字変換装置では、漢字辞書部 41及び単語辞書部 42に、 インデックスと漢字対応情報 144及び単語対応情報 162を持たせ、同一先頭漢字の 単語をグループ化し、漢字対応情報及び単語対応情報に補助情報を持たせ、補助 情報を基に検索対象の漢字及び単語を制限することで、検索時間の短縮を行えるよ うにしている。さらに、検索された候補となる漢字 (単語)の表示順序は、その候補の 補助情報 (候補順序決定要素)に基づいて、候補順序決定部 32によって、ユーザに よって選択される可能性が高 、ものほど優先して表示されるように決定される。したが つて、ユーザが所望する順序に並べ替えられた候補の漢字 (単語)を、より短い時間 で表示することができる。結果として、ユーザの利便性を向上させることが可能となる  That is, in the character conversion device according to the present invention, the kanji dictionary unit 41 and the word dictionary unit 42 are provided with the index and the kanji correspondence information 144 and the word correspondence information 162 to group the words of the same leading kanji. Auxiliary information is added to the correspondence information and the word correspondence information, and the search time can be shortened by limiting the search target kanji and words based on the auxiliary information. Furthermore, the display order of the retrieved kanji characters (words) is likely to be selected by the user by the candidate order determination unit 32 based on the auxiliary information (candidate order determination factor) of the candidate. It is decided that things will be displayed with priority. Therefore, candidate kanji (words) rearranged in the order desired by the user can be displayed in a shorter time. As a result, user convenience can be improved.
[0222] なお、以下の実施例 5— 1、 5— 2では、上記説明した文字変換装置 1を、ピンイン( 表音文字)を入力して漢字 (変換文字)に変換する中国語変換装置として説明する。 この場合、以下で詳細に説明するが、漢字インデックス 145、漢字対応情報 144、単 語インデックス 161、単語対応情報 162は、次のようになっているものとする。 In the following embodiments 5-1 and 5-2, the Chinese character conversion device 1 described above is converted into a Chinese character (converted character) by inputting pinyin (phonetic characters). explain. In this case, although described in detail below, it is assumed that the kanji index 145, the kanji correspondence information 144, the word index 161, and the word correspondence information 162 are as follows.
[0223] 漢字インデックス 145は、図 3に示すようになっており、ピンインの先頭文字 (先頭の 表音文字)と、このピンインの先頭文字を漢字のピンイン (表音情報)の先頭として持 つ漢字の、漢字対応情報 144での漢字アドレス (格納先情報)とが対応付けられて!/、 る。図 3で、漢字インデックス 145の左の記載(cidx— ml、 cidx— m2等)は、漠字ィ ンデッタス 145内でのアドレスである。具体的には、 cidx— mlとは、「読みの先頭が mであり、 1声」の漢字インデックス 145内でのアドレスである。  The kanji index 145 is as shown in FIG. 3. The kanji index 145 has the first character of pinyin (the first phonetic character) and the first character of this pinyin as the first of the pinyin of kanji (phonetic information). The kanji address (storage destination information) in the kanji correspondence information 144 of the kanji is associated! In FIG. 3, the description on the left of the Kanji index 145 (cidx-ml, cidx-m2, etc.) is an address in the vague index 145. Specifically, cidx-ml is an address in the kanji index 145 of "the beginning of reading is m and one voice".
[0224] また、漢字対応情報 144は、図 4に示すように、漢字とそのピンインと、補助情報(図 4では、検索条件および頻度情報)とが対応付けられている。図 4の漢字対応情報 14 4の左の記載(cp— ml、 cp— m2等)は、漢字対応情報 144内でのアドレスである。  Further, in the kanji correspondence information 144, as shown in FIG. 4, kanji and its pinyin are associated with auxiliary information (in FIG. 4, search conditions and frequency information). The description (cp-ml, cp-m2, etc.) on the left of the kanji correspondence information 144 in FIG. 4 is an address in the kanji correspondence information 144.
[0225] また、単語インデックス 161は、図 5に示す 1次インデックス(単語 1次インデックステ 一ブル) 161aおよび図 6に示す 2次インデックス(単語 2次インデックステーブル) 16 lbを有している。図 5に示すように、 1次インデックス 161aは、漢字とそのコードと、そ の漢字を先頭として持つ単語の 2次インデックス 161bでのアドレスが対応付けられて いる。 2次インデックス 16 lbは、漢字と第 2文字目の漢字の先頭ピンインと、その単語 対応情報 162でのアドレスが対応付けられている。図 6の 2次インデックス 161bの左 の記載 (widx— di、 widx— gi等)は、 2次インデックス 161b内でのアドレスである。  The word index 161 has a primary index (word primary index table) 161a shown in FIG. 5 and a secondary index (word secondary index table) 16 lb shown in FIG. As shown in FIG. 5, in the primary index 161a, the kanji and its code are associated with the address at the secondary index 161b of the word having the kanji as the head. In the secondary index 16 lb, the top Pinyin of the kanji and the second kanji is associated with the address in the word correspondence information 162. The description on the left of the secondary index 161b (widx-di, widx-gi, etc.) in FIG. 6 is an address in the secondary index 161b.
[0226] さらに、単語対応情報 162は、図 7に示すように、単語とその読みのピンインが対応 付けられている。図 7の単語対応情報 162の左の記載 (wp— di、 wp— gi等)は、単 語対応情報 162内でのアドレスである。なお、漢字対応情報 144にて、漢字コードと ピンインとが対応している場合には、 1次インデックス 161aは必ずしも必要ではない。  Further, in the word correspondence information 162, as shown in FIG. 7, a pin is associated with a word and its reading. The description (wp−di, wp−gi, etc.) on the left of the word correspondence information 162 in FIG. 7 is an address in the word correspondence information 162. When the kanji code and the pinyin correspond to each other in the kanji correspondence information 144, the primary index 161a is not necessarily required.
[0227] (実施例 5— 1)  Example 5-1
本実施例では、上記説明した文字変換装置 1を、ピンインを入力して漢字に変換す る中国語変換装置として説明するが、これに限定されず、例えばひらがな (あるいは カタカナ)を入力して漢字を含む単語を決定する装置であってもよい。  In the present embodiment, although the character conversion device 1 described above is described as a Chinese conversion device which converts pinyin to kanji, the invention is not limited thereto. For example, a hiragana (or katakana) may be input to kanji. The apparatus may be a device that determines a word including
[0228] 具体的に、ユーザが「外字 1」を入力するために、「外字 2」の中国読みであるピンィ ンの「mei」を入力した場合にっ 、て説明する。 [0229] 初めに、漢字検索の処理について説明する。なお、入力読みと漢字情報の読みと がー致する条件は、入力読みが漢字情報の読みに前方一致であることとする。 [0228] Specifically, the case will be described where the user inputs "mei", which is a Chinese reading of "extra character 2" to input "extra character 1". First, the process of the kanji search will be described. The condition for the input reading to match the reading of the kanji information is that the input reading matches the reading of the kanji information.
[0230] (漢字検索)  (Kanji search)
辞書検索部 310の漢字取得部 311は、一時記憶部 3から読み文字列「mei」を辞書 検索部 310の内部バッファに読み込む。そして、辞書検索部 310は読み文字の先頭 文字「m」を基に以下の式で漢字インデックス 145の「m」の範囲のアドレスを求め、そ れらのアドレスに記述している漢字アドレスを内部バッファに記憶する。  The Chinese character acquisition unit 311 of the dictionary search unit 310 reads the read character string "mei" from the temporary storage unit 3 into the internal buffer of the dictionary search unit 310. Then, the dictionary search unit 310 obtains an address in the range of “m” of the kanji index 145 according to the following expression based on the first character “m” of the reading character, and the kanji address described in those addresses Store in buffer.
[0231] アドレスは、  The address is
(1)入力読みに四声コードがある場合  (1) If there is a four voice code in the input reading
先頭アドレス = (入力読みの文字コード ベースの読みの文字コード)  Start address = (character code of input reading character code of reading character code)
X漢字インデックスの単位サイズ X 5  Unit size of X kanji index X 5
+ (入力読みの四声コード 1) X漢字インデックスの単位サイズ 終了アドレス = (入力読みの文字コード ベースの読みの文字コード)  + (Input reading four voice code 1) Unit size of X kanji index End address = (Character code of input reading character code base reading character code)
X漢字インデックスの単位サイズ X 5  Unit size of X kanji index X 5
+入力読みの四声コード X漢字インデックスの単位サイズ  + Four voice code of input reading Unit size of X kanji index
(2)入力読みに四声コードが無い場合  (2) When there is no four voice code in input reading
先頭アドレス = (入力読みの文字コード ベースの読みの文字コード)  Start address = (character code of input reading character code of reading character code)
X漢字インデックスの単位サイズ X 5  Unit size of X kanji index X 5
終了アドレス = (入力読みの文字コード ベースの読みの文字コード + 1) End address = (Character code of input reading Character code of reading + 1)
X漢字インデックスの単位サイズ X 5 Unit size of X kanji index X 5
として決定される。  It is determined as
[0232] 上記アドレスを決定する、「5」を掛けているのは、次の理由力もである。中国語の発 音は、ピンインや注音のように読みを表す部分と発声の上げ下げ (調子)を表す声調 (四声)の組み合せである。声調が 5種類(1〜4声と軽声)あるので、 5としている。  [0232] The reason why the above address is determined and multiplied by "5" is also the following reason. Chinese pronunciation is a combination of a part that represents reading like pinyin and a gushing and a tone (four voice) that indicates ups and downs of the utterance (tone). There are 5 types of tones (1 to 4 voices and light voices), so 5 is used.
[0233] ピンインによる中国語入力では、ピンインはアルファベットで表現されるので、ベー スの読みの文字コードは「a ( = 0x61 (アスキーコード))」とし、注音による中国語入力 では、ベースの読みの文字コードは「  [0233] In Chinese input by Pinyin, since Pinyin is expressed as an alphabet, the character code of base reading is "a (= 0x61 (ASCII code))", and in Chinese input by sound injection, base reading is used. The character code of "
[0234] [表 5] [0235] ( = 0xa373)」とすればよい。 [Table 5] It may be set as “(= 0xa 373)”.
[0236] 漢字インデックス(漢字インデックテーブル) 41の単位サイズは、単語インデックス ( 単語インデックテーブル) 51をピンインの先頭文字のアルファベット順とし、同じピン インの先頭文字は四声の 1から 5の順に並ぶように構成していれば、漢字アドレスの バイト長となる。例えば、本実施として動作している装置では 4バイトとなっている。  The unit size of the kanji index (kanji index table) 41 has word index (word index table) 51 in alphabetical order of the first letters of Pinyin, and the first letters of the same pinyin are arranged in the order of 1 to 5 in four voices If configured as above, it will be the byte length of the kanji address. For example, in the device operating as this implementation, it is 4 bytes.
[0237] ここでは、四声コードはないので、 「m ( = 0x6d)」の範囲のアドレスは(2)式により、 先頭アドレス力 S「(0x6d— 0x61) X漢字インデックスの単位サイズ X 5」(つまり、図 3 の cidx— ml)、終了アドレスが「(0x6d— 0x61 + 1) X漢字インデックスの単位サイ ズ X 5」(つまり、図 3の cidx— nl)によって計算され、各アドレスが指し示す漢字イン デッタスに記述している漢字アドレス(cp— ml、及び cp— nl)を内部バッファに記憶 する。図 3の cidx— mlは、読みの先頭が mで 1声のもの、の漢字インデックス 145内 でのアドレスを表している。それに対応する cp— mlは、読みの先頭が mで 1声の読 みの漢字対応情報 144のアドレスを表している。 cidx— m5、 cp— m5は読みの先頭 が mで軽声に対応して!/、る。  Here, since there is no four-voice code, the address in the range of “m (= 0x6d)” is the head address strength S “(0x6d−0x61) X Kanji index unit size X 5” according to equation (2). The end address is calculated by “(0x6d — 0x61 + 1) Unit size of X kanji index X 5” (ie, cidx — nl in FIG. 3), and each address points The kanji address (cp-ml and cp-nl) described in the kanji index is stored in the internal buffer. In FIG. 3, cidx-ml in the figure represents the address in the Kanji index 145 of the m-first reading of one reading. The corresponding cp-ml represents the address of the single-speech kanji correspondence information 144 with the m beginning of the reading. cidx-m5, cp-m5 has m at the beginning of the reading and corresponds to a light voice!
[0238] なお、(0 6(1—(^61) = 12でぁり、(0x6d— 0x61 + 1) = 13となる。  [0238] Note that (0 6 (1-(^ 61) = 12) and (0 x 6 d-0 x 61 + 1) = 13.
[0239] 辞書検索部 310は、検索対象のアドレス cpiを cp— mlとする。次に辞書検索部 31 0は、漢字対応情報 144からアドレス cpiの漢字情報を読み込み、入力ピンイン数を 基に検索条件を判定する。その結果、検索条件を満足していれば、入力読みと漢字 情報の読みとの比較を行い、入力読みが漢字情報の読みに前方一致であれば一致 したものとして一時記憶部 3に該漢字情報を記憶し、 cpiのみ更新する。  The dictionary search unit 310 sets the address cpi to be searched as cp−ml. Next, the dictionary search unit 310 reads the kanji information of the address cpi from the kanji correspondence information 144, and determines the search condition based on the number of input pinyins. As a result, if the search conditions are satisfied, the input reading and the reading of the kanji information are compared, and if the input reading matches the reading of the kanji information, the temporary storage unit 3 is regarded as a match. Remember and update cpi only.
[0240] 図 4に示す漢字対応情報 144の例では、検索条件を以下のようにしている。  In the example of the kanji correspondence information 144 shown in FIG. 4, the search condition is as follows.
検索条件は、  Search condition is
0 :無条件に検索対象とする  0: Unconditionally search target
1 :入力ピンイン数が 1文字であれば、検索対象としない  1: If the number of input pinyin is 1 letter, it will not be searched
2:入力ピンイン数が 2文字以下であれば、検索対象としな!/ヽ  2: If the number of input pinyin is 2 characters or less, search target! / ヽ
3:入力ピンイン数が 3文字以下であれば、検索対象としな!/ヽ  3: If the number of input pinyin is 3 characters or less, search target! / ヽ
とする。 [0241] 入力ピンインが「mei」の場合、入力ピンイン数が 3であるため、検索条件が 0、 1、 2 に対応している漢字が、検索処理の対象となる。 I assume. [0241] When the input pinyin is "mei", since the number of input pinyins is 3, kanji whose search condition corresponds to 0, 1, 2 are to be searched for.
[0242] 上記処理は、 cpiが cp_nlに等しくなるまで処理を継続し、 cpiが cp_nlと等しくな れば、一時記憶部 3に記憶している漢字情報を補助情報のひとつである出現頻度で 並べ換えを行い、漢字検索処理を終了する。  The above processing continues processing until cpi becomes equal to cp_nl, and if cpi becomes equal to cp_nl, the kanji information stored in temporary storage unit 3 is rearranged with the appearance frequency which is one of the auxiliary information. To complete the kanji search process.
[0243] なお、上記説明では、四声コードの入力が行われない場合で説明した力 四声コ ード付きで「mei3」が入力された場合は、「m」の第 3声のアドレスが上記(1)式により 計算され、先頭アドレスが「(0x6d— 0x61) X漢字インデックスの単位サイズ X 5+ ( 3- 1) X漢字インデックスの単位サイズ」(つまり、図 3の cidx— m3)、終了アドレスが 「(0x6d— 0x61) X漢字インデックスの単位サイズ X 5 + 3 X漢字インデックの単位 サイズ」(つまり、図 3の cidx— m4)となり、各アドレスが指し示す漢字インデックスに 記述して!/ヽる漢字アドレス(cp— m3、及び cp— m4)を内部バッファに記憶する。  In the above description, when “mei 3” is input with the four-voice code described in the case where the four-voice code is not input, the address of the third voice of “m” is Calculated by the above equation (1), the start address is "(0x6d-0x61) unit size of X kanji index X 5 + (3-1) unit size of X kanji index" (that is, cidx-m3 in Fig. 3), The end address is "(0x6d-0x61) X Kanji index unit size X 5 + 3 X Kanji index unit size" (that is, cidx-m4 in Fig. 3) and described in the Kanji index pointed to by each address! Store the kanji address (cp-m3 and cp-m4) in the internal buffer.
[0244] なお、漢字アドレスを求めた後の処理は上記検索処理と同様であるため説明を省 略する。  [0244] Note that the process after obtaining the kanji address is the same as the above search process, and hence the description is omitted.
[0245] (単語検索)  [Word Search]
以下に単語検索の説明を行う。具体的に、ユーザが「外字 1」の中国読みであるピ ンインの「111 」を入力し、漢字検索結果が一時記憶部 3に保持され、表示制御部 22 表示装置に表示した後に漢字検索結果から「外字 2」を指示入力したもとのして説明 を行う。  The following explains the word search. Specifically, the user inputs the pin-in “111” of Chinese reading “extra digit 1,” the kanji search result is stored in the temporary storage unit 3, and the display control unit 22 displays it on the display device. Give instructions from "" and "", and explain.
[0246] 単語辞書部 42の 1次インデックス 161aは、図 5に示すように、漢字コードの昇順に 記憶されているものとする。  As shown in FIG. 5, it is assumed that the primary index 161a of the word dictionary unit 42 is stored in ascending order of kanji code.
[0247] 制御部 10は、ユーザの指示位置 (XI、 Y1)と一時記憶部 3に記憶している漢字検 索結果の ¾: ¼ (XI― mm、 yi― mm、 xi― max、 yi― max) u= l〜n、 nは恢 数)を it較し、 xi _ min≤Xl≤xi _ max力つ yi _ min≤Yl≤yi _ maxが成立すると きに, i番目の候補が選択されたものと判断し、対応する漢字を辞書検索部 310に転 送し、単語検索を行うことを指示する。 [0247] Control unit 10 determines the position of the user (XI, Y1) and the result of the kanji search stored in temporary storage unit 3: 1⁄4 (XI-mm, yi-mm, xi-max, yi- max) u = l to n, n is a decimal number, and xi _ min ≤ Xl ≤ xi _ max yi _ min つ Yl ≤ yi _ max is satisfied, the i-th candidate is selected Judging that it has been done, the corresponding kanji is transferred to the dictionary search unit 310, and a word search is instructed to be performed.
[0248] 辞書検索部 310は、単語検索の指示を受けると、転送された漢字のコードに一致 する単語インデックス 161を検索し、単語対応情報 162での検索対象の範囲を求め る。具体的には、辞書検索部 310は、転送された「外字 2」について、 1次インデックス 161aを 2分探索し、「外字 2」と一致する漢字コード C3BFを取得する。そして、漢字 コード C 3BFに対応する 2次インデックス 16 lb内のアドレス「widx— mi」を用いて、 2 次インデックス 161bを参照し、「外字 2」が先頭の単語の単語対応情報 162における アドレス(単語アドレス)「wp— mi」を、「外字 2」が先頭の単語対応情報 162での開始 アドレスとして内部バッファに記憶する。さらに、辞書検索部 310は、 1次インデックス 161aにおいて C3BFの次の漢字コード C3C0 (図 5中の [0248] When receiving the word search instruction, the dictionary search unit 310 searches for the word index 161 that matches the transferred kanji code, and finds the search target range in the word correspondence information 162. Ru. Specifically, the dictionary search unit 310 searches the primary index 161a for the transferred “extra character 2” by two and obtains a kanji code C3BF that matches “extra character 2”. Then, referring to the secondary index 161b using the address "widx-mi" in the secondary index 16 lb corresponding to the kanji code C 3BF, the address in the word correspondence information 162 of the word in which "the extra character 2" is the first word Word address) “wp—mi” is stored in the internal buffer as the start address of the word correspondence information 162 at the head of “extra character 2”. Furthermore, the dictionary search unit 310 is configured to set the kanji code C3C0 next to C3BF in the primary index 161a (see FIG. 5).
[0249] [表 6] 冒 次インデックス 161bを参照し、「外字 6」が先頭の単語の単語対応情報 162における アドレス(単語アドレス)「wp— mi+l」を、「外字 2」が先頭の単語対応情報 162での 終了アドレスとして内部バッファに記憶する。 [Table 6] With reference to the next index 161b, the address (word address) “wp−mi + l” in the word correspondence information 162 of “head digit 6” is the first word, “star 2” is the head. It is stored in the internal buffer as the end address for the word correspondence information 162.
[0251] その後、辞書検索部 310の単語取得部 312は、「wp— mi」から「wp— mi+l」まで の単語情報を読み出し、単語情報の漢字列を検索結果として、一時記憶部 3に記憶 する。そして、候補順序決定部 32が、一時記憶部 3の単語情報 (候補単語)の補助 情報(図示せず)である出現頻度の高い順に並べ替えを行い、単語検索処理を終了 する。なお、この並べ替えは単なる例示であり、他の方法で並べ替えてもよい。例え ば、候補順序決定要素 (要素 A〜要素 G)に基づいて、表示順序を決定してもよい。 最後に、上記したように、検索され並べ替えられた単語を、表示部 12に表示する。  After that, the word acquisition unit 312 of the dictionary search unit 310 reads out the word information from “wp−mi” to “wp−mi + l”, and uses the kanji string of the word information as the search result as a temporary storage unit 3 Remember to Then, the candidate order determination unit 32 rearranges the word information (candidate words) in the temporary storage unit 3 in descending order of appearance frequency, which is supplementary information (not shown), and ends the word search process. In addition, this rearrangement is a mere illustration and may be rearranged by other methods. For example, the display order may be determined based on the candidate order determination factors (element A to element G). Finally, as described above, the retrieved and rearranged words are displayed on the display unit 12.
[0252] (実施例 5— 2)  (Example 5-2)
本実施例 5— 2では、ユーザが単語を入力するために、単語の読みを入力した場合 の動作を説明する。ここで、実施例 5— 1では、入力したい文字列の先頭文字の読み を入力するのに対して、本実施例 5— 2では、入力したい単語の読みを入力するとの 違いがある。しかし、入力部 11から入力された読み文字列を制御部 10の入力受付 部 21が受け付ける処理は実施例 5—1と同様となる。そのため、本実施例ではユー ザが「外字 1」を入力するために「外字 1」の中国読みであるピンインの「meinian」を 入力した後の処理を説明する。 [0253] なお、制御部 10は、入力読みの文字列のピンインを分割する機能を備えて 、るも のとする。入力読み文字列のピンインの分割処理は後述する。 Embodiment 5-2 describes the operation when the user inputs word reading to input a word. Here, in the embodiment 5-1, while the reading of the first character of the character string to be input is input, in the present embodiment 5.2, there is a difference that the reading of the word to be input is input. However, the process in which the input receiving unit 21 of the control unit 10 receives the read character string input from the input unit 11 is the same as that of the embodiment 5-1. Therefore, in the present embodiment, the process after the user inputs “meinian” of Pinyin, which is a Chinese reading of “extra character 1” to input “extra character 1” will be described. The control unit 10 has a function of dividing pinyin of the character string of the input reading, and is assumed to be a personal computer. The pinyin division process of the input read character string will be described later.
[0254] 制御部 10は、入力部 11から読み文字列を「meinian」を受けると、読み文字列をピ ンインに分割し、分割結果として、「mei」と「nian」とでピンインに分割されたものが 2 つあることを示す分割数 (あるいは、分割位置が 2つあることを示す分割位置数) 2を 一時記憶部 3に記憶する。  [0254] When control unit 10 receives the reading character string "meinian" from input unit 11, it divides the reading character string into pin-ins, and as a division result, it is divided into pinyins by "mei" and "nian". The number of divisions indicating that there are two (or the number of division positions indicating that there are two division positions) 2 is stored in the temporary storage unit 3.
[0255] 辞書検索部 310の漢字取得部 311は、一時記憶部 3から読み文字列「mei」を読み 込み、辞書検索部 310の内部バッファに記憶する。漢字取得部 311は、「mei」の先 頭文字「m」を基に前記(2)式を用いて、漢字インデックス 145を参照して、「m」の範 囲のアドレスを求め、それらのアドレスに記述している漢字アドレスを内部バッファに feす。。  The kanji acquisition unit 311 of the dictionary search unit 310 reads the read character string “mei” from the temporary storage unit 3 and stores the read character string “mei” in the internal buffer of the dictionary search unit 310. The kanji acquisition unit 311 refers to the kanji index 145 using the above equation (2) based on the initial letter “m” of “mei” to obtain an address in the range of “m”, and those addresses The kanji address described in is fed to the internal buffer. .
[0256] ここでも四声コードがないので、先頭アドレス力 S「(0x6d— 0x61) X漢字インデック スの単位サイズ X 5」(つまり、図 3の cidx— ml)、終了アドレスが「(0x6d— 0x61 + 1) X漢字インデックスの単位サイズ X 5」(つまり、図 3の cidx— nl)によって計算され 、漢字辞書部 41の漢字インデックス 145を参照し、各アドレスが指し示す漢字ァドレ ス(cp— ml、及び cp— nl)を内部バッファに記憶する。  Since there is no four-speech code here too, the start address strength S “(0x6d—0x61) X Kanji index unit size X 5” (ie, cidx—ml in FIG. 3), the end address “(0x6d— 0x61 + 1) The unit size of X kanji index X 5 "(that is, calculated according to cidx − nl in FIG. 3), referring to kanji index 145 of kanji dictionary section 41, and kanji address indicated by each address (cp — ml , And cp-nl) are stored in the internal buffer.
[0257] 辞書検索部 310の漢字取得部 311は、検索対象のアドレス cpiを cp— mlとする。  The kanji acquisition unit 311 of the dictionary search unit 310 sets the address cpi to be searched as cp−ml.
次に漢字取得部 311は、アドレス cpiの漢字情報を読み込み、漢字対応情報 144を 参照し、入力ピンイン数を元に検索条件を判定する。その結果、検索条件を満足し ていれば、入力読みと漢字情報の読みとを比較する。  Next, the kanji acquisition unit 311 reads kanji information at the address cpi, refers to the kanji correspondence information 144, and determines a search condition based on the number of input pins. As a result, if the search condition is satisfied, the input reading is compared with the reading of the kanji information.
[0258] 入力読みが漢字情報の読みと完全一致であれば、一時記憶部 3に該漢字情報と 完全一致であること (例えば、 1)を対応付けて記憶して、 cpiを更新する。  If the input reading is a complete match with the reading of the kanji information, the temporary storage unit 3 stores that the complete matching with the kanji information (for example, 1) is associated and stored, and cpi is updated.
[0259] 入力読みが漢字情報の読みに前方一致であれば一致したものとして一時記憶部 3 に該漢字情報と前方一致であること (例えば、 0)を対応付けて記憶して、 cpiを更新 する。  [0259] If the input reading matches the reading of the kanji information, it is regarded as a match and it is stored in the temporary storage unit 3 that the kanji information and the leading match (for example, 0) are associated and stored cpi Do.
[0260] 完全一致あるいは前方一致でなければ、 cpiのみ更新する。  [0260] If it is not a perfect match or a forward match, only cpi is updated.
[0261] そして、 cpiが cp— nlと等しくなれば、候補順序決定部 32が、一時記憶部 3に記憶 している漢字情報のうち、完全一致の漢字情報を出現頻度で並べ換えを行い、さら に、前方一致の漢字情報を完全一致の漢字情報より候補順位が下位となるように出 現頻度で並べ替えを行い、漢字検索処理を終了する。候補順序決定部 32は、さら に、他の補助情報 (例えば、候補順序決定要素 (要素 A〜要素 G) )に基づいて、表 示順序を決定してもよい。 Then, if cpi becomes equal to cp−nl, candidate order determination unit 32 rearranges perfect match kanji information among the kanji information stored in temporary storage unit 3 according to the appearance frequency, and Then, the front matching kanji information is rearranged at the appearance frequency so that the candidate order is lower than the perfect matching kanji information, and the kanji search processing is ended. The candidate order determination unit 32 may further determine the display order based on other auxiliary information (eg, candidate order determination factors (element A to element G)).
[0262] 上記処理にて、読み文字列「mei」に対して、「mei」に完全一致の漢字情報として「 外字 2」の漢字情報が検索結果として一時記憶部 3に記憶されている。 In the above process, kanji information of “extra character 2” is stored as a search result in the temporary storage unit 3 as kanji information that completely matches “mei” for the read character string “mei”.
[0263] 次に単語検索処理の説明を行う。 Next, the word search process will be described.
[0264] 上記の漢字検索後に表示された漢字検索候補力 ユーザが「外字 2」を指示したも のとして説明する。なお、ユーザが「外字 2」の指示に対する候補選択の処理は、実 施例 5— 1に記載の処理と同様であるため説明を省略する。  The kanji search candidate power displayed after the above-described kanji search is described as the user specifying “extra character 2”. The process of candidate selection in response to the instruction of “extra character 2” by the user is the same as the process described in the embodiment 5-1, and therefore the description thereof is omitted.
[0265] また、 1次インデックス 161a、 2次インデックス 161b、及び単語対応情報 162は、漠 字コードの昇順に並んでいるものとする。  Further, it is assumed that the primary index 161a, the secondary index 161b, and the word correspondence information 162 are arranged in the ascending order of the vague code.
[0266] 辞書検索部 310の単語取得部 312は、単語検索の指示を受けると、単語インデッ タス 161の 1次インデックスを参照して、転送された漢字のコードに一致する第 1のィ ンデッタス情報を取得する。例えば、 1次インデックスでは、単語の先頭漢字の漢字コ ード順に並んでおり、単語取得部 312は、 2分探索により転送された漢字のコードと 一致する第 1のインデックス情報を検索して、転送された漢字コードに一致する第 1の インデックス情報と、 1次インデックス内で並べられている次の漢字コードの第 1のイン デッタス情報と、を内部バッファに記憶する。  [0266] When the word acquisition unit 312 of the dictionary search unit 310 receives a word search instruction, it refers to the primary index of the word index 161, and the first index information that matches the transferred kanji code. To get For example, in the primary index, they are arranged in the kanji code order of the leading kanji of a word, and the word acquisition unit 312 searches for first index information that matches the kanji code transferred by binary search. The first index information matching the transferred kanji code and the first index information of the next kanji code arranged in the primary index are stored in the internal buffer.
[0267] ここでは、ユーザは「外字 2」を指示しているので、 1次インデックス 161aにおいて、 漢字コードに一致する 2次インデックスアドレスは widx— mi、次の漢字コードの 2次ィ ンデッタスアドレスは widx— mi+1であり、これらが内部バッファに記憶される。  Here, since the user designates “extra character 2”, in the primary index 161a, the secondary index address that matches the kanji code is widx−mi, and the secondary index address of the next kanji code is The address is widx-mi + 1, which are stored in an internal buffer.
[0268] 辞書検索部 310の単語取得部 312は、一時記憶部 3から分割した読み文字列の 2 番目以降の文字列を内部バッファに読み込む。辞書検索部 310は、 1番目の分割し た読みで検索された漢字 (先頭漢字、 1文字目漢字)と、 2番目の分割された読みの 先頭ピンイン文字とで単語対応情報 162の検索範囲を絞るために、 1次インデックス 161aに付カ卩されている 2次インデックス 161bのアドレスを基に、 2番目の分割された 読みの先頭文字「n」と一致するものを、 2次インデックス 16 lbにて検索し、内部バッ ファに検索された 2次インデックス 161bと、 2次インデックス 161b内で並べられている 次の第 2のインデックス情報を記憶する。ここでは、 1文字目の漢字「外字 2」と 2番目 の分割読みの先頭文字「n」とに一致する 2次インデックス情報 wp— mi、および、 2次 インデックス内で並べられて 、る次の 2次インデックス情報 wp— mi+k、が内部バッ ファに記憶される。 [0268] The word acquisition unit 312 of the dictionary search unit 310 reads the second and subsequent character strings of the divided read character string from the temporary storage unit 3 into the internal buffer. The dictionary search unit 310 searches the search range of the word correspondence information 162 for the first divided kanji (first kanji, first kanji) and the first pinyin character of the second divided reading. In order to narrow down, based on the address of the secondary index 161b added to the primary index 161a, the one that matches the first character "n" of the second divided reading is added to the secondary index 16 lb. Search It stores the secondary index 161b searched for in the file and the next second index information arranged in the secondary index 161b. Here, secondary index information wp— matching the first character kanji “extra character 2” and the first character “n” of the second split reading wp—mi, and the secondary index information arranged in the second index Secondary index information wp-mi + k is stored in the internal buffer.
[0269] 辞書検索部 310の単語取得部 312は、検索対象のアドレス wpiを第 2インデックス 情報に付カ卩されて 、る単語情報のアドレス wp— miとする。「wp— mi」から「wp— mi +k」までの単語情報を読み出しここで、単語対応情報 162での単語情報の読みは、 単語の各漢字とを対応付けるため、例えば、「外字 1」の読みを「MeiNian」のように 単語の各漢字に対応する読みの先頭部分を大文字表記されているものとする。  [0269] The word acquisition unit 312 of the dictionary search unit 310 appends the address wpi to be searched to the second index information as the address wp-mi of the word information to be searched. The word information from “wp—mi” to “wp—mi + k” is read out. Here, reading of the word information in the word correspondence information 162 corresponds to each kanji of the word. It is assumed that the reading is capitalized at the beginning of the reading corresponding to each kanji of the word as "MeiNian".
[0270] 次に辞書検索部 310の単語取得部 312は、アドレス wpiの単語情報を読み込み、 単語対応情報 162を参照して、単語対応情報 162の漢字の個数と入力読みの分割 数を比較する。単語情報の漢字の個数が入力読みの分割数以下であれば、単語情 報の各漢字に対応する読みと分割読みを先頭から比較し、読みが完全一致の漢字 数と読みが前方一致の漢字数をカウントする。  Next, the word acquisition unit 312 of the dictionary search unit 310 reads the word information of the address wpi, refers to the word correspondence information 162, and compares the number of kanji of the word correspondence information 162 with the number of divisions of the input reading. . If the number of kanji in the word information is equal to or less than the number of divisions in the input reading, the reading and the divided reading corresponding to each kanji in the word information are compared from the beginning, and the number of kanji with exactly matching reading and the leading kanji reading Count the number.
[0271] 「単語情報の漢字の個数」と「読みが完全一致の漢字数 +読みが前方一致の漢字 数」がー致すれば、一時記憶部 3に単語情報と読みが完全一致の漢字数と読みが前 方一致の漢字数を記憶し、 wpiを更新する。「単語情報の漢字の個数」と「読みが完 全一致の漢字数 +読みが前方一致の漢字数」がー致しなければ、 wpiを更新する。  If “number of kanji in word information” and “number of kanji with exactly matching reading + number of kanji with leading matching” are matched, temporary memory unit 3 has the number of kanji with perfect matching of word information and reading. The word and the word memorize the number of kanji in the forward match, and wpi is updated. If “Number of Kanji in Word Information” and “Number of Kanji with Perfect Match of Reading + Number of Kanji with Forward Match” do not match, update wpi.
[0272] そして、 wpiく wp— mi +kであれば処理を継続し、 wpi=wp— mi+kとなれば、処 理を終了する。  Then, if wpi で あ れ ば wp-mi + k, the process is continued, and if wpi = wp-mi + k, the process is ended.
[0273] 辞書検索部 310は、一時記憶部 3に記憶している単語情報において、以下の条件 で順位付けを行う。  The dictionary search unit 310 ranks the word information stored in the temporary storage unit 3 under the following conditions.
(i)読みの分割数と「読みが完全一致の漢字数 +読みが前方一致の漢字数」とが等 しい単語情報  (i) Word information in which the number of divisions of reading is equal to the number of kanji with exactly matching reading + the number of kanji with leading reading.
•「読みが完全一致の漢字数」が多 、ほど上位候補とする  • The number of "perfectly matched kanji" is the top candidate
•「読みが完全一致の漢字数」が同じものは、出現頻度で順位付ける  • If the number of kanji with exactly the same reading is the same, rank by frequency of occurrence
(ii)読みの分割数と「読みが完全一致の漢字数 +読みが前方一致の漢字数」とが 等しくない単語情報 (ii) The number of divisions of reading and "the number of kanji with exactly matching reading + the number of kanji with leading reading match" Unequal word information
•「読みが完全一致の漢字数」が多 、ほど上位候補とする  • The number of "perfectly matched kanji" is the top candidate
•「読みが完全一致の漢字数」が同じものは、出現頻度で順位付ける  • If the number of kanji with exactly the same reading is the same, rank by frequency of occurrence
最終的にユーザが表示された検索結果から所望の文字列を選択することで、入力 が行われるが、上記 (i)の条件に対応する検索結果を選択すると、新たに読み文字 列の入力を行うことになるが、(ii)の条件に対応する検索結果を選択した場合は、入 力した読み文字列の一部に対応するため、選択した文字列に対応する読み文字列 を除く読み文字列を制御部 10の入力読みとして上記処理を繰り返すことができる。  When the user finally selects a desired character string from the displayed search results, input is performed, but when a search result corresponding to the condition of (i) is selected, a new reading character string is input. If you select the search result that corresponds to the condition of (ii), it corresponds to a part of the input reading character string, so the reading character except the reading character string that corresponds to the selected character string The above process can be repeated with the column as the input reading of the control unit 10.
[0274] 上記説明では、単語検索で単語情報の漢字の個数と「読みが完全一致の漢字数  [0274] In the above description, the number of kanji in the word information and the number of kanji in which the reading is a perfect match in the word search
+読みが前方一致の漢字数」がー致したときのみ単語検索結果としている。しかし、 単語情報の漢字の個数が「読みが完全一致の漢字数 +読みが前方一致の漢字数」 より多い場合も単語検索結果としてもよい。このような構成にすることで、所望の単語 の先頭力 数文字分の読みだけで入力することが可能となる。  + Only when the number of kanji characters in "+" matches, the word search result is used. However, if the number of kanji in the word information is more than “the number of completely matching kanji + the number of leading kanji reading”, the word search result may be used. With this configuration, it is possible to input only by reading the first few characters of the desired word.
[0275] また、上記説明では、表示された漢字検索結果から所望の漢字をユーザが選択す ることで単語検索を開始している。しかし、漢字検索が終了した後で、漢字検索結果 を表示せずに候補順位 1位の漢字について単語検索を開始し、単語検索が完了し たときに検索結果を表示してもよい。あるいは、漢字検索が終了した後で、漢字検索 結果を表示せずに候補の一部もしくは全部について単語検索を開始し、単語検索が 完了したときに検索結果を表示してもよ 、。  [0275] Further, in the above description, the word search is started by the user selecting a desired kanji from the displayed kanji search results. However, after the kanji search is completed, the word search may be started for the first ranked kanji candidate without displaying the kanji search result, and the search result may be displayed when the word search is completed. Alternatively, after the kanji search is completed, the word search may be started for some or all of the candidates without displaying the kanji search results, and the search results may be displayed when the word search is completed.
[0276] 次に、入力読みのピンイン (n文字)の分割処理について説明する。ここで分割の単 位を音節と呼ぶ。  [0276] Next, division processing of pinyin (n characters) of input reading will be described. Here, the unit of division is called syllable.
[0277] 分割処理は、以下のように行う。  Division processing is performed as follows.
(a)分割位置 iを 1にする  (a) Set the division position i to 1
(b)先頭文字力も i文字目を第 1音節とし、 i+ 1文字目力も韻母、声母、声母 +韻母と して成立する最長のピンインを求め、第 2音節とする  (b) The first character strength also sets the i-th character as the first syllable, and the i + 1st character also finds the longest pinyin that holds as a rhyme, a vocal mother and a vocal mother + a rhyme, and makes it a second syllable
(c) i文字目の分割位置の評価値 (以下、位置評価値と称する)として、第 1音節長 + 第 2音節長を保持する  (c) Hold the first syllable length + the second syllable length as an evaluation value of the i-th character division position (hereinafter referred to as a position evaluation value)
(d)分割位置 iに 1を加算し (i=i+ l)、 i>nであれば (e)、そうでなければ (b)へ処理 を移す (d) Add 1 to the division position i (i = i + l), and if i> n (e), otherwise process to (b) Transfer
(e)最大位置評価値を有するものとして処理された分割位置の個数により、最終的な 分割位置を確定する  (e) Determine the final division position by the number of division positions processed as having the maximum position evaluation value
(e— 1)最大位置評価値を有するものとして処理された分割位置の個数が 1の場合 最大位置評価値を有するものとして処理された分割位置を最終的な分割位置と する  (e-1) When the number of division positions processed as having the maximum position evaluation value is 1 The division position processed as having the maximum position evaluation value is set as the final division position
(e— 2)最大位置評価値を有するものとして処理された分割位置が複数個ある場合 (e-2) When there are a plurality of division positions processed as having the maximum position evaluation value
(1) 1音節だけで構成されている (1) It consists of only one syllable
(2)第 1音節、第 2音節とも 2文字以上  (2) 1st syllable, 2nd syllable both 2 letters or more
(3)第 1音節の文字数が第 2音節よりも多い  (3) The first syllable has more characters than the second syllable
の順に、上位の条件を有する分割位置を最終的な分割位置とする 具体例として、「meinian」の分割処理について説明する。入力ピンインの一部或い は全部がピンインとして成立する力否かは、ピンイン辞書に登録されて 、るピンインと 比較し、一致するピンインがあればピンインとして成立する可能性があるとする。ピン イン辞書は、図 9に示すように、ピンインとして成立する可能性のある長さ 1から 7のピ ンインが登録されている。  As a specific example, division processing of “meinian” will be described as a final example of division positions having upper conditions in the order of Whether a force that a part or all of the input pinyin is established as pinyin is registered in the pinyin dictionary and compared with the corresponding pinyin, it is assumed that if there is a matching pinyin, it may be established as pinyin. In the pin-in dictionary, as shown in FIG. 9, pin-ins of lengths 1 to 7 which can be established as pin-ins are registered.
[0278] ここで、入力ピンインとは、ピンインとして入力された文字列であり、 [0278] Here, input pinyin is a string input as pinyin,
(1)ピンインとして登録されて 、る文字あるいは文字列そのもの  (1) A character or string itself registered as Pinyin
(2)ピンインとして登録されて 、る文字あるいは文字列が組み合わされた文字列 (2) A character string or a combination of characters registered as Pinyin
(3) (1) (2)以外の文字列(間違い入力の場合) (3) (1) String other than (2) (in case of mistake input)
の 3種類がある。  There are three types of
[0279] なお、本実施形態 5において、分割処理時に算出される位置評価値は、入力ピンィ ンを 2音節に区切るための区切り位置を決定する場合に、その位置で音節が区切ら れる尤もらしさを数値ィ匕したものである。この位置評価値は、音節区切り位置ごとに算 出される。したがって、上述の実施形態 1〜4で用いられた、候補漢字 Z候補単語ご とに算出される評価値 (候補漢字 Z候補単語がユーザに選択される可能性の大きさ を示す評価値)とは区別される。  In the fifth embodiment, the position evaluation value calculated at the time of division processing determines the likelihood that the syllable is divided at that position when the division position for dividing the input pin into two syllables is determined. It is a numerical value. This position evaluation value is calculated for each syllable break position. Therefore, the evaluation value calculated for each candidate kanji Z candidate word (the evaluation value indicating the magnitude of the possibility that the candidate kanji Z candidate word is selected by the user) used in the above-described first to fourth embodiments Are distinguished.
[0280] 入力ピンイン「meinian」の分割位置 iを 1とする。先頭文字「m」を用いて、図 8で示 すピンイン辞書インデックス (ピンイン辞書インデックステーブル)を参照し、先頭文字 が「m」で 1文字のピンイン辞書アドレス pr— ml及び 2文字のピンイン辞書アドレス pr — m2を読み込む。そして、ピンイン辞書の pr— mlから pr— m2までに記憶されてい るピンインと比較する。この場合、「m」は存在するので、ピンインとして成立する。 [0280] The division position i of the input pinyin "meinian" is set to 1. Shown in Figure 8 with the first letter 'm' The pinyin dictionary index (pinyin dictionary index table) is referenced, and the first character is “m” and the 1-character pinyin dictionary address pr—ml and the 2 character pinyin dictionary address pr — read m2. Then, it compares with pinyin stored in pr−ml to pr−m2 of the Pinyin dictionary. In this case, since "m" exists, it is established as pinyin.
[0281] 次に、「m」を除く「einian」を第 2音節とし、先頭文字が「e」で 6文字のピンイン辞書 アドレス pr— e6及び 7文字のピンイン辞書アドレス pr— e7を読み込み、 pr— e6から p r—e7までに記憶されているピンインと比較する。一致するピンインがあれば、 1文字 目の位置評価値が 7となるが、一致するピンインが無い場合は、第 2音節の最終文字 から 1文字減らして新しい第 2音節とし、同様にピンイン辞書と比較し、一致するピン インがあれば、そのときの第 2音節の長さ +第 1音節の長さ(ここでは 1)を分割位置 1 の位置評価値とする。一致するピンインが無い場合には、さらに、第 2音節の最終文 字から 1文字減らして、ピンイン辞書と比較するという処理を行い、これを、位置評価 値が出るまで繰り返す。 [0281] Next, pinyin dictionary address pr-e6 with 7 characters and "z" as the first character is "e" with "einian" excluding "m" as the second character and "z" in 7 characters is read, pr — Compare with pinyins stored in e6 to pr-e7. If there is a matching pinyin, the position evaluation value of the first character is 7, but if there is no matching pinyin, one character is subtracted from the last character of the second syllable to create a new second syllable, and similarly, If there is a matching pin-in, compare the length of the second syllable + the length of the first syllable (1 in this case) as the position evaluation value of division position 1. If there is no matching pinyin, one more character is subtracted from the last character of the second syllable, and the process is compared with the Pinyin dictionary, and this is repeated until the position evaluation value is obtained.
[0282] 上記処理が終わると、入力ピンイン「meinian」の分割位置 iを i=i+ l ( = 2)とする。  After the above processing is completed, the division position i of the input pinyin “meinian” is set to i = i + l (= 2).
そこで、先頭文字から 2文字「me」を元にピンイン辞書インデックステーブルから「m」 が先頭の 2文字のピンイン辞書アドレス pr—m2から pr—m3までに記憶されているピ ンインと比較する。「11½」が存在すれば、第 1音節を「me」とする。次に「me」を除く「i nian」を第 2音節とし、同様にピンイン辞書に一致するピンインがあるかを検索する。 先頭から切り出し位置 7までの処理を行 ヽ、最大位置評価値の分割位置の評価を行 う。図 10では、最大位置評価値が 7であり、最大位置評価値が 7となる分割位置の個 数が 1なので、最終分割位置を 3と確定する。なお、図 10では、分割位置が 3のもの についての、「分割位置数 2」とは、「mainian」は、第 3文字 iと第 4文字 nの間に 1つ の目の分割位置があり、第 7文字 nとその後(文字はなし)の間に 2つ目の分割位置が あるとここでは判断されているので、よって、分割位置が 2つあるいうことを示している  Therefore, the pin-in dictionary index table based on the first two characters "me" is compared with the pin-ins stored in the first two characters of the pin-in dictionary address pr-m2 to pr-m3. If “111⁄2” is present, let the first syllable be “me”. Next, with the exception of "me", "inian" is taken as the second syllable, and it is similarly searched for whether there is a pinyin that matches the pinyin dictionary. The processing from the top to cutout position 7 is performed, and the division position of the maximum evaluation value is evaluated. In FIG. 10, since the maximum position evaluation value is 7 and the number of division positions for which the maximum position evaluation value is 7 is 1, the final division position is determined to be 3. In FIG. 10, “division number 2” for the division position 3 means that “mainian” has one eye division position between the third character i and the fourth character n. Since it is determined here that there is a second division position between the seventh character n and the subsequent character n (there is no character), it indicates that there are two division positions.
[0283] 次に、分割位置までのピンインを除く「nian」について、上記と同様の処理を繰り返 し、入力ピンインの分割位置を求める。「nian」では、図 11に示すように、分割位置が 求まり、位置評価値 4の分割位置は 2個あるので、 1音節で構成されている「nian」が 選択され、分割位置 4が求まる。 Next, the processing similar to the above is repeated for “nian” except for pinyin up to the dividing position, and the dividing position of the input pinyin is obtained. In “nian”, as shown in FIG. 11, the division position is determined, and since there are two division positions of position evaluation value 4, “nian” made up of one syllable is It is selected and division position 4 is found.
[0284] 分割位置までのピンインを除くと、残りのピンインが無!、ので、入力ピンインの先頭 位置からの位置に補正 (分割位置 3、 7)し、処理を終了する。  [0284] Except for pinyin up to the division position, there is no remaining pinyin, so correction is made to the position from the top position of the input pinyin (division position 3, 7), and the processing is ended.
[0285] なお、以上では、ピンインに対する分割処理について説明した力 注音についても 同様に注音テーブルを構成することで、上記処理にて分割を行うことができる。ピンィ ン辞書のインデックステーブルは、ピンインの最大長が 7であるため、各ピンインの先 頭文字について、長さ 1から 7までの 7個単位で構成している。他方で、注音の最大 長は、 4であるため、各注音の先頭文字について、長さ 1から 4までの 4個単位で構成 すればよい。  [0285] In the above, with regard to the power injection sound described for the division processing for pinyin, division can be performed in the above processing by configuring the sound injection table in the same manner. Since the Pinyin index table has a maximum Pinyin length of 7, each Pinyin initial character is composed of seven pieces from 1 to 7 in length. On the other hand, since the maximum length of the note sound is 4, the first character of each note sound may be composed of four pieces of 1 to 4 in length.
[0286] 以上で説明したように本願発明では、同一先頭漢字の単語をグループィ匕し、漢字 辞書及び単語辞書に入力ピンインに応じて検索対象を制限するための情報を持つ ている。具体的には、漢字辞書は、ピンインの長さに応じた検索対象か否かの情報を 持っている。単語辞書は、インデックス情報に 2文字目漢字の読みの先頭文字を持つ ており、ピンイン列 (漢字 n文字分)を分割し、漢字検索された 1文字目漢字と 2文字 目の先頭読み力も単語検索対象を制限できる。このような構成であるため、検索条件 に一致するものを検索するので、語彙数を増加しても、入力文字列と読みの情報の 比較を行う回数が制限され、応答速度の速い文字列入力装置を提供することができ る。読みの入力が入力したい漢字の 2文字目以降の読みを入力すると、さらに単語の 検索対象を絞ることができるので、検索処理の速!、文字変換装置を提供することが できる。  As described above, in the present invention, the words of the same leading kanji are grouped, and the kanji dictionary and the word dictionary have information for restricting the search target according to the input pinyin. Specifically, the kanji dictionary has information as to whether or not it is a search target according to the pinyin length. The word dictionary has the first character of the second character kanji in the index information, divides the Pinyin sequence (n characters of kanji), and the first character of the first character kanji searched for kanji and the second character also have word reading power You can limit the search target. With such a configuration, search is made for items that match the search conditions. Therefore, even if the number of vocabulary is increased, the number of comparisons between the input character string and the reading information is limited, and character string input with high response speed is performed. Equipment can be provided. By inputting the second and subsequent readings of the kanji you want to input, you can further narrow down the search target of the word, so it is possible to provide a search processing speed and a character conversion device.
[0287] さらに、上述の検索処理によって求まった漢字 (単語)の候補は、候補ごとに対応付 けられている補助情報 (あるいは、候補順序決定要素)に基づいて、ユーザによって 選択される可能性が高いものほど優先して表示されるような順序に並べ替えて表示 するので、候補を表示するまでの応答時間を短縮できる上に、ユーザが所望する順 序で候補を表示することができるので、ユーザの利便性をさらに向上させることが可 能となる。  Furthermore, the kanji (word) candidates obtained by the above-described search processing may be selected by the user based on the auxiliary information (or the candidate order determining element) associated with each candidate. Since the higher the priority is displayed in order of priority, the response time until the candidate is displayed can be shortened, and in addition, the candidate can be displayed in the order desired by the user. It is possible to further improve the convenience of the user.
[0288] 最後に、文字変換装置 1の各ブロック、特に候補順序決定部 32、辞書検索部 310 、表示制御部 22、および制御部 10は、ハードウェアロジックによって構成してもよい し、次のように CPUを用いてソフトウェアによって実現してもよい。具体的には、記録 部 40、漢字辞書部 41、単語辞書部 42、一時記憶部 3はメモリ、ハードディスクなどの 記憶装置によって実現され、候補順序決定部 32、辞書検索部 310、表示制御部 22 、制御部 10はコンピュータの中央演算装置 (CPU)及び CPUに実行されるソフトゥェ ァにより実現される。このソフトウエアも記憶装置に格納され、メモリに読み出されて C PUにより実行される。 Lastly, each block of the character conversion device 1, in particular, the candidate order determination unit 32, the dictionary search unit 310, the display control unit 22, and the control unit 10 may be configured by hardware logic. And may be realized by software using a CPU as follows. Specifically, the recording unit 40, the kanji dictionary unit 41, the word dictionary unit 42, and the temporary storage unit 3 are realized by a storage device such as a memory or a hard disk, and the candidate order determination unit 32, the dictionary search unit 310, the display control unit 22. The control unit 10 is realized by a central processing unit (CPU) of a computer and software executed by the CPU. This software is also stored in the storage device, read out to the memory, and executed by the CPU.
[0289] つまり、文字変換装置 1は、各機能を実現する制御プログラムの命令を実行する C PU (central processing unit)、上 dプログフムを格糸内した ROM (read only memory)、 上記プログラムを展開する RAM (random access memory) ,上記プログラムおよび各 種データを格納するメモリ等の記憶装置 (記録媒体)などを備えている。そして、本発 明の目的は、上述した機能を実現するソフトウェアである文字変換装置 1の制御プロ グラムのプログラムコード(実行形式プログラム、中間コードプログラム、ソースプロダラ ム)をコンピュータで読み取り可能に記録した記録媒体を、上記文字変換装置 1に供 給し、そのコンピュータ (または CPUや MPU)が記録媒体に記録されているプロダラ ムコードを読み出し実行することによつても、達成可能である。  That is, the character conversion device 1 executes a CPU (central processing unit) that executes instructions of a control program that realizes each function, a ROM (read only memory) with an upper d program frame, and expands the above program A storage device (recording medium) such as a random access memory (RAM), a memory for storing the above programs and various data, and the like are provided. Then, the object of the present invention is to record the program code (the executable program, the intermediate code program, the source program) of the control program of the character conversion device 1 which is software that realizes the functions described above in a computer readable manner. It is also possible to supply the above recording medium to the character conversion device 1 and the computer (or CPU or MPU) reads out and executes the program code recorded on the recording medium.
[0290] 上記記録媒体としては、例えば、磁気テープやカセットテープ等のテープ系、フロッ ピー(登録商標)ディスク Zハードディスク等の磁気ディスクや CD— ROMZMOZ MD/DVD/CD—R等の光ディスクを含むディスク系、 ICカード (メモリカードを含 む) Z光カード等のカード系、あるいはマスク ROMZEPROMZEEPROMZフラッ シュ ROM等の半導体メモリ系などを用いることができる。  Examples of the recording medium include tape-based media such as a magnetic tape and a cassette tape, magnetic disks such as a floppy disk Z hard disk, and optical disks such as a CD-ROM ZMOZ MD / DVD / CD-R. A disk system, an IC card (including a memory card), a card system such as a Z optical card, or a semiconductor memory system such as a mask ROMZEPROMZEEPROM Z flash ROM can be used.
[0291] また、文字変換装置 1を通信ネットワークと接続可能に構成し、上記プログラムコー ドを通信ネットワークを介して供給してもよい。この通信ネットワークとしては、特に限 定されず、例えば、インターネット、イントラネット、エキストラネット、 LAN, ISDN, V AN、 CATV通信網、仮想専用網(virtual private network)、電話回線網、移動体通 信網、衛星通信網等が利用可能である。また、通信ネットワークを構成する伝送媒体 としては、特に限定されず、例えば、 IEEE1394、 USB、電力線搬送、ケーブル TV 回線、電話線、 ADSL回線等の有線でも、 IrDAやリモコンのような赤外線、 Bluetoo th (登録商標)、 802. 11無線、 HDR、携帯電話網、衛星回線、地上波デジタル網 等の無線でも利用可能である。なお、本発明は、上記プログラムコードが電子的な伝 送で具現化された、搬送波に埋め込まれたコンピュータデータ信号の形態でも実現 され得る。 In addition, the character conversion device 1 may be configured to be connectable to a communication network, and the program code may be supplied via the communication network. The communication network is not particularly limited. For example, the Internet, intranet, extranet, LAN, ISDN, VAN, CATV communication network, virtual private network, telephone network, mobile communication network And satellite communication networks are available. The transmission medium constituting the communication network is not particularly limited. For example, IEEE1394, USB, power line carrier, cable TV line, telephone line, wired line such as ADSL line, IrDA, infrared ray such as remote control, Bluetooth (Registered trademark), 802.11 radio, HDR, mobile phone network, satellite link, terrestrial digital network Etc. can also be used. The present invention can also be realized in the form of a computer data signal embedded in a carrier wave, in which the program code is embodied by electronic transmission.
[0292] 以上のように、本発明に係る文字変換装置は、入力された 1以上の表音文字よりな る入力文字列を変換文字に変換し、当該変換文字力 なる単語を決定できる文字変 換装置において、  As described above, the character conversion device according to the present invention converts an input character string consisting of one or more input phonetic characters into a converted character, and can change the character character of the converted character. In the replacement
(a- 1)変換文字を識別するための変換文字識別情報と、該変換文字の表音を示 す変換文字表音情報とが対応付けられた変換文字対応情報および、(a— 2)入力さ れる表音文字と、該表音文字を上記変換文字表音情報の先頭として持つ上記変換 文字識別情報の、上記変換文字対応情報での格納先を示す変換文字格納先情報 とが対応付けられた変換文字インデックス、を有する変換文字辞書と、  (a-1) converted character correspondence information in which converted character identification information for identifying converted characters and converted character phonetic information indicating phonetic characters of the converted characters are associated; (a-2) input And the converted character storage destination information indicating the storage destination in the converted character correspondence information of the converted character identification information having the phonetic character as the head of the converted character Converted character dictionary, which has converted character index,
(b— 1)単語を識別するための単語識別情報と、該単語の表音を示す単語表音情 報とが対応付けられた単語対応情報および、(b— 2)単語の先頭となる先頭変換文 字を識別するための先頭変換文字識別情報と、該先頭変換文字識別情報が示す先 頭変換文字を先頭に持つ単語の、上記単語対応情報での格納先を示す単語格納 先情報とが対応付けられた単語インデックス、を有する単語辞書と、  (b-1) word correspondence information in which word identification information for identifying a word is associated with word phonetic information indicating the phonetic sound of the word; and (b-2) a head at the beginning of the word The head conversion character identification information for identifying a conversion character, and the word storage destination information indicating the storage destination in the word correspondence information of the word having the head conversion character indicated by the head conversion character identification information A word dictionary having associated word indexes,
(c)入力された上記入力文字列の先頭の表音文字を基に、上記変換文字辞書を 参照し、該入力文字列の先頭の表音文字を上記変換文字表音情報の先頭として持 つ変換文字識別情報を取得する先頭変換文字取得手段と、  (c) referring to the converted character dictionary based on the first phonetic character of the input character string inputted, and holding the first phonetic character of the input character string as the first character of the converted character phonetic information First converted character acquisition means for acquiring converted character identification information;
(d)取得された上記先頭変換文字識別情報を基に、上記単語辞書を参照し、該先 頭変換文字識別情報が示す変換文字を先頭に持つ単語を取得する単語取得手段 と、  (d) A word acquisition unit for acquiring the word having the conversion character indicated by the head conversion character identification information at the beginning by referring to the word dictionary based on the acquired head conversion character identification information.
を備えている。  Is equipped.
[0293] また、本発明に係る文字変換装置は、上記構成に加え、上記先頭変換文字取得手 段が取得した先頭変換文字識別情報が示す先頭変換文字を、ユーザが選択可能に 表示部に表示する表示制御手段と、上記ユーザにより選択された先頭変換文字を受 け付ける受付手段と、を備え、上記単語取得手段は、上記選択された先頭変換文字 の先頭変換文字識別情報に基づ ヽて、単語を取得する処理を行ってもょ ヽ。 [0294] 上記構成によると、ユーザが選択した先頭変換文字に基づいて、つまり、先頭変換 文字を絞って、単語を取得することができる。 In addition to the above configuration, the character conversion device according to the present invention displays the first conversion character indicated by the first conversion character identification information acquired by the first conversion character acquisition unit on the display unit in a user-selectable manner. Display control means for receiving the first conversion character selected by the user, and the word acquiring means is based on the first conversion character identification information of the selected first conversion character. Do you want to process to get the word. [0294] According to the above configuration, it is possible to acquire a word based on the head conversion character selected by the user, that is, by narrowing the head conversion character.
[0295] また、本発明に係る文字変換装置は、上記構成に加え、上記変換文字対応情報で は、さらに、上記変換文字識別情報に、該変換文字識別情報が示す変換文字につ いての表示補助情報が対応付けられており、上記表示制御手段は、上記表示補助 情報に基づき、先頭変換文字を並べ替えて、ユーザが選択可能に表示部に表示し てもよい。  Further, in the character conversion device according to the present invention, in addition to the above configuration, in the conversion character correspondence information, the conversion character identification information further includes a display for the conversion character indicated by the conversion character identification information. The auxiliary information may be associated, and the display control means may rearrange the head conversion characters based on the display auxiliary information and display the selected character on the display unit by the user.
[0296] 上記構成によると、表示補助情報に基づき、先頭変換文字を並べ替えて、ユーザ が選択可能に表示部に表示することができる。表示補助情報とは、例えば、その漠 字の出現頻度の情報であってもよい。表示補助情報はこれ以外の情報であってもよ ぐ表示制御手段が、先頭変換文字を並べ替えることができる情報であればよい。  According to the above configuration, the head conversion characters can be rearranged based on the display auxiliary information, and can be displayed on the display unit in a selectable manner by the user. The display auxiliary information may be, for example, information on the appearance frequency of the vague characters. The display auxiliary information may be any other information as long as the display control means can rearrange the head conversion characters.
[0297] また、本発明に係る文字変換装置は、上記構成に加え、上記単語辞書の単語イン デッタスは、単語の先頭となる先頭変換文字毎に、上記先頭変換文字識別情報と、 単語の 2文字目の先頭の表音文字である 2文字目先頭表音文字との組み合わせに 対して、上記単語対応情報における格納先が割り振られていてもよい。  Further, in the character conversion device according to the present invention, in addition to the above configuration, the word index of the word dictionary includes the head conversion character identification information for each head conversion character to be the head of the word; The storage destination in the word correspondence information may be allocated to the combination with the second phonetic alphabet, which is the first phonetic alphabet of the letter.
[0298] 上記構成によると、単語取得手段は、先頭変換文字識別情報と、単語の 2文字目 の先頭の表音文字である 2文字目先頭表音文字との組み合わせから、単語を検索 することができる。よって、検索範囲をより絞ることができ、より素早い応答が可能とな る。  [0298] According to the above configuration, the word acquiring unit searches for the word from the combination of the head converted character identification information and the second phonetic initial phonetic letter which is the first phonetic alphabet of the second word of the word. Can. Therefore, the search range can be narrowed further and quicker response is possible.
[0299] また、本発明に係る文字変換装置は、上記構成に加え、上記変換文字対応情報で は、さらに、上記変換文字識別情報に、該変換文字識別情報が示す変換文字につ いての取得補助情報が対応付けられており、上記先頭変換文字取得手段は、上記 取得補助情報を基に上記変換文字識別情報を取得してもよい。  Further, in the character conversion device according to the present invention, in addition to the above configuration, in the converted character correspondence information, the converted character identification information is further acquired for the converted character indicated by the converted character identification information. Auxiliary information may be associated, and the head converted character acquisition unit may acquire the converted character identification information based on the acquired auxiliary information.
[0300] 上記構成によると、先頭変換文字取得手段は、変換文字識別情報が示す変換文 字についての取得補助情報を基に、変換文字識別情報を取得することができる。取 得補助情報とは、先頭変換文字取得手段の変換文字辞書を用いた検索を補助する 情報であればよぐ例えば先頭変換文字の表音文字の長さ等である。  [0300] According to the above configuration, the lead conversion character acquisition unit can acquire the conversion character identification information based on the acquisition auxiliary information on the conversion character indicated by the conversion character identification information. The acquisition auxiliary information is, for example, information such as the length of a phonetic character of the head conversion character if it is information that assists the search using the conversion character dictionary of the head conversion character acquisition means.
[0301] また、本発明に係る文字変換装置は、上記構成に加え、上記取得補助情報は、以 下の 4つの情報に分類されて 、てもよ 、。 Further, in the character conversion device according to the present invention, in addition to the above configuration, the acquisition auxiliary information is It is classified into four information below.
(1)変換文字識別情報が示す変換文字が、上記入力文字列に対して、無条件に取 得対象となるという情報。  (1) Information that the converted character indicated by the converted character identification information is unconditionally obtained with respect to the input character string.
(2)変換文字識別情報が示す変換文字が、上記入力文字列が 1文字では、取得対 象外になるという情報。  (2) Information that the converted character indicated by the converted character identification information is excluded from acquisition if the input character string is one character.
(3)変換文字識別情報が示す変換文字が、上記入力文字列が 2文字以下では、取 得対象外になるという情報。  (3) Information that the converted character indicated by the converted character identification information is not obtained when the input character string is two characters or less.
(4)変換文字識別情報が示す変換文字が、上記入力文字列が 3文字以下では、取 得対象外になるとういう情報。  (4) Information that the converted character indicated by the converted character identification information is not obtained when the input character string is three characters or less.
[0302] 上記構成によると、先頭変換文字取得手段が、すばやぐ効果的に先頭変換文字 を取得することができる。  [0302] According to the above configuration, the head conversion character acquisition means can acquire the head conversion character quickly and effectively.
[0303] また、本発明に係る文字変換装置は、上記構成に加え、上記単語対応情報では、 単語の先頭の変換文字が同一のものが 1グループとしてまとめられていてもよい。 Further, in the character conversion device according to the present invention, in addition to the above configuration, in the word correspondence information, those having the same conversion character at the beginning of a word may be grouped as one group.
[0304] 上記構成によると、単語取得手段は、グループとしてまとめられた単語について検 索できるので、すばやぐ効果的に単語を取得することができる。 [0304] According to the above configuration, the word acquisition means can search for words grouped as a group, and therefore can acquire words quickly and effectively.
[0305] さらに、各上記グループ内で、単語は、単語の先頭の表音文字の所定の配列順に 並べられていてもよい。 [0305] Furthermore, in each of the above-mentioned groups, the words may be arranged in a predetermined arrangement order of the first phonetic characters of the words.
[0306] 上記構成によると、単語取得手段は、単語の先頭の表音文字の所定の配列順に並 ベられてグループ内で単語を検索できるので、よりすばやぐ単語を取得することが できる。ここで、単語の先頭の表音文字の所定の配列順とは、例えば、単語の読みが ピンインであればアルファベット順、ひらがなであれば 50音順、等である。  [0306] According to the above configuration, the word acquiring means can arrange words in the group in the order of the predetermined arrangement of the phonetic characters at the head of the words, and can acquire more rapid words. Here, the predetermined arrangement order of the phonetic characters at the beginning of the word is, for example, alphabetical order if the word is pinyin, 50-sentence if hiragana, and the like.
[0307] また、本発明に係る文字変換装置は、上記構成に加え、上記変換文字は漢字であ り、上記変換文字対応情報では、漢字の先頭の表音文字が同一、かつ、声調コード が同一の、上記変換文字識別情報である漢字識別情報がグループとしてまとめられ ていてもよい。  Further, in the character conversion device according to the present invention, in addition to the above configuration, the converted character is a kanji character, and in the converted character correspondence information, the phonetic character at the beginning of the kanji character is the same and the tone code is The same kanji identification information, which is the above-mentioned converted character identification information, may be grouped as a group.
[0308] 上記構成によると、先頭変換文字取得手段は、グループ内で求めた!/、先頭変換文 字につ!、て検索すればよ!、ので、応答速度が速くなる。  [0308] According to the above configuration, the start conversion character acquisition means increases the response speed because! /, It is necessary to search for the start conversion character! / Obtained in the group.
[0309] また、中国語変換装置として用いられる上記文字変換装置の、上記変換文字対応 情報では、各上記グループで、上記漢字識別情報が、漢字の先頭の表音文字の所 定の配列順に並べてられて 、てもよ 、。 [0309] Further, the above-mentioned converted character correspondence of the above-mentioned character conversion device used as a Chinese-language conversion device In the information, in each of the above groups, the kanji identification information is arranged in a predetermined arrangement order of the top phonetic characters of the kanji.
[0310] 漢字の先頭の表音文字の所定の配列順に並べてあるので、先頭変換文字取得手 段は、より素早く先頭変換文字を取得することができる。  [0310] Since the leading phonetic characters of the kanji are arranged in a predetermined arrangement order, the leading converted character acquisition unit can acquire the leading converted character more quickly.
[0311] また、中国語変換装置として用いられる上記いずれかの文字変換装置の、前記変 換文字インデックスでは、上記漢字の先頭の表音文字が所定の配列順となっており 、各上記グループの上記変換文字対応情報での格納先を示す情報を上記変換文 字格納情報として、上記漢字の先頭の表音文字に対応付けられて!/ヽてもよ ヽ。  Further, in any one of the character conversion devices used as a Chinese language conversion device, in the conversion character index, the first phonetic characters of the kanji are arranged in a predetermined arrangement order, and Information indicating the storage destination in the converted character correspondence information may be associated with the first phonetic character of the kanji as the converted character storage information, and may be! / !.
[0312] 上記構成によると、変換文字インデックスでは、漢字の先頭の表音文字が所定の配 列順となっており、各上記グループの変換文字対応情報での格納先を示す情報を 上記変換文字格納情報として、漢字の先頭の表音文字に対応付けられている。よつ て、先頭変換文字取得手段は、グループの変換文字対応情報での格納先から、グ ループを絞って、より応答速度を速めて、検索することができる。  According to the above configuration, in the converted character index, the first phonetic characters of the kanji are arranged in a predetermined arrangement order, and the information indicating the storage destination in the converted character correspondence information of each of the above groups is the converted character As the stored information, it is associated with the first phonetic character of the kanji. Therefore, the start conversion character acquisition means can narrow down the group from the storage destination in the conversion character correspondence information of the group, and can make a search with a faster response speed.
[0313] ここで、中国語変換装置として用いられる上記何れかの文字変換装置では、上記 漢字は簡体字であり、かつ、上記表音はピンインで表現されていてもよい。  Here, in any of the above character conversion devices used as a Chinese language conversion device, the kanji characters may be simplified characters and the phonetic sounds may be expressed by pinyin.
[0314] 上記構成によると、ユーザが 1文字以上のピンイン列を入力すると、ピンイン列の先 頭文字から先頭漢字の候補を絞り込み、先頭漢字の候補を基に検索対象の単語を 制限できるので、辞書検索処理に力かる時間を短縮できる。それゆえ、応答速度の 速い中国語入力装置を提供できる。また、入力したい文字列の先頭文字のピンイン だけを入力しても連続して 2文字目以降のピンインを入力しても所望の文字列を入力 することができる文字変換装置を提供できる。また一方では、入力したい文字列の先 頭文字のピンインが不完全であり、 2文字目以降のピンインを入力しても所望の文字 列を入力することができる中国語入力装置を提供できる  According to the above configuration, when the user inputs a pinyin string of one or more characters, it is possible to narrow down the leading kanji candidates from the first letter of the pinyin string and restrict the search target word based on the leading kanji candidate. The time spent on the dictionary search process can be shortened. Therefore, it is possible to provide a Chinese input device with high response speed. In addition, it is possible to provide a character conversion device capable of inputting a desired character string even if pinyin of the first character of the character string to be input is input and pinyin of the second and subsequent characters is continuously input. On the other hand, it is possible to provide a Chinese input device in which the pinyin of the first letter of the character string to be input is incomplete and a desired character string can be input even if pinyin of the second and subsequent characters is input.
あるいは、中国語変換装置として用いられる上記何れかの文字変換装置では、上 記漢字は繁体字であり、かつ、上記表音は注音で表現されていてもよい。  Alternatively, in any of the above character conversion devices used as a Chinese language conversion device, the above kanji characters may be traditional characters, and the above-mentioned phonetic sounds may be expressed by a gush sound.
[0315] 上記構成によると、ユーザが 1文字以上の注音文字列を入力すると、注音文字列の 先頭文字から先頭漢字の候補を絞り込み、先頭漢字の候補を基に検索対象の単語 を制限できるので、辞書検索処理に力かる時間を短縮できる。それゆえ、応答速度の 速い中国語入力装置を提供できる。また、入力したい文字列の先頭文字の注音文字 だけを入力しても、連続して 2文字目以降の注音文字を入力しても、所望の文字列を 入力することができる中国語入力装置を提供することができる。他方で、入力したい 文字列の先頭文字の注音が不完全であり、 2文字目以降の注音を入力しても所望の 文字列を入力することができる中国語入力装置を提供できる。 [0315] According to the above configuration, when the user inputs a vignette character string of one or more characters, it is possible to narrow down the leading kanji candidates from the first character of the vignette character string and restrict the search target word based on the leading kanji candidate. The time spent on dictionary search processing can be shortened. Therefore, the response speed A fast Chinese input device can be provided. Also, a Chinese input device that can input a desired character string even if only the first character of the first character of the character string to be input is input or the second and subsequent characters are continuously input. Can be provided. On the other hand, it is possible to provide a Chinese input device in which the sound injection of the first character of the character string to be input is incomplete and the desired character string can be input even if the sound injection of the second and subsequent characters is input.
[0316] また、本発明に係る文字変換装置の制御方法は、入力された 1以上の表音文字よ りなる入力文字列を変換文字に変換し、当該変換文字力 なる単語を決定すること ができる文字変換装置の制御方法において、(a— 1)変換文字を識別するための変 換文字識別情報と、該変換文字の表音を示す変換文字表音情報とが対応付けられ た変換文字対応情報および、(a— 2)入力される表音文字と、該表音文字を上記変 換文字表音情報の先頭として持つ上記変換文字識別情報の、上記変換文字対応情 報での格納先を示す変換文字格納先情報とが対応付けられた変換文字インデックス 、を有する変換文字辞書と、(b— 1)単語を識別するための単語識別情報と、該単語 の表音を示す単語表音情報とが対応付けられた単語対応情報および、 (b 2)単語 の先頭となる変換文字を識別するための先頭変換文字識別情報と、該先頭変換文 字識別情報が示す変換文字を先頭に持つ単語の、上記単語対応情報での格納先 を示す単語格納先情報とが対応付けられた単語インデックス、を有する単語辞書と、 を用いて、 Further, the control method of the character conversion device according to the present invention can convert an input character string consisting of one or more input phonetic characters into a converted character and determine a word having the converted character strength. In the control method of the character conversion device that can be used, (a-1) conversion character correspondence in which conversion character identification information for identifying a conversion character and conversion character phonetic information indicating the phonetic character of the conversion character are associated Information, (a-2) a storage destination of the converted character identification information having the phonogram input and the converted character identification information having the phonogram as the head of the converted character phonetic information, A converted character dictionary having a converted character index associated with the indicated converted character storage destination information, (b-1) word identification information for identifying a word, and word sound information indicating the phonetic sound of the word Word correspondence information in which and are associated, and (b 2) the point of the word Head conversion character identification information for identifying a head conversion character; word storage destination information indicating a storage destination in the word correspondence information of a word having a conversion character at the head indicated by the head conversion character identification information; A word dictionary having a word index associated with, using
入力された上記入力文字列の先頭の表音文字を基に、上記変換文字辞書を参照 し、該入力文字列の先頭の表音文字を上記変換文字表音情報の先頭として持つ変 換文字識別情報を取得する先頭変換文字取得ステップと、  Based on the first phonetic character of the input character string input, the conversion character identification is made by referring to the conversion character dictionary and having the first phonetic character of the input character string as the first character of the converted character phonetic information. A leading conversion character acquisition step for acquiring information;
取得された上記先頭変換文字識別情報を基に、上記単語辞書を参照し、該先頭 変換文字識別情報が示す変換文字を先頭に持つ単語を取得する単語取得ステップ と、  A word acquiring step of referring to the word dictionary based on the acquired head converted character identification information and acquiring a word having the converted character indicated by the head converted character identification information at the head;
を含む処理を行う。  Perform processing including
[0317] 上記方法によると、上記文字変換装置と同様に、単語の検索処理に係る時間を短 縮することができる。  [0317] According to the above method, as in the above character conversion device, it is possible to shorten the time involved in the word search process.
[0318] また、本発明に係る文字変換装置は、コンピュータによって実現してもよぐこの場 合には、コンピュータを上記何れかの文字変換装置における上記各手段として動作 させることにより上記文字変換装置をコンピュータにて実現させる文字変換プログラム 、及びその文字変換プログラムを記録したコンピュータ読み取り可能な記録媒体も、 本発明の範疇に入る。 Also, the character conversion device according to the present invention may be realized by a computer. In addition, a character conversion program that causes the computer to realize the character conversion device by operating the computer as any of the means in any of the character conversion devices, and a computer readable recording medium storing the character conversion program Even falls within the scope of the present invention.
[0319] これらの構成によれば、文字変換プログラムを、コンピュータに読み取り実行させる ことによって、上記文字変換装置と同一の作用効果を実現することができる。  According to these configurations, by causing the computer to read and execute the character conversion program, it is possible to realize the same function and effect as the above-described character conversion device.
[0320] 〔補足事項〕  [Supplementary matter]
本発明に係る文字変換装置は、入力された表音文字または表音文字列を対応す る表意文字に変換して、ユーザが選択する表意文字または表意文字列を入力する 文字変換装置において、上記表音文字と、該表音文字を含む表音文字列に対応す る表意文字とを関連付けて記録する文字記録部と、ユーザにより入力された表音文 字を受け付けて、該表音文字に関連付けられた表意文字を、表意文字の候補として 上記文字記録部から抽出する文字候補抽出手段と、上記候補抽出手段が抽出した 候補の表示順序を、所定の条件に基づき決定する候補順序決定手段と、上記候補 順序決定手段が決定した表示順序に基づいて候補を表示するようデータを出力する 表示制御手段とを備えて 、ることを特徴として 、る。  A character conversion device according to the present invention is a character conversion device for converting an input phonetic character or phonetic character string into a corresponding ideographic character and inputting an ideographic character or ideographic character string selected by the user. A character recording unit for recording a phonogram character and an ideographic character corresponding to a phonogram string including the phonogram character, and receiving the phonogram character input by the user, the phonogram character is selected as the phonogram character. Character candidate extraction means for extracting associated ideographic characters from the character recording unit as candidates for ideographic characters, Candidate order determination means for determining the display order of candidates extracted by the candidate extraction means based on a predetermined condition And display control means for outputting data so as to display the candidates based on the display order determined by the candidate order determination means.
[0321] 上記構成によれば、文字候補抽出手段は、ユーザが入力した表音文字を受け付け て、その表音文字を含む、表音文字列を特定する。表音文字列は、表意文字と対応 している。表音文字と表音文字列との対応関係、および、表音文字列と表意文字との 対応関係は、文字記録部に記録されている。したがって、文字候補抽出手段は、ュ 一ザ力 入力を受け付けた表音文字に基づいて、ユーザが入力したいと所望する表 意文字の候補を上記文字記録部力 抽出することができる。  According to the above configuration, the character candidate extraction unit receives the phonetic characters input by the user, and identifies a phonetic character string including the phonetic characters. The phonetic strings correspond to ideographic characters. The correspondence between the phonetic characters and the phonetic character strings, and the correspondence between the phonetic character strings and the ideographic characters are recorded in the character recording unit. Therefore, the character candidate extraction means can extract the candidate of the desired alphabetic character that the user desires to input based on the phonetic characters for which the user's input has been received.
[0322] 続、て、候補順序決定手段は、上記文字候補抽出手段が抽出した表意文字の候 補を所定の条件に基づいて並べ替え、候補の表示順序を決定する。表示制御手段 は、表意文字の候補が、上記候補順序決定手段が決定した表示順序で表示される よう候補のデータを出力する。  Subsequently, the candidate order determination means rearranges the candidates of the ideographic character extracted by the character candidate extraction means on the basis of a predetermined condition, and determines the display order of the candidates. The display control means outputs candidate data so that the ideographic character candidates are displayed in the display order determined by the candidate order determination means.
[0323] これにより、ユーザは、表示された候補の中から所望の表意文字を指定するのみで 、該表意文字を入力することができる。 [0324] 以上のことから、ユーザは、表音文字の入力に係る操作数について、より少ない操 作数で、所望の表意文字の入力を得ることが可能となる。 [0323] Thereby, the user can input the ideographic character only by designating the desired ideographic character from the displayed candidates. From the above, the user can obtain desired ideographic character input with a smaller number of operations regarding the number of operations related to the input of phonetic characters.
[0325] より具体的には、例えば、表音文字がアルファベットであり、表意文字が中国語の 漢字であるとする。すなわち、アルファベット文字列は、ピンインであり、 1つのピンィ ンには 1または複数の漢字が対応している。上記文字記録部には、アルファベット ピンイン、ピンイン—漢字の対応関係が記録されている。 More specifically, for example, it is assumed that phonetic characters are alphabets and ideographic characters are Chinese kanji. That is, the alphabet string is pinyin, and one pin corresponds to one or more kanji. In the character recording section, correspondences between alphabetic pinyin and pinyin-kanji are recorded.
[0326] したがって、アルファベットが 1文字入力されると、文字候補抽出手段は、文字記録 部を参照して、当該アルファベットから始まるピンインを特定し、特定したピンインに対 応する 1または複数の漢字を候補として抽出する。 Therefore, when one alphabetic character is input, the character candidate extraction unit refers to the character recording unit, identifies pinyin starting from the alphabet, and identifies one or more kanji corresponding to the identified pinyin. Extract as a candidate.
[0327] 上記候補として抽出された漢字は、ユーザに提示される。ユーザは、提示された候 補力も所望の漢字を指定するたけでよい。つまり、アルファベットを入力したのち、そ れに対応するピンインを選択する操作を必要とすることなぐ所望の漢字を指定して 得ることが可能となる。 The kanji extracted as the above candidate is presented to the user. The user may also specify the desired kanji for the presented candidate power. That is, after inputting the alphabet, it becomes possible to specify and obtain a desired kanji that does not require an operation to select the corresponding pinyin.
[0328] 以上のことから、所望の表意文字 (漢字)を得るための、表音文字 (ピンイン)の入力 操作を簡素化することが可能な文字変換装置を提供することができる。  From the above, it is possible to provide a character conversion device capable of simplifying input operation of phonetic characters (pinyin) for obtaining desired ideographic characters (kanji characters).
[0329] なお、中国語の発音には四声があるために、 1つのピンイン表記に対して 4種類の 発音が存在する。この四声を区別する情報を加えて、対応する漢字を記録してもよい 。例えば、上記文字記録部に、四声の区別を格納するためのフィールドを用意し、ュ 一ザが入力する四声の区別に基づいて、対応する漢字をさらに絞り込めば、上記文 字候補抽出手段の抽出処理を効率よく行うことができる。  [0329] It should be noted that there are four types of pronunciation for one pinyin notation because there are four voices in Chinese pronunciation. Information to distinguish the four voices may be added and the corresponding kanji may be recorded. For example, a field for storing the distinction of four voices is prepared in the above-mentioned character recording unit, and if the corresponding kanji is further narrowed based on the distinction of the four voices inputted by the user, the above character candidate extraction The extraction process of the means can be performed efficiently.
[0330] あるいは、例えば、表音文字が注音の字母であり、表意文字が繁体字であってもよ い。すなわち、表音文字列は注音であり、 1つの注音には 1または複数の繁体字が対 応している。上記文字記録部には、注音の字母 注音、注音 繁体字の対応関係 が記録されていればよい。  [0330] Alternatively, for example, the phonogram may be a punctuation character and the ideographic character may be a traditional ideographic character. That is, the phonetic character string is a gushing sound, and one or more traditional characters correspond to one gushing sound. In the above-mentioned character recording section, the correspondence relationship between the vowels of vowels and the traditional vignettes of the vowels should be recorded.
[0331] 上記文字変換装置は、さらに、上記表意文字と、該表意文字を含む表意文字列と を関連付けて記録する文字列記録部と、ユーザにより選択された表意文字を受け付 けて、該表意文字に関連付けられた表意文字列を、表意文字列の候補として上記文 字列記録部から抽出する文字列候補抽出手段とを備えていてもよい。 [0332] あるいは、本発明の文字変換装置は、入力された表音文字または表音文字列を、 対応する表意文字に変換して、ユーザが選択する表意文字を入力する文字変換装 置において、上記表意文字と、該表意文字を含む表意文字列とを関連付けて記録 する文字列記録部と、ユーザにより選択された表意文字を受け付けて、該表意文字 に関連付けられて 、る表意文字列を、表意文字列の候補として上記文字列記録部 から抽出する文字列候補抽出手段と、上記文字列候補抽出手段が抽出した候補の 表示順序を、所定の条件に基づき決定する候補順序決定手段と、上記候補順序決 定手段が決定した表示順序に基づいて候補を表示するようデータを出力する表示制 御手段とを備えて 、ることを特徴として 、る。 The character conversion apparatus further receives a character string recording unit that records the ideographic character and the ideographic character string including the ideographic character in association with each other, and the ideographic character selected by the user. Character string candidate extraction means for extracting an ideographic character string associated with an ideographic character from the character string recording unit as an ideographic character string candidate may be provided. Alternatively, the character conversion device of the present invention converts the input phonetic character or phonetic character string into the corresponding ideographic character, and inputs the ideographic character selected by the user. A character string recording unit for recording the ideographic character in association with an ideographic character string including the ideographic character, and an ideographic character selected by the user are accepted, and the ideographic character string associated with the ideographic character is Character string candidate extraction means for extracting from the character string recording unit as candidates for figurative character strings, candidate order determination means for determining the display order of candidates extracted by the character string candidate extraction means based on a predetermined condition, and And display control means for outputting data so as to display candidates based on the display order determined by the candidate order determination means.
[0333] 上記構成によれば、文字列記録部には、表意文字 (例えば、漢字)と該表意文字を 含む表意文字列 (例えば、当該漢字を先頭にした熟語、など)との対応関係が記録さ れている。したがって、上記文字列候補抽出手段は、表意文字がユーザにより選択さ れたのに応じて、該表意文字に関連付けられた表意文字列を、候補として上記文字 列記録部から抽出する。  According to the above configuration, in the character string recording portion, the correspondence between an ideographic character (for example, kanji characters) and an ideographic character string including the ideographic characters (for example, an idiomatic headed by the kanji character, etc.) is obtained. It is recorded. Therefore, the character string candidate extraction unit extracts an ideographic character string associated with the ideographic character as a candidate from the character string recording unit in response to the selection of the ideographic character by the user.
[0334] 例えば、中国語の例を用いて説明する。ある漢字が入力されると、その漢字の次に 続く文字 (ユーザが所望する漢字)は、何の脈絡もない漢字よりも、その漢字力も始ま る熟語を構成する漢字である可能性が高い。したがって、入力が確定された漢字か ら始まる熟語を候補として表示すれば、ユーザは、その熟語の候補の中力 所望の 熟語を選択するのみで、別途ピンインを入力することなぐ中国語を入力することが可 能となる。  For example, description will be made using a Chinese example. When a kanji is input, the character following the kanji (the kanji desired by the user) is more likely to be a kanji constituting a idiom in which the kanji power also starts than a kanji without any context. Therefore, if a idiom that starts from a kanji whose input has been confirmed is displayed as a candidate, the user only inputs Chinese language without separately entering pinyin simply by selecting the desired idiom of the idiom of the candidate of the idiom. It will be possible.
[0335] これにより、ユーザは、表音文字の入力に係る操作数について、さらに少ない操作 数で、所望の表意文字の入力を得ることが可能となる。  [0335] This enables the user to obtain desired ideographic character input with a smaller number of operations relating to the input of the phonetic characters.
[0336] さらに、上記文字変換装置は、ユーザにより筆記された表意文字を入力する筆記 文字入力部をさらに備え、上記筆記文字入力部に入力された表意文字を認識し、該 表意文字の情報を上記文字列候補抽出手段に供給する文字認識手段を備えている ことが好ましい。 Furthermore, the character conversion apparatus further includes a writing character input unit for inputting ideographic characters written by the user, recognizes the ideographic characters input to the writing character input unit, and obtains information of the ideographic characters. It is preferable that a character recognition unit supplied to the character string candidate extraction unit is provided.
[0337] これにより、所望の表意文字に対応する表音文字 (列)が明瞭でない場合でも、所 望の表意文字の入力を得ることが可能となる。 [0338] さらに、上記文字変換装置の文字候補抽出手段は、ユーザにより入力された表音 文字を受け付けると、連続して受け付けた表音文字列に対応する表意文字を、候補 として抽出することが好ま 、。 [0337] This makes it possible to obtain the desired ideographic character input even when the phonetic characters (strings) corresponding to the desired ideographic character are not clear. [0338] Furthermore, the character candidate extraction means of the character conversion device, when receiving the phonetic characters input by the user, may extract the ideographic character corresponding to the continuously received phonetic character string as a candidate. Preferred.
[0339] 上記構成によれば、表音文字がユーザによって入力されるたびに、特定するべき 表音文字列の候補数が絞られる。よって、文字候補抽出手段は、対応する表意文字 の候補数を大幅に絞り込んで抽出することができ、表示制御手段はその絞り込まれ た表意文字の候補を出力することができる。  [0339] According to the above configuration, each time a phonetic character is input by the user, the number of phonetic character string candidates to be specified is reduced. Therefore, the character candidate extraction means can narrow down and extract the number of corresponding ideographic character candidates significantly, and the display control means can output the narrowed-down ideographic character candidates.
[0340] 以上のことから、表音文字がユーザによって入力されるごとに、表示された表意文 字の候補が、ユーザが所望する表意文字を含んでいる確率が上がるよう、文字変換 装置を構成することができる。結果として、所望の表意文字を得るための、表音文字 の入力操作を簡素化することが可能な文字変換装置を提供することができる。  From the above, the character conversion device is configured such that the probability that the displayed ideographic character candidate contains the ideographic character desired by the user is increased each time the phonetic character is input by the user. can do. As a result, it is possible to provide a character conversion device capable of simplifying input operations of phonetic characters to obtain desired ideographic characters.
[0341] さらに、上記文字変換装置の上記候補順序決定手段は、上記文字列候補抽出手 段により抽出された表意文字列の候補が、上記文字候補抽出手段により抽出された 表意文字の候補よりも優先して表示されるよう表示順序を決定することが好ましい。  Furthermore, the candidate order determination means of the character conversion device is configured such that the candidate of the ideographic character string extracted by the character string candidate extraction means is more than the candidate of the ideographic character extracted by the character candidate extraction means. It is preferable to determine the display order so as to be displayed preferentially.
[0342] 上記構成によれば、文字候補抽出手段が文字記録部より抽出した表意文字の候 補より、文字列候補抽出手段が文字列記録部より抽出した表意文字列の候補の方 が先に表示されるよう、候補順序決定手段が候補の表示順序を決定する。  According to the above configuration, the candidate ideographic character string extracted by the character string candidate extraction unit from the character string recording unit is preceded by the candidate ideographic character extracted by the character candidate extraction unit from the character recording unit. The candidate order determining means determines the display order of the candidates to be displayed.
[0343] 例えば、日本語や中国語の漢字入力において、何の脈絡もない漢字よりも、すでに 入力されて 、る漢字に続 、て熟語 (表意文字列)を構成するような漢字が続く (ユー ザが所望する)可能性が高い。したがって、例えば、入力が確定された漢字から始ま る熟語を候補として表示すれば、ユーザは、漢字 (表意文字)の候補から表示された 場合に比べて、より早ぐ少ない操作数で所望の漢字を入力することが可能となる。  [0343] For example, in kanji input in Japanese or Chinese, kanji that is already input and follows kanji and that forms idiom (idea character string) follow rather than kanji without any context ( It is highly likely that the user wants. Therefore, for example, if a idiom that starts from a kanji whose input has been confirmed is displayed as a candidate, the user can perform a desired kanji with a smaller number of operations faster than when displayed from a kanji (ideographic character) candidate. It becomes possible to input
[0344] 上記文字変換装置は、さらに、ユーザにより選択された表意文字を学習文字として 学習語記録部に記録する学習制御手段を備え、上記候補順序決定手段は、上記文 字候補抽出手段が抽出した候補が上記学習語記録部に記録された学習文字である 場合に、該候補が優先して表示されるよう表示順序を決定してもよ ヽ。  The character conversion apparatus further includes a learning control unit that records the ideographic character selected by the user as a learning character in the learning word recording unit, and the candidate order determining unit is extracted by the character candidate extraction unit. If the selected candidate is a learning character recorded in the learning word recording unit, the display order may be determined such that the candidate is displayed with priority.
[0345] あるいは、ユーザにより選択された表意文字列を学習文字列として学習語記録部 に記録する学習制御手段を備え、上記候補順序決定手段は、上記文字列候補抽出 手段が抽出した候補が上記学習語記録部に記録された学習文字列である場合に、 該候補が優先して表示されるよう表示順序を決定してもよい。 [0345] Alternatively, a learning control means for recording the ideographic character string selected by the user as a learning character string in the learning word recording unit is provided, and the candidate order determination means extracts the character string candidate When the candidate extracted by the means is a learning character string recorded in the learning word recording unit, the display order may be determined so that the candidate is displayed with priority.
[0346] 上記構成によれば、学習制御手段は、文字変換装置において入力が確定された( ユーザが所望した)表意文字または表意文字列を学習文字 (列)として学習語記録 部に記録させる。  [0346] According to the above configuration, the learning control means causes the learning word recording unit to record the ideographic character or ideographic character string whose input has been determined (desired by the user) in the character conversion device as a learning character (string).
[0347] そして、上記候補抽出手段が候補を文字 (列)記録部より抽出したのち、候補順序 決定手段は、その抽出された候補のうち、上記学習語記録部に記録されているもの を学習文字として判定することができる。候補が学習文字であるということは、すなわ ち、ユーザが過去に所望した表意文字 (列)であって、学習文字でない候補よりも、ュ 一ザに再び所望される可能性が高 、ことを意味する。  Then, after the candidate extraction unit extracts the candidates from the character (string) recording unit, the candidate order determining unit learns, of the extracted candidates, those recorded in the learning word recording unit. It can be judged as a character. That the candidate is a learning character means that the ideographic character (string) desired by the user in the past is more likely to be desired by the user than the candidate who is not a learning character. Means
[0348] 以上のことから、学習文字である候補の表示順序を上位にすることにより、ユーザが より早ぐ少ない操作数で所望の漢字を入力することが可能な文字変換装置を実現 することができる。  From the above, it is possible to realize a character conversion device capable of inputting a desired kanji character by a user with a smaller number of operations faster by increasing the display order of candidates that are learning characters. it can.
[0349] あるいは、上記文字変換装置は、さらに、ユーザにより選択された表意文字を、該 表意文字の直前に入力されている直前文字または直前文字列と関連付けて、ペア 学習文字として学習語記録部に記録する学習制御手段を備え、上記候補順序決定 手段は、上記文字候補抽出手段が抽出した候補が、直前文字または直前文字列と 関連付けられて上記学習語記録部に記録されたペア学習文字である場合に、該候 補が優先して表示されるよう表示順序を決定してもよい。  [0349] Alternatively, the character conversion device further associates the ideographic character selected by the user with the immediately preceding character or the immediately preceding character string input immediately before the ideographic character, and sets the learning word recording unit as a pair learning character The candidate order determination means is a pair learning character recorded in the learning word recording section, with the candidate extracted by the character candidate extraction means being associated with the immediately preceding character or the immediately preceding character string. In some cases, the display order may be determined such that the candidates are displayed in priority.
[0350] もしくは、ユーザにより選択された表意文字列を、該表意文字列の直前に入力され て!、る直前文字または直前文字列と関連付けて、ペア学習文字列として学習語記録 部に記録する学習制御手段を備え、上記候補順序決定手段は、上記文字列候補抽 出手段が抽出した候補が、直前文字または直前文字列と関連付けられて上記学習 語記録部に記録されたペア学習文字列である場合に、該候補が優先して表示される よう表示順序を決定してもよ 、。  [0350] Alternatively, the ideographic string selected by the user is input immediately before the ideographic string! And a learning control means for recording in the learning word recording section as a pair learning character string in association with the immediately preceding character or the immediately preceding character string, and the candidate order determining means is a candidate extracted by the character string candidate extraction means. When the pair learning character string is recorded in the learning word recording unit in association with the immediately preceding character or the immediately preceding character string, the display order may be determined such that the candidate is displayed with priority.
[0351] 上記構成によれば、学習制御手段は、文字変換装置において入力が確定された( ユーザが所望した)表意文字または表意文字列を学習文字 (列)として学習語記録 部に記録させる際、その学習文字 (列)の直前にすでに入力が確定して 、る文字 (列 ) (以下、直前文字 (列)と称す)と関連付けて、ペア学習文字 (列)として記録させる。 According to the above configuration, the learning control means causes the learning word recording unit to record the ideographic character or ideographic character string whose input has been determined (desired by the user) in the character conversion device as a learning character (string). , The input character is already determined immediately before the learning character (string) ) (Hereafter, it is referred to as the immediately preceding character (string)) and recorded as a pair learning character (string).
[0352] 上記ペア学習文字 (列)は、上記関連付けられた直前文字 (列)に続 、て、ユーザ に所望された履歴があるということになる。したがって、上記直前文字 (列)が入力さ れた場合には、上記ペア学習文字 (列)がユーザに所望される可能性が高い。  [0352] The paired learning character (string) follows the associated immediately preceding character (string), which means that there is a history desired by the user. Therefore, when the preceding character (string) is input, the pair learning character (string) is likely to be desired by the user.
[0353] 上記候補順序決定手段は、候補抽出手段が抽出した候補が、ペア学習文字 (列) である場合には、その候補の表示順序を上位に変更する。  [0353] When the candidate extracted by the candidate extraction means is a pair learning character (string), the candidate order determination means changes the display order of the candidate to higher rank.
[0354] これにより、ユーザがより早ぐ少ない操作数で所望の漢字を入力することが可能な 文字変換装置を実現することができる。  [0354] This makes it possible to realize a character conversion device that allows the user to input a desired kanji character with a smaller number of operations faster.
[0355] さらに、上記文字変換装置の上記学習制御手段は、学習語としての、上記学習文 字または学習文字列、もしくは、上記ペア学習文字またはペア学習文字列を分類す るためのグループ識別子を生成し、該グループ識別子を各学習語に関連付けて上 記学習語記録部に記録し、上記グループ識別子は、上記学習語を構成する表意文 字にあら力じめ割り当てられた、所定の長さの文字コードに基づいて生成されること が好ましい。  [0355] Furthermore, the learning control means of the character conversion device may use the learning character or learning character string as a learning word, or a group identifier for classifying the pair learning character or pair learning character string. The group identifier is generated in association with each learning word and recorded in the above-described learning word recording unit, and the group identifier is assigned a predetermined length previously assigned to the ideographic letters constituting the learning word. Preferably, it is generated based on the character code of
[0356] 上記構成によれば、学習制御手段は、学習語記録部に記録される学習語 (学習文 字または学習文字列、もしくは、ペア学習文字またはペア学習文字列)に対して、そ れを複数のグループに分類するためのグループ識別子を生成して割り当てる。学習 制御手段は、上記学習語を構成する各表意文字にあらかじめ割り当てられた、文字 コードを基に求められる数値を用いて上記グループ識別子を生成する。  [0356] According to the above configuration, the learning control means determines the learning word (learning character or learning character string, or pair learning character or pair learning character string) recorded in the learning word recording unit. Generate and assign a group identifier to classify into a plurality of groups. The learning control means generates the group identifier using a numerical value determined based on the character code, which is assigned in advance to each ideographic character constituting the learning word.
[0357] このようにして生成されたグループ識別子は、どのような学習語に割り当てるもので あっても固定長となる。すなわち、具体例を挙げると、 2文字の熟語からなる学習語で あっても、 6文字の熟語からなる学習語であっても、それらには同サイズのグループ 識別子が割り当てられる。  [0357] The group identifier generated in this way has a fixed length no matter what learning word is assigned. That is, as a specific example, even if it is a learning word consisting of a two-character idiom or a learning word consisting of a six-character idiom, group identifiers of the same size are assigned to them.
[0358] 上記学習語記録部に記録される学習語の数が、膨大な量になればなるほど、また 、それが熟語の場合、文字数が多くなればなるほど、データサイズが大きくなる。つま り、候補が学習語である力否かを判定するために上記学習語記録部から合致する学 習語を検索するのに膨大な時間がかかり、文字入力の処理効率が低下するという問 題が発生する。 [0359] しかし、上述したとおり、各学習文字に固定長のグループ識別子を割り当てれば、 検索にお 、て、入力文字 (列)と学習語が一致あるいは同一グループの文字 (列)で あるかを判定する時間は、学習語がどのような文字数力もなつていようと関係なく一律 となる。以上のことから、学習語の増加に対して検索に力かる時間が比例的に増加す ることを抑制することができ、処理効率の低下への影響が小さくすることができる。 The data size increases as the number of learning words recorded in the learning word recording section increases, and in the case of idiom, as the number of characters increases. That is, it takes a great deal of time to search for matching learning words from the learning word recording section to determine whether the candidate is a learning word or not, and the processing efficiency of character input decreases. Occurs. However, as described above, if a fixed-length group identifier is assigned to each learning character, whether the input character (string) and the learning word match or are characters of the same group (string) in the search The time to determine the score is uniform regardless of the number of characters in the learning word. From the above, it is possible to suppress the proportional increase in the time spent on the search with respect to the increase in the number of learning words, and the influence on the decrease in processing efficiency can be reduced.
[0360] 上記文字変換装置は、さらに、ユーザにより選択された表意文字または表意文字 列と、該表意文字または表意文字列より前に入力が確定している前方文字または前 方文字列との関連度合いを記録する関連度記録部を備え、上記候補順序決定手段 は、上記候補が、上記前方文字または前方文字列との関連度が高いものほど優先し て表示されるよう表示順序を決定してもよ 、。  [0360] The above-mentioned character conversion device further associates the ideographic character or ideographic character string selected by the user with the forward character or the forward character string whose input is determined prior to the ideographic character or ideographic character string. The candidate order determination unit determines the display order so that the candidate is displayed with higher precedence as the degree of association with the front character or the front character string is higher. Well,
[0361] 上記構成によれば、候補順序決定手段は、前方文字 (列)との関連度合いに基づ いて、候補の表示順序を決定する。上記関連度合いとは、上記文字入力確定手段 が入力を確定した表意文字または表意文字列と、該表意文字または表意文字列より 前に入力が確定されている前方文字または前方文字列との関連性のことである。  According to the above configuration, the candidate order determination means determines the display order of the candidates based on the degree of association with the forward character (string). The degree of association refers to the ideographic character or ideographic character string whose input is determined by the character input determination means and the forward character or character string whose input is determined before the ideographic character or ideographic character string. It is
[0362] より具体的には、例えば、「上海交通大学」という熟語が頻繁に入力される場合につ いて説明する。「交通」に続いて「規則」などの別の熟語が連続して入力される可能性 が高い場合であっても、それよりも前方に「上海」がある場合は、候補順序決定手段 は、「上海」と「大学」との関連性を考慮して、「大学」を候補の上位に変更する。  More specifically, for example, the case where the idiom “Shanghai Jiaotong University” is frequently input will be described. Even if there is a high possibility that another idiom such as “rules” will be continuously input after “traffic”, if “Shanghai” is ahead of it, the candidate order determination means Considering the relationship between "Shanghai" and "University", change "University" to the top of the candidate.
[0363] これにより、候補順序決定手段は、直前の文字 (列)との連続性のみならず、それよ りも前方の文字 (列)との関連性を考慮して、候補となる表意文字 (列)の表示順序を 決定するので、よりユーザの希望を反映した順序で候補を提示することが可能となる  Thus, the candidate order determining means can consider candidate ideographic characters in consideration of not only continuity with the immediately preceding character (string) but also with the preceding character (string). Since the display order of (rows) is determined, it is possible to present the candidates in an order that reflects the user's wishes more
[0364] 結果として、ユーザは、表音文字の入力に係る操作数にっ 、て、より少な 、操作数 で、所望の表意文字の入力を得ることが可能となる。 [0364] As a result, the user can obtain desired ideographic character input with less number of operations relating to input of phonetic characters.
[0365] さらに、上記文字変換装置は、上記候補順序決定手段が候補の表示順序を決定 する際に参照する所定の条件であって、ユーザにより指定された条件を設定する候 補順序決定要素設定手段を備え、上記候補順序決定手段は、上記候補順序決定 要素設定手段が設定した条件に基づき、候補の表示順序を決定することが好まし 、 [0366] 上記構成によれば、上記候補順序決定手段が候補の表示順序を決定する際に参 照する所定の条件が複数ある場合に、どのような条件に基づいて表示順序を決定す るのかを、ユーザの指示に応じて、候補順序決定要素設定手段が設定する。 Furthermore, the character conversion device sets a candidate order determination element for setting a condition designated by the user, which is a predetermined condition to be referred to when the candidate order determination means determines the display order of the candidates. The candidate order determination means preferably determines the display order of the candidates based on the conditions set by the candidate order determination element setting means, According to the above configuration, when there are a plurality of predetermined conditions to be referred to when the candidate order determination means determines the display order of candidates, based on which conditions the display order is determined Is set by the candidate order determining element setting means in accordance with the user's instruction.
[0367] 上記所定の条件の例としてはいくつか考えられる。例えば、上述したとおり、  There are several conceivable examples of the above-mentioned predetermined conditions. For example, as mentioned above
(1)候補が文字 (漢字)であるか、文字列 (熟語)であるか、(2)候補が学習文字であ る力否力 (3)候補がペア学習文字である力否力、(4)いつの時期に学習されたの 力 (5)—般的にどのくらい頻繁に使用されるの力、(6)現在読みとして入力された 表音文字長は、候補の表音文字長と一致しているか否か (例えば、「技術」を入力し た!、場合に、「ぎじ (2文字)」と「ぎじゅつ (4文字)」とは表音文字長が不一致である) 、(7)直前文字列との連続性はある力、(8)前方文字列との関連性はある力、(9)当 該ユーザによってどのくらいの頻度で使用されているの力 など、さまざまな条件が考 えられる。  (1) Whether the candidate is a character (Kanji) or a character string (Idiom) or (2) the ability is a learning character candidate (3) the ability is a paired learning character 4) Force when learned (5)-Generally how often it is used, (6) Phonetic character length entered as current reading matches the candidate phonetic character length (For example, “Technology” has been entered !, and in the case where “Giji (2 letters)” and “Gijutsu (4 letters)” have different phonetic character lengths), (7) Continuity with a string is a force, (8) a relationship with a forward string is a force, (9) how often it is used by the user, and various conditions can be considered. .
[0368] ここで、どのような条件に基づ 、て (あるいは、どのような条件を重要視して)候補の 表示順序を決定するのかということを、ユーザが任意に設定できれば、候補順序決 定手段が決定する候補の表示順序は、ユーザの希望をより正確に反映したものとな る。  [0368] Here, if the user can arbitrarily set the display order of candidates based on what conditions (or what conditions are considered important), candidate order determination The display order of the candidates determined by the defining means more accurately reflects the user's wishes.
[0369] 結果として、ユーザは、表音文字の入力に係る操作数にっ 、て、より少な 、操作数 で、所望の表意文字の入力を得ることが可能となる。  As a result, the user can obtain the desired ideographic character input with a smaller number of operations relating to the input of the phonetic characters.
[0370] 本発明に係る複写機は、上述の文字変換装置と、各表音文字に対応するキーを備 えてユーザにより押下されたキーに対応する表音文字を上記文字変換装置に供給 する第 1入力部と、ユーザが選択した候補を上記文字変換装置に入力する第 2入力 部と、上記文字候補抽出手段または上記文字列候補抽出手段が抽出する候補を、 上記候補順序決定手段が定めた表示順序にしたがって表示する表示部とを備え、 上記第 1入力部および第 2入力部と、上記表示部とは、互いに一体に形成されたタツ チパネルまたはタブレットであることを特徴としている。 The copying machine according to the present invention is provided with the above-described character conversion device and a key corresponding to each phonetic character, and supplies, to the character conversion device, a phonetic character corresponding to a key pressed by the user. The candidate order determination means defines one input part, a second input part for inputting the candidate selected by the user to the character conversion device, and a candidate to be extracted by the character candidate extraction means or the character string candidate extraction means. And a display unit configured to display in accordance with the display order, wherein the first input unit and the second input unit, and the display unit are touch panels or tablets integrally formed with each other.
[0371] これにより、ユーザにより選択された表音文字や表意文字を上記文字変換装置に 入力するための第 1および第 2入力部と、候補としての表意文字 (列)をユーザに提 示するための表示部とを一体に備えたタブレットを有する複写機において、ユーザは 、表音文字の入力に係る操作数について、より少ない操作数で、所望の表意文字の 入力を得ることが可能となる。 Thus, the user is provided with the first and second input units for inputting phonetic characters and ideographic characters selected by the user into the character conversion device, and ideographic characters (strings) as candidates to the user. In a copying machine having a tablet integrally provided with a display unit for indicating, the user can obtain desired ideographic character input with a smaller number of operations relating to the input of phonetic characters. It becomes.
[0372] 上記複写機は、ユーザにより筆記された表意文字を入力する筆記文字入力部をさ らに備え、上記筆記文字入力部に入力された表意文字を認識し、該表意文字の情 報を上記文字変換装置に供給する文字認識手段を備えて 、ることが好ま 、。  The copying machine further includes a writing character input unit for inputting ideographic characters written by the user, recognizes the ideographic characters input to the writing character input unit, and detects information on the ideographic characters. It is preferable to include character recognition means for supplying the above character conversion device.
[0373] これにより、所望の表意文字に対応する表音文字 (列)が明瞭でない場合でも、所 望の表意文字の入力を得ることが可能な複写機を実現することが可能となる。  This makes it possible to realize a copying machine capable of obtaining desired ideographic character input even when the phonetic characters (strings) corresponding to the desired ideographic character are not clear.
[0374] 本発明に係る文字入力方法は、入力された表音文字または表音文字列を対応す る表意文字に変換して、ユーザが選択する表意文字または表意文字列を入力する 文字変換装置における文字入力方法において、ユーザにより入力された表音文字を 受け付けて、該表音文字に関連付けられている表意文字を、上記表音文字と、該表 音文字を含む表音文字列に対応する表意文字とを関連付けて記録する文字記録部 から、表意文字の候補として抽出する第 1ステップと、上記第 1ステップにて抽出した 候補の表示順序を、所定の条件に基づき決定する第 2ステップと、上記第 2ステップ にて決定した表示順序に基づいて候補を表示するようデータを出力するする第 3ステ ップとを含むことを特徴として 、る。  The character input method according to the present invention converts an input phonetic character or phonetic character string into a corresponding ideographic character, and inputs an ideographic character or ideographic character string selected by the user. In the character input method according to the present invention, the ideographic character input by the user is accepted, and the ideographic character associated with the phonogram character corresponds to the phonogram character and the phonogram character string including the phonogram character. A first step of extracting an ideographic character candidate from the character recording unit which records the ideographic character in association with the ideographic character; and a second step of determining the display order of the candidates extracted in the first step based on a predetermined condition. And a third step of outputting data so as to display candidates based on the display order determined in the second step.
[0375] 上記方法によれば、第 1ステップにて、ユーザ入力した表音文字を含む表音文字 列と対応する表意文字の候補が抽出され、第 2ステップにて、抽出された候補の表示 順序が決定される。このとき、表示順序は、例えば、上述したような所定の条件に基 づいて決定されればよい。そして、第 3ステップにて、決定された表示順序に基づい て表意文字の候補が出力される。  [0375] According to the above method, in the first step, ideographic character candidates corresponding to the phonetic character string including the phonetic characters input by the user are extracted, and in the second step, display of the extracted candidates is performed The order is determined. At this time, the display order may be determined based on, for example, the predetermined conditions as described above. Then, in the third step, ideographic character candidates are output based on the determined display order.
[0376] これにより、ユーザにより上記候補の中から所望の表意文字が指定されると、その 指定された表意文字の入力が確定される。  [0376] Thus, when a desired ideographic character is designated from the above candidates by the user, the input of the designated ideographic character is confirmed.
[0377] 以上のことから、所望の表意文字 (漢字)を得るための、表音文字 (ピンイン)の入力 操作を簡素化することが可能な文字変換装置を提供することができる。  From the above, it is possible to provide a character conversion device capable of simplifying input operation of phonetic characters (pinyin) for obtaining desired ideographic characters (kanji characters).
[0378] なお、上記文字変換装置および複写機は、コンピュータによって実現してもよぐこ の場合には、コンピュータを上記各手段として動作させることにより上記文字変換装 置または複写機をコンピュータにて実現させる制御プログラム、およびそれを記録し たコンピュータ読み取り可能な記録媒体も、本発明の範疇に入る。 [0378] If the character conversion device and the copying machine are realized by a computer, the character conversion device may be operated by operating the computer as the respective means. A control program that causes a computer or a copying machine to be realized by a computer, and a computer-readable recording medium recording the same also fall within the scope of the present invention.
[0379] 〔補足事項 2〕  [Supplementary matter 2]
本発明に係る文字変換装置は、入力された表音文字または表音文字列を対応す る表意文字に変換して、ユーザが選択する表意文字または表意文字列を入力する 文字変換装置において、上記表音文字と、該表音文字を含む表音文字列に対応す る表意文字とを関連付けて記録する文字記録部と、ユーザにより入力された表音文 字を受け付けて、該表音文字に関連付けられた表意文字を、表意文字の候補として 上記文字記録部から抽出する文字候補抽出手段と、上記候補抽出手段が抽出した 候補の表示順序を決定する候補順序決定手段と、上記候補順序決定手段が決定し た表示順序に基づいて候補を表示するようデータを出力する表示制御手段とを備え 、過去に入力された表意文字または表意文字列と、該表意文字または表意文字列 の直前よりさらに前方に入力された前方文字または前方文字列とを対応付けて、上 記前方文字または前方文字列の後方に該表意文字または表意文字列が入力される 可能性の高さを示す関連度が関連度記録部に記憶されており、上記候補順序決定 手段は、上記関連度記録部から、上記候補の前方に入力されている前方文字また は前方文字列と、各候補との関連度を検出し、該関連度が高い候補ほど優先して表 示されるよう表示順序を決定することを特徴として 、る。  A character conversion device according to the present invention is a character conversion device for converting an input phonetic character or phonetic character string into a corresponding ideographic character and inputting an ideographic character or ideographic character string selected by the user. A character recording unit for recording a phonogram character and an ideographic character corresponding to a phonogram string including the phonogram character, and receiving the phonogram character input by the user, the phonogram character is selected as the phonogram character. Character candidate extraction means for extracting the associated ideographic character from the character recording unit as an ideographic character candidate, candidate order determination means for determining the display order of the candidates extracted by the candidate extraction means, and the candidate order determination means And display control means for outputting data so as to display candidates based on the display order determined by the ideographic character, the ideographic character or ideographic character string input in the past, and the ideographic character or the ideographic character string Furthermore, in correspondence with the forward character or forward character string input further forward, the degree of association indicating the height of the possibility that the ideographic character or ideographic string is input behind the forward character or forward character string is The candidate order determination means detects the degree of association between each candidate and the forward character or string that has been input ahead of the candidate from the relevance degree recording unit. And the display order is determined such that the candidate with the higher degree of association is displayed with priority.
[0380] あるいは、本発明の文字変換装置は、入力された表音文字または表音文字列を対 応する表意文字に変換して、ユーザが選択する表意文字を入力する文字変換装置 において、上記表意文字と、該表意文字を含む表意文字列とを関連付けて記録する 文字列記録部と、ユーザにより選択された表意文字を受け付けて、該表意文字に関 連付けられて!/、る表意文字列を、表意文字列の候補として上記文字列記録部から抽 出する文字列候補抽出手段と、上記文字列候補抽出手段が抽出した候補の表示順 序を決定する候補順序決定手段と、上記候補順序決定手段が決定した表示順序に 基づいて候補を表示するようデータを出力する表示制御手段とを備え、過去に入力 された表意文字または表意文字列と、該表意文字または表意文字列の直前よりさら に前方に入力された前方文字または前方文字列とを対応付けて、上記前方文字ま たは前方文字列の後方に該表意文字または表意文字列が入力される可能性の高さ を示す関連度が関連度記録部に記憶されており、上記候補順序決定手段は、上記 関連度記録部から、上記候補の前方に入力されて!、る前方文字または前方文字列 と、各候補との関連度を検出し、該関連度が高い候補ほど優先して表示されるよう表 示順序を決定することを特徴として 、る。 [0380] Alternatively, the character conversion device of the present invention converts the input phonetic characters or phonetic character strings into corresponding ideographic characters, and inputs the ideographic characters selected by the user. A character string recording unit that associates and records an ideographic character with an ideographic character string including the ideographic character, and receives an ideographic character selected by the user, and is associated with the ideographic character! /, The ideographic character A character string candidate extraction unit that extracts a string from the character string recording unit as an ideographic character string candidate, a candidate order determination unit that determines the display order of the candidates extracted by the character string candidate extraction unit, the candidate And a display control means for outputting data to display candidates based on the display order determined by the order determination means, and an ideographic character or ideographic character string input in the past, and the ideographic character or ideographic character string from immediately before Et al in in association with the front character or forward string entered forward, the front character or Or a degree of association indicating the degree to which the ideogram or the ideographic character is likely to be input is stored in the degree-of-relevance recording unit after the forward character string, and the candidate order determining unit The unit detects the degree of association between each candidate and the forward character or forward character string input to the front of the above candidate, and the display order is prioritized so that the candidate with the higher degree of association is displayed with priority. It is characterized by deciding.
[0381] これにより、ユーザは、表音文字の入力に係る操作数について、さらに少ない操作 数で、所望の表意文字の入力を得ることが可能となる。  Thus, the user can obtain desired ideographic character input with a smaller number of operations relating to the input of the phonetic characters.
[0382] さらに、過去に入力された表意文字または表意文字列と、該表意文字または表意 文字列の直前に入力された直前文字または直前文字列とを対応付けて、上記直前 文字または直前文字列の直後に該表意文字または表意文字列が入力される可能性 の高さを示す連続度が連続度記録部に記憶されており、上記候補順序決定手段は 、上記前方文字または前方文字列との関連度、および、上記直前文字または直前文 字列との連続度が高い候補ほど優先して表示されるよう表示順序を決定することが 好ましい。 Furthermore, the ideographic character or ideographic character string input in the past is associated with the ideographic character or the immediately preceding character or character string immediately prior to the ideographic character string, and the immediately preceding character or the immediately preceding character string is associated. The degree of continuity indicating the height of the possibility that the ideogram or ideographic character string is input is stored in the continuity recording unit immediately after the It is preferable to determine the display order so that candidates with higher relevance and higher continuity with the preceding character or preceding character string are displayed preferentially.
[0383] さらに、上記文字変換装置の文字候補抽出手段は、ユーザにより入力された表音 文字を受け付ける度に、該表音文字をすでに受け付けた表音文字列の末尾に連結 し、該連結によって得られた表音文字列に対応する表意文字を候補として抽出する ことが好ましい。  Furthermore, each time the character candidate extraction unit of the character conversion device receives a phonetic character input by the user, the character candidate extraction unit concatenates the phonetic character to the end of the already-received phonetic character string, Preferably, ideographic characters corresponding to the obtained phonetic character string are extracted as candidates.
[0384] さらに、上記文字変換装置の上記学習制御手段は、上記学習語記録部に記録す る対象となる学習語としての、上記学習文字または学習文字列、もしくは、上記ペア 学習文字またはペア学習文字列を分類するためのグループ識別子を生成し、該グ ループ識別子を各学習語に関連付けて上記学習語記録部に記録し、上記グループ 識別子は、上記学習語を構成する表意文字にあらかじめ割り当てられた、所定の長 さの文字コードに基づ 、て生成されることが好ま 、。  Furthermore, the learning control means of the character conversion device is the learning character or learning character string as a learning word to be recorded in the learning word recording unit, or the pair learning character or pair learning A group identifier for classifying a character string is generated, the group identifier is associated with each learning word, and is recorded in the learning word recording unit, and the group identifier is assigned in advance to ideographic characters constituting the learning word. Preferably, it is generated based on the character code of a predetermined length.
[0385] さらに、上記文字変換装置は、上記候補順序決定手段が候補の表示順序を決定 する際に参照する所定の条件である、上記候補がユーザに選択される可能性の大き さを示す評価値の算出に使用可能な複数種類の要素から、ユーザにより指定された 要素を設定する候補順序決定要素設定手段を備え、上記各候補に対応付けて上記 関連度とは異なる要素が要素記録部に記憶されており、上記候補順序決定手段は、 上記関連度と、上記候補順序決定要素設定手段によって設定された上記要素記録 部に記憶されている要素とに基づき、候補の表示順序を決定することが好ましい。 Furthermore, in the character conversion device, an evaluation indicating the magnitude of the possibility that the candidate is selected by the user is a predetermined condition to be referred to when the candidate order determination means determines the display order of the candidates. A candidate order determining element setting unit configured to set an element designated by the user from a plurality of types of elements that can be used for calculating the value; An element different from the degree of association is stored in the element recording unit, and the candidate order determination unit determines the degree of association, and an element stored in the element storage unit set by the candidate order determination element setting unit. It is preferable to determine the display order of the candidates based on
[0386] 本発明に係る文字入力方法は、入力された表音文字または表音文字列を対応す る表意文字に変換して、ユーザが選択する表意文字または表意文字列を入力する 文字変換装置における文字入力方法において、上記文字変換装置は、文字候補抽 出手段と、候補順序決定手段と、表示制御手段とを備えており、上記文字候補抽出 手段が、ユーザにより入力された表音文字を受け付けて、該表音文字に関連付けら れている表意文字を、上記表音文字と、該表音文字を含む表音文字列に対応する 表意文字とを関連付けて記録する文字記録部から、表意文字の候補として抽出する 第 1ステップと、上記候補順序決定手段が、上記第 1ステップにて抽出した候補の表 示順序を決定する第 2ステップと、上記表示制御手段が、上記第 2ステップにて決定 した表示順序に基づいて候補を表示するようデータを出力する第 3ステップとを含み 、上記第 2ステップは、過去に入力された表意文字または表意文字列と、該表意文 字または表意文字列の直前よりさらに前方に入力された前方文字または前方文字列 とを対応付けて、上記前方文字または前方文字列の後方に該表意文字または表意 文字列が入力される可能性の高さを示す関連度を記憶する関連度記録部から、上 記候補の前方に入力されている前方文字または前方文字列と、各候補との関連度を 検出し、該関連度が高い候補ほど優先して表示されるよう表示順序を決定するステツ プを含むことを特徴として ヽる。  The character input method according to the present invention converts an input phonetic character or phonetic character string into a corresponding ideographic character, and inputs an ideographic character or ideographic character string selected by the user. In the character input method according to the present invention, the character conversion device includes character candidate extraction means, candidate order determination means, and display control means, and the character candidate extraction means uses phonetic characters input by the user. From the character recording unit, which receives and records the ideographic character associated with the phonogram character in association with the phonogram character and the ideographic character corresponding to the phonogram character string including the phonogram character, The first step of extracting as a character candidate, the second step of determining the display order of the candidates extracted in the first step by the candidate order determining means, and the second step of the display control means Decision A third step of outputting data to display candidates based on the display order, the second step including an ideographic character or ideographic character string input in the past, and the ideographic character or ideographic character string Relevance indicating the height of the possibility that the ideogram or ideographic character string will be input after the front character or the front character string in association with the front character or the front character string input further to the front than immediately before The degree of association between the preceding character or the front character string input ahead of the above candidate and each candidate is detected from the degree of association recording section that stores the It is characterized as including the step of determining the display order.
[0387] 以上のことから、所望の表意文字 (漢字)を得るための、表音文字 (ピンイン)の入力 操作を簡素化することが可能な文字変換装置を提供することができる。  From the above, it is possible to provide a character conversion device capable of simplifying input operation of phonetic characters (pinyin) for obtaining desired ideographic characters (kanji characters).
[0388] 本発明に係る文字入力方法は、入力された表音文字または表音文字列を対応す る表意文字に変換して、ユーザが選択する表意文字を入力する文字変換装置にお ける文字入力方法において、上記文字変換装置は、文字候補抽出手段と、候補順 序決定手段と、表示制御手段とを備えており、上記文字候補抽出手段が、ユーザに より選択された表意文字を受け付けて、該表意文字に関連付けられている表意文字 列を、上記表意文字と該表意文字を含む表意文字列とを関連付けて記録する文字 列記録部から、表意文字列の候補として抽出する第 1ステップと、上記候補順序決定 手段が、上記第 1ステップにて抽出した候補の表示順序を決定する第 2ステップと、 上記表示制御手段が、上記第 2ステップにて決定した表示順序に基づ 、て候補を表 示するようデータを出力する第 3ステップとを含み、上記第 2ステップは、過去に入力 された表意文字または表意文字列と、該表意文字または表意文字列の直前よりさら に前方に入力された前方文字または前方文字列とを対応付けて、上記前方文字ま たは前方文字列の後方に該表意文字または表意文字列が入力される可能性の高さ を示す関連度を記憶する関連度記録部から、上記候補の前方に入力されている前 方文字または前方文字列と、各候補との関連度を検出し、該関連度が高い候補ほど 優先して表示されるよう表示順序を決定するステップを含むことを特徴としている。 According to the character input method of the present invention, a character in a character conversion device for converting an input phonetic character or phonetic character string into a corresponding ideographic character and inputting an ideographic character to be selected by the user In the input method, the character conversion device includes character candidate extraction means, candidate order determination means, and display control means, and the character candidate extraction means receives an ideographic character selected by the user. A character that records the ideographic character string associated with the ideographic character in association with the ideographic character and the ideographic character string including the ideographic character A first step of extracting a candidate for an ideographic character string from the column recording unit; a second step of determining the display order of the candidates extracted in the first step by the candidate order determination means; and the display control means , And the third step of outputting data to display candidates based on the display order determined in the second step, and the second step includes the ideographic character or ideographic character string input in the past. The ideographic character or ideographic character string is associated with the ideographic character or the forward character string that is input ahead of the ideographic character or ideographic character string, The relevance degree recording unit that stores the degree of relevance that indicates the likelihood of being input, detects the degree of relevance between each candidate and the front character or forward character string input to the front of the candidate, Candidate with high degree of association It is characterized by comprising the step of determining the display order to appear in throat priority.
[0389] 本発明は上述した各実施形態、各実施例に限定されるものではなぐ請求項に示 した範囲で種々の変更が可能である。すなわち、請求項に示した範囲で適宜変更し た技術的手段を組み合わせて得られる実施形態についても本発明の技術的範囲に 含まれる。 The present invention can be variously modified within the scope shown in the claims which are not limited to the above-described embodiments and examples. That is, embodiments obtained by combining technical means appropriately modified within the scope of the claims are also included in the technical scope of the present invention.
産業上の利用可能性  Industrial applicability
[0390] 本発明は、読み (表音文字)を変換する、特に読みを漢字に変換する各種装置に 適用することができる。 [0390] The present invention can be applied to various devices for converting readings (phonetic characters), in particular, converting readings into kanji.

Claims

請求の範囲 The scope of the claims
[1] 入力された 1以上の表音文字よりなる入力文字列を変換文字に変換し、当該変換 文字力もなる単語を決定できる文字変換装置において、  [1] A character conversion device capable of converting an input character string consisting of one or more inputted phonetic characters into a converted character and determining a word also having the converted character strength,
変換文字を識別するための変換文字識別情報と、該変換文字の表音を示す変換 文字表音情報とが対応付けられた変換文字対応情報、および、入力される表音文字 と、該表音文字を上記変換文字表音情報の先頭として持つ上記変換文字識別情報 の、上記変換文字対応情報での格納先を示す変換文字格納先情報とが対応付けら れた変換文字インデックス、を有する変換文字辞書と、  Converted character correspondence information in which converted character identification information for identifying converted characters and converted character phonetic information indicating the phonetic characters of the converted characters are associated, input phonetic characters, and the phonetic characters A converted character index in which the converted character identification information having a character as the head of the converted character phonetic information is associated with the converted character storage destination information indicating the storage destination in the converted character correspondence information. With the dictionary
単語を識別するための単語識別情報と、該単語の表音を示す単語表音情報とが 対応付けられた単語対応情報、および、単語の先頭となる変換文字を識別するため の先頭変換文字識別情報と、該先頭変換文字識別情報が示す変換文字を先頭に 持つ単語の、上記単語対応情報での格納先を示す単語格納先情報とが対応付けら れた単語インデックス、を有する単語辞書と、  Word correspondence information in which word identification information for identifying a word is associated with word phonetic information indicating the phonetic sound of the word, and head converted character identification for identifying a converted character to be the head of the word A word dictionary having a word index in which information and a word storage destination information indicating a storage destination in the word correspondence information of a word having a conversion character at the head indicated by the head conversion character identification information are associated;
入力された上記入力文字列の先頭の表音文字を基に、上記変換文字辞書を参 照し、該入力文字列の先頭の表音文字を上記変換文字表音情報の先頭として持つ 変換文字識別情報を取得する先頭変換文字取得手段と、  The converted character dictionary is referred to based on the first phonetic character of the input character string input, and the first phonetic character of the input character string is included as the first character of the converted character phonetic information. Head conversion character acquisition means for acquiring information;
取得された上記先頭変換文字識別情報を基に、上記単語辞書を参照し、該先頭 変換文字識別情報が示す変換文字を先頭に持つ単語を取得する単語取得手段と、 を備えていることを特徴とする、文字変換装置。  Word acquisition means for acquiring a word having a conversion character at the head indicated by the head conversion character identification information by referring to the word dictionary based on the acquired head conversion character identification information; To, character conversion device.
[2] 上記先頭変換文字取得手段が取得した先頭変換文字識別情報が示す先頭変換 文字を、ユーザが選択可能に表示部に表示する表示制御手段と、  [2] Display control means for displaying on the display section the head conversion character indicated by the head conversion character identification information acquired by the head conversion character acquisition means, in a selectable manner by the user.
上記ユーザにより選択された先頭変換文字を受け付ける受付手段と、を備え、 上記単語取得手段は、上記選択された先頭変換文字の先頭変換文字識別情報に 基づ ヽて、単語を取得する処理を行うことを特徴とする請求項 1に記載の文字変換 装置。  Receiving means for receiving the head conversion character selected by the user, and the word acquiring means performs processing for acquiring a word based on the head conversion character identification information of the selected head conversion character. The character conversion device according to claim 1, characterized in that:
[3] 上記変換文字対応情報では、さらに、上記変換文字識別情報に、該変換文字識 別情報が示す変換文字についての表示補助情報が対応付けられており、  [3] In the converted character correspondence information, display auxiliary information on a converted character indicated by the converted character identification information is further associated with the converted character identification information,
上記表示制御手段は、上記表示補助情報に基づき、先頭変換文字を並べ替えて 、ユーザが選択可能に表示部に表示することを特徴とする請求項 2に記載の文字変 換装置。 The display control means rearranges the head conversion characters based on the display auxiliary information. The character conversion device according to claim 2, wherein the character conversion device is displayed on the display unit so as to be selectable by the user.
[4] 上記単語辞書の単語インデックスは、  [4] The word index of the above word dictionary is
単語の先頭となる先頭変換文字毎に、上記先頭変換文字識別情報と、単語の 2文 字目の先頭の表音文字である 2文字目先頭表音文字との組み合わせに対して、上 記単語対応情報における格納先が割り振られていることを特徴とする請求項 1〜3の 何れか 1項に記載の文字変換装置。  For each combination of the first converted character identification information and the second phonetic first phonetic character which is the first phonetic letter of the second word of the word, The storage destination in corresponding | compatible information is allocated, The character conversion apparatus of any one of the Claims 1-3 characterized by the above-mentioned.
[5] 上記変換文字対応情報では、さらに、上記変換文字識別情報に、該変換文字識 別情報が示す変換文字についての取得補助情報が対応付けられており、 [5] In the converted character correspondence information, the converted character identification information is further associated with acquisition auxiliary information on the converted character indicated by the converted character identification information,
上記先頭変換文字取得手段は、上記取得補助情報を基に上記変換文字識別情 報を取得することを特徴とする請求項 1〜4の何れか 1項に記載の文字変換装置。  The character conversion device according to any one of claims 1 to 4, wherein said leading conversion character acquisition means acquires said conversion character identification information based on said acquisition auxiliary information.
[6] 上記取得補助情報は、以下の 4つの情報に分類されることを特徴とする請求項 5に 記載の文字変換装置。 [6] The character conversion device according to claim 5, wherein the acquisition auxiliary information is classified into the following four pieces of information.
(1)変換文字識別情報が示す変換文字が、上記入力文字列に対して、無条件に取 得対象となるという情報  (1) Information that the converted character indicated by the converted character identification information is unconditionally obtained for the above input character string
(2)変換文字識別情報が示す変換文字が、上記入力文字列が 1文字では、取得対 象外になるという情報  (2) Information that the converted character indicated by the converted character identification information is excluded from acquisition if the input character string is one character.
(3)変換文字識別情報が示す変換文字が、上記入力文字列が 2文字以下では、取 得対象外になるという情報  (3) Information that the converted character indicated by the converted character identification information is not subject to acquisition if the input character string is two characters or less.
(4)変換文字識別情報が示す変換文字が、上記入力文字列が 3文字以下では、取 得対象外になるとういう情報  (4) Information that the conversion character indicated by the conversion character identification information will not be acquired if the input character string is 3 characters or less.
[7] 上記単語対応情報では、単語の先頭の変換文字が同一のものが 1グループとして まとめられていることを特徴とする請求項 1〜6の何れ力 1項に記載の文字変換装置  [7] The character conversion device according to any one of claims 1 to 6, characterized in that in the word correspondence information, the same conversion characters at the head of a word are grouped as one group.
[8] さらに、各上記グループ内で、単語は、単語の先頭の表音文字の所定の配列順に 並べられていることを特徴とする請求項 7に記載の文字変換装置。 [8] The character conversion device according to claim 7, wherein the words are arranged in a predetermined arrangement order of phonetic characters at the beginning of the words in each of the groups.
[9] 上記変換文字は漢字であり、 [9] The above conversion characters are kanji,
上記変換文字対応情報では、漢字の先頭の表音文字が同一、かつ、声調コードが 同一の、上記変換文字識別情報である漢字識別情報がグループとしてまとめられて おり、 In the converted character correspondence information, the phonetic characters at the beginning of the kanji are the same, and the tone code is The same kanji identification information which is the above-mentioned converted character identification information is grouped as a group,
中国語変換装置として用いられることを特徴とする請求項 1〜8の何れ力 1項に記載 の文字変換装置。  The character conversion device according to any one of claims 1 to 8, wherein the character conversion device is used as a Chinese language conversion device.
[10] 上記変換文字対応情報では、各上記グループで、上記漢字識別情報が、漢字の 先頭の表音文字の所定の配列順に並べてられていることを特徴とする請求項 9に記 載の文字変換装置。  [10] The character according to claim 9, wherein, in the converted character correspondence information, the kanji identification information is arranged in a predetermined arrangement order of the top phonetic characters of the kanji in each of the groups. Converter.
[11] 前記変換文字インデックスでは、上記漢字の先頭の表音文字が所定の配列順とな つており、各上記グループの上記変換文字対応情報での格納先を示す情報を上記 変換文字格納情報として、上記漢字の先頭の表音文字に対応付けられて ヽることを 特徴とする請求項 9または 10に記載の文字変換装置。  [11] In the converted character index, the top phonetic characters of the kanji are arranged in a predetermined arrangement order, and information indicating the storage destination of the converted character correspondence information of each group is used as the converted character storage information. 11. The character conversion device according to claim 9, wherein the character conversion device is associated with the first phonetic character of the kanji character.
[12] 上記漢字は簡体字であり、かつ、上記表音はピンインで表現される、中国語変換装 置として用いられることを特徴とする請求項 9〜11の何れ力 1項に記載の文字変換装 置。  [12] The character conversion according to any one of claims 9 to 11, characterized in that the kanji characters are simplified characters and the phonetic characters are expressed by pinyin and used as a Chinese language conversion device. Equipment.
[13] 上記漢字は繁体字であり、かつ、上記表音は注音で表現される、中国語変換装置 として用いられることを特徴とする請求項 9〜11の何れ力 1項に記載の文字変換装置  [13] The character conversion as set forth in any one of claims 9 to 11, characterized in that the kanji character is a traditional character and the phonetic sound is expressed by a sound injection, which is used as a Chinese language conversion device. Device
[14] 入力された 1以上の表音文字よりなる入力文字列を変換文字に変換し、当該変換 文字力 なる単語を決定することができる文字変換装置の制御方法において、 変換文字を識別するための変換文字識別情報と、該変換文字の表音を示す変換 文字表音情報とが対応付けられた変換文字対応情報、および、入力される表音文字 と、該表音文字を上記変換文字表音情報の先頭として持つ上記変換文字識別情報 の、上記変換文字対応情報での格納先を示す変換文字格納先情報とが対応付けら れた変換文字インデックス、を有する変換文字辞書と、 [14] In a control method of a character conversion device capable of converting an input character string consisting of one or more inputted phonetic characters into a converted character and determining the word of the converted character, to identify the converted character Conversion character correspondence information in which conversion character identification information of the conversion character conversion information indicating the phonetic character of the conversion character is associated, a phonetic character to be input, and the conversion character table of the phonetic characters A conversion character dictionary having a conversion character index associated with conversion character storage destination information indicating a storage destination in the conversion character correspondence information of the conversion character identification information as a head of sound information;
単語を識別するための単語識別情報と、該単語の表音を示す単語表音情報とが 対応付けられた単語対応情報、および、単語の先頭となる変換文字を識別するため の先頭変換文字識別情報と、該先頭変換文字識別情報が示す変換文字を先頭に 持つ単語の、上記単語対応情報での格納先を示す単語格納先情報とが対応付けら れた単語インデックス、を有する単語辞書と、 Word correspondence information in which word identification information for identifying a word is associated with word phonetic information indicating the phonetic sound of the word, and head converted character identification for identifying a converted character to be the head of the word The information is associated with word storage destination information indicating a storage destination in the word correspondence information of a word having a conversion character at the beginning indicated by the head conversion character identification information. A word dictionary, having a selected word index,
を用いて、  Using,
入力された上記入力文字列の先頭の表音文字を基に、上記変換文字辞書を参 照し、該入力文字列の先頭の表音文字を上記変換文字表音情報の先頭として持つ 変換文字識別情報を取得する先頭変換文字取得ステップと、  The converted character dictionary is referred to based on the first phonetic character of the input character string input, and the first phonetic character of the input character string is included as the first character of the converted character phonetic information. A leading conversion character acquisition step for acquiring information;
取得された上記先頭変換文字識別情報を基に、上記単語辞書を参照し、該先頭 変換文字識別情報が示す変換文字を先頭に持つ単語を取得する単語取得ステップ と、  A word acquiring step of referring to the word dictionary based on the acquired head converted character identification information and acquiring a word having the converted character indicated by the head converted character identification information at the head;
を含む処理を行うことを特徴とする、文字変換装置の制御方法。  A control method of a character conversion device, characterized by performing processing including.
[15] 請求項 1〜13のいずれか 1項に記載の文字変換装置の上記各手段としてコンビュ ータを機能させるための文字変換プログラム。  [15] A character conversion program for causing a computer to function as the above respective means of the character conversion device according to any one of claims 1 to 13.
[16] 請求項 15に記載の制御プログラムを記録した、コンピュータ読取可能な記録媒体。 [16] A computer readable recording medium recording the control program according to claim 15.
PCT/JP2007/064209 2006-08-10 2007-07-18 Character converting device and character converting device control method WO2008018274A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2008528762A JP4890551B2 (en) 2006-08-10 2007-07-18 Character conversion device and method for controlling character conversion device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2006218970 2006-08-10
JP2006-218970 2006-08-10

Publications (1)

Publication Number Publication Date
WO2008018274A1 true WO2008018274A1 (en) 2008-02-14

Family

ID=39032808

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2007/064209 WO2008018274A1 (en) 2006-08-10 2007-07-18 Character converting device and character converting device control method

Country Status (3)

Country Link
JP (1) JP4890551B2 (en)
CN (1) CN101553810A (en)
WO (1) WO2008018274A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2015528968A (en) * 2012-07-30 2015-10-01 マイクロソフト コーポレーション Generating string prediction using context
WO2022038809A1 (en) * 2020-08-18 2022-02-24 株式会社日立製作所 Information expression creation assistance device, information expression creation assistance method, and computer program
US20230266875A1 (en) * 2020-08-31 2023-08-24 Kiyoshi Kasatani Display apparatus, input method, and program

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6491438B2 (en) * 2014-08-29 2019-03-27 株式会社日立社会情報サービス Migration support device
CN105404677B (en) * 2015-11-20 2018-12-18 四川神琥科技有限公司 A kind of search method based on tree structure
CN105426490B (en) * 2015-11-20 2019-03-26 四川神琥科技有限公司 A kind of indexing means based on tree structure

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH04225415A (en) * 1990-12-26 1992-08-14 Mitsubishi Electric Corp Pinyin kanji input system
JPH0540946U (en) * 1991-10-29 1993-06-01 カシオ計算機株式会社 Word processor
JP2004005379A (en) * 2002-04-16 2004-01-08 Fujitsu Ltd Information processor, information processing method, information processing program and recording medium
JP2006050160A (en) * 2004-08-03 2006-02-16 Sharp Corp Device, program and recording medium for inputting chinese language
JP2006155322A (en) * 2004-11-30 2006-06-15 Nec Fielding Ltd Input prediction dictionary system and input prediction method
JP2007188449A (en) * 2006-01-16 2007-07-26 Sharp Corp Character input device, copying machine including the same, character input method, control program, and recording medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0388662A (en) * 1989-08-31 1991-04-15 Toshiba Seiki Kk Form connecting device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH04225415A (en) * 1990-12-26 1992-08-14 Mitsubishi Electric Corp Pinyin kanji input system
JPH0540946U (en) * 1991-10-29 1993-06-01 カシオ計算機株式会社 Word processor
JP2004005379A (en) * 2002-04-16 2004-01-08 Fujitsu Ltd Information processor, information processing method, information processing program and recording medium
JP2006050160A (en) * 2004-08-03 2006-02-16 Sharp Corp Device, program and recording medium for inputting chinese language
JP2006155322A (en) * 2004-11-30 2006-06-15 Nec Fielding Ltd Input prediction dictionary system and input prediction method
JP2007188449A (en) * 2006-01-16 2007-07-26 Sharp Corp Character input device, copying machine including the same, character input method, control program, and recording medium

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2015528968A (en) * 2012-07-30 2015-10-01 マイクロソフト コーポレーション Generating string prediction using context
WO2022038809A1 (en) * 2020-08-18 2022-02-24 株式会社日立製作所 Information expression creation assistance device, information expression creation assistance method, and computer program
US20230266875A1 (en) * 2020-08-31 2023-08-24 Kiyoshi Kasatani Display apparatus, input method, and program

Also Published As

Publication number Publication date
JPWO2008018274A1 (en) 2009-12-24
CN101553810A (en) 2009-10-07
JP4890551B2 (en) 2012-03-07

Similar Documents

Publication Publication Date Title
JP4413868B2 (en) Character input device, copier equipped with character input device, character input method, control program, and recording medium
JP4829901B2 (en) Method and apparatus for confirming manually entered indeterminate text input using speech input
TWI293455B (en) System and method for disambiguating phonetic input
CN102298582B (en) Data search and matching process and system
CN100555203C (en) Revise the system and method for input characters
US20070100619A1 (en) Key usage and text marking in the context of a combined predictive text and speech recognition system
US8977535B2 (en) Transliterating methods between character-based and phonetic symbol-based writing systems
US8199112B2 (en) Character input device
WO2008134208A1 (en) Recognition architecture for generating asian characters
TW200538969A (en) Handwriting and voice input with automatic correction
CN1248333A (en) Reduced keyboard disambiguating system
US20120296647A1 (en) Information processing apparatus
WO2008018274A1 (en) Character converting device and character converting device control method
TWI313425B (en) Method, system, and computer readable storage medium for processing user entry of an ideographic language phrase
CN101520693A (en) Method and system for rapidly inputting bulk information
US9171234B2 (en) Method of learning a context of a segment of text, and associated handheld electronic device
JPH08314910A (en) Different kind of code character string transcribing device and electronic dictionary
JP2015040908A (en) Information processing apparatus, information update program, and information update method
US20130073572A1 (en) Method of learning character segments from received text, and associated handheld electronic device
JP2013214187A (en) Character input device, method for controlling character input device, control program and recording medium
JP5751537B2 (en) International Japanese input system
JP5141130B2 (en) Electronic device and program having dictionary function
JP3803253B2 (en) Method and apparatus for Kanji input
KR100910302B1 (en) Apparatus and method for searching information based on multimodal
JP2000090193A (en) Character recognition device and item classifying method

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200780037603.7

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07790963

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2008528762

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

NENP Non-entry into the national phase

Ref country code: RU

122 Ep: pct application non-entry in european phase

Ref document number: 07790963

Country of ref document: EP

Kind code of ref document: A1