WO2008018274A1 - Character conversion device and control method of character conversion device - Google Patents

Character conversion device and control method of character conversion device

Info

Publication number
WO2008018274A1
Authority
WO
WIPO (PCT)
Prior art keywords
character
word
kanji
input
information
Prior art date
Application number
PCT/JP2007/064209
Other languages
English (en)
Japanese (ja)
Inventor
Hitoshi Hirose
Mitsuhiro Hakaridani
Original Assignee
Sharp Kabushiki Kaisha
Priority date
Filing date
Publication date
Application filed by Sharp Kabushiki Kaisha
Priority to JP2008528762A (patent JP4890551B2)
Publication of WO2008018274A1

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • G06F40/129Handling non-Latin characters, e.g. kana-to-kanji conversion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/109Font handling; Temporal or kinetic typography

Definitions

  • Character conversion device and control method of character conversion device
  • The present invention relates to a character conversion device with which a user inputs characters or a character string using an input device such as a keyboard, touch panel, tablet, or mouse. On receiving a conversion instruction, the device displays converted characters or character strings corresponding to the input in a selectable manner, and the user selects a displayed character or character string to input the desired one.
  • The information device takes longer to search the dictionary data for the kanji that matches the input reading. Consequently, after the user inputs a reading and instructs conversion, the response until the conversion result is displayed deteriorates, and the information device becomes harder to use.
  • Patent Document 1: Japanese Patent Laid-Open Publication No. Hei 6-149787 (published May 31, 1994)
  • Disclosure of Invention
  • the number of characters of the input reading is within the maximum number of characters of the reading of the word starting with the reading of the first character of the input reading character string recorded in the index information.
  • The present invention has been made in view of the above problems, and an object thereof is to provide a character conversion device, a character input method, a character input program, and a computer-readable recording medium capable of shortening the word search time for input phonetic characters.
  • The character conversion device is a device that can convert an input character string consisting of one or more input phonetic characters into converted characters and determine a word composed of the converted characters.
  • a word dictionary having word correspondence information, in which word identification information for identifying a word is associated with word phonetic information indicating the reading of the word, and (b-2) a word index, in which head converted character identification information for identifying the converted character at the beginning of a word is associated with word storage destination information indicating where, within the word correspondence information, the words having that head converted character are stored; and
  • a word acquiring unit which refers to the word dictionary based on the acquired head converted character identification information and acquires the words having, at the head, the converted character indicated by that information.
  • The head converted character acquisition means first refers to the converted character dictionary and searches for the converted character starting from the first phonetic character of the input character string. The word acquisition means then refers to the word dictionary and acquires the words that have, at the head, the converted character obtained by the head converted character acquisition means. This shortens the word search: because the number of comparisons between the input character string and the phonetic (reading) information is limited, a character conversion device with high response speed can be provided.
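The two-stage lookup described above can be sketched as follows. This is only an illustration under stated assumptions: the table names, the sample readings and words, and the contiguous-slice storage layout are all hypothetical and are not taken from the patent, and the sketch assumes the input contains at least the full reading of the first character.

```python
from typing import Dict, List, Tuple

# Hypothetical converted-character dictionary: reading -> kanji with that reading.
KANJI_BY_READING: Dict[str, List[str]] = {
    "mei": ["美", "每"],
    "ming": ["明"],
}

# Hypothetical word correspondence information: (word, reading) records,
# laid out so that words sharing a head kanji are stored contiguously.
WORD_RECORDS: List[Tuple[str, str]] = [
    ("明天", "mingtian"),
    ("明白", "mingbai"),
    ("美国", "meiguo"),
    ("美丽", "meili"),
    ("每天", "meitian"),
]

# Hypothetical word index: head kanji -> (start, end) slice of WORD_RECORDS,
# standing in for the "word storage destination information".
WORD_INDEX: Dict[str, Tuple[int, int]] = {
    "明": (0, 2),
    "美": (2, 4),
    "每": (4, 5),
}


def head_kanji(reading: str) -> List[str]:
    """Stage 1: kanji whose reading is a prefix of the input reading string."""
    heads: List[str] = []
    for kanji_reading, kanji_list in KANJI_BY_READING.items():
        if reading.startswith(kanji_reading):
            heads.extend(kanji_list)
    return heads


def find_words(reading: str) -> List[str]:
    """Stage 2: compare the input only with words filed under a matching head kanji."""
    result: List[str] = []
    for kanji in head_kanji(reading):
        start, end = WORD_INDEX.get(kanji, (0, 0))
        for word, word_reading in WORD_RECORDS[start:end]:
            if word_reading.startswith(reading):
                result.append(word)
    return result
```

In this sketch, the reading comparison in `find_words` touches only the slices selected through the index, which is the point of limiting the number of comparisons; a real dictionary would hold far more records per head character.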
  • the conversion character indicates the character after conversion.
  • the converted character is kanji
  • the first character of the phonetic character (reading) is as follows. For example, in Chinese
  • the first conversion character
  • Converted phonetic information is, for example, “memian” to “for“ 1 ”.
  • The converted character identification information may be any information that can identify a converted character (the character after conversion), including the converted character itself. For example, if the converted character is a kanji, the information may be the kanji itself or a kanji code. The same applies to the word identification information and the head converted character identification information.
  • FIG. 1 shows an embodiment of the present invention, and is a block diagram showing the main configuration of a character conversion device.
  • FIG. 2 is a diagram showing the flow of the character conversion processing performed by the character conversion device.
  • FIG. 3 is a diagram for explaining a kanji index according to an embodiment of the present invention.
  • FIG. 4 is a view for explaining kanji correspondence information according to an embodiment of the present invention.
  • FIG. 5 is a diagram for explaining a primary index according to an embodiment of the present invention.
  • FIG. 6 is a diagram for explaining a secondary index according to an embodiment of the present invention.
  • FIG. 7 is a diagram for explaining word correspondence information according to an embodiment of the present invention.
  • FIG. 8 is a diagram for explaining a Pinyin dictionary index according to an embodiment of the present invention.
  • FIG. 9 is a diagram for explaining a Pinyin dictionary according to an embodiment of the present invention.
  • FIG. 10 is a diagram showing an example of pinyin division processing according to an embodiment of the present invention.
  • FIG. 11 is a diagram showing an example of pinyin division processing that excludes the pinyin already determined in the pinyin division processing according to an embodiment of the present invention.
  • FIG. 12 is a block diagram showing the main configuration of a character conversion device according to an embodiment of the present invention.
  • FIG. 13 is a block diagram showing a schematic configuration of a multifunction printer to which the character conversion device of the present invention is applied.
  • FIG. 14 is a view showing an example of an operation screen at the time of character input processing, displayed on the operation panel of the character conversion device.
  • FIG. 15 is a view showing an example of a kanji dictionary recorded in the kanji dictionary recording unit, which is referred to when the candidate extraction unit of the character conversion device extracts candidate kanji.
  • FIG. 16 is a block diagram showing a main configuration of a character conversion device for extracting candidate idiom according to another embodiment of the present invention.
  • FIG. 17 is a diagram showing an example of candidate idiom displayed on the display unit of the operation panel of the character conversion device.
  • FIG. 18 is a diagram showing an example of an idiom dictionary recorded by the idiom dictionary recording unit.
  • FIG. 19 is a flowchart showing a process flow of the character conversion device according to another embodiment of the present invention.
  • FIG. 20 is a diagram showing a display example of a candidate character display area and a confirmed character display area on the operation panel.
  • FIG. 21 is a block diagram showing the main configuration of a character conversion device provided with a learning function according to another embodiment of the present invention.
  • FIG. 22 is a block diagram showing a more detailed configuration of a candidate order determination unit 32.
  • FIG. 23 is a diagram showing an example of a connectivity degree table indicating the connectivity degree with the immediately preceding (forward) character string for each kanji (idiom) recorded in the connectivity degree recording unit.
  • FIG. 24 is a diagram showing a classification procedure of candidates when the candidate order determination unit of the character conversion device determines the display order.
  • FIG. 25 is a diagram showing an example of a data structure for storing evaluation values of display order calculated for each candidate.
  • FIG. 26 is a diagram showing an example of priority information recorded in a candidate order determination element recording part.
  • FIG. 27 is a diagram showing an example of a kanji written in a character input frame of an operation panel, and an example of coordinate information of the kanji.
  • FIG. 28 is a view showing a display example of the operation panel.
  • FIG. 28 (a) is a view showing an example of kanji written in the character input frame of the display-integrated tablet.
  • FIG. 28 (b) is a diagram showing an example of a recognized character candidate displayed in the candidate character display area.
  • FIG. 28 (c) is a diagram showing an example of a candidate idiom displayed in the candidate character display area.
  • FIG. 29 is a diagram showing an example of the recognition result of the handwritten data “extra character 2”.
  • FIG. 30 is a block diagram showing an essential configuration of a character conversion device for inputting a traditional Chinese character according to another embodiment of the present invention.
  • FIG. 31 is a view showing an example of an operation screen at the time of a phonetic character input process for traditional character input according to another embodiment of the present invention.
  • FIG. 33 is a diagram showing an example of an idiom dictionary recorded by the idiom dictionary recording unit of the character conversion device according to another embodiment of the present invention.
  • FIG. 34 is a diagram showing an example of a connectivity table indicating the connectivity with the immediately preceding (forward) character string for each kanji (word) recorded in the connectivity recording unit of the character conversion device according to another embodiment of the present invention.
  • FIG. 35 is a diagram showing an example of an operation screen displayed on the display unit of the operation panel.
  • FIG. 36 is a view showing an example of an operation screen displayed on the display unit of the operation panel.
  • FIG. 37 is a diagram showing an example of an operation screen displayed on the display unit of the operation panel.
  • FIG. 38 is a diagram showing an example of an operation screen displayed on the display unit of the operation panel.
  • One embodiment (Embodiments 1 to 4) of the present invention will be described below with reference to FIGS. 12 to 38.
  • In the present embodiment, a case will be described in which the character conversion device of the present invention is applied to a multifunction printer (copier) having a touch panel in which an input device and an output device are integrated.
  • the copying machine described in the present embodiment is an example, and the character conversion device of the present invention can be applied to a mobile phone, a portable terminal, a personal computer, a remote control, and the like.
  • The character conversion device of the present invention is not limited to Chinese characters.
  • The desired ideographic character can be obtained from its phonetic notation.
  • Chinese, which the above pinyin notation describes, has four tones representing four different pronunciations. The concept of the four tones is omitted here for simplicity of explanation.
  • FIG. 13 is a block diagram showing a schematic configuration of a multi-function printer to which the character conversion device 1 of the present invention is applied.
  • the multifunction printer includes a control unit 10, an operation panel (tablet) 2, a temporary storage unit 3, a recording unit 40, an image reading unit 4, an image transfer unit 5, and a paper feeding unit 6.
  • the configuration of the multifunction printer described here is shown as an example for implementing the present invention, and does not limit the scope of the present invention.
  • the control unit 10 centrally controls the multi-function printer.
  • The control unit 10 reads the various programs recorded in the recording unit 40 and controls the image reading unit 4, the image transfer unit 5, the paper feeding unit 6, and the like so that the device functions as a copying machine and performs the related processing. Each unit that performs the character input processing according to the present invention is also included in the control unit 10.
  • The recording unit 40 records the control programs for the operation panel 2 and for the above units that operate the device as a copying machine, as well as the control program of the character conversion device 1. It is realized by a ROM (read-only memory) or the like.
  • The temporary storage unit 3 is a working area for program processing and is used for temporarily storing read images. It is also used for temporary storage of, for example, learned characters (strings) during character input processing. It is realized by volatile memory such as random access memory (RAM).
  • The operation panel 2 is used by the user to instruct the multifunction printer to execute various processes. It is realized by a touch panel in which a display device and an input device are integrally formed, or by a display-integrated tablet.
  • The display-integrated tablet may have a handwritten character recognition function that recognizes written characters. It also has a function of detecting instructed characters, character strings, and other function keys based on the position the user indicates on the tablet.
  • The user can input a desired letter to the multifunction printer by lightly touching, with a pen or a finger, the key corresponding to that letter (phonetic character) displayed on the operation panel 2.
  • The multifunction printer may also be provided with a keyboard (first input unit / third input unit) 7 (for example, a general-purpose keyboard used with a PC). In this case, letters can be input using either the keys displayed on the operation panel 2 or the keys of the keyboard 7.
  • the input unit (first input unit) having a key for inputting phonetic characters can be realized simultaneously by both the operation panel 2 and the keyboard 7 or can be realized only by the keyboard 7. Alternatively, it may be configured to switch whether to input the phonetic characters on the operation panel 2 or to the keyboard 7.
  • FIG. 12 is a block diagram showing the main configuration of the character conversion device 1 according to the present embodiment.
  • the character conversion device 1 is configured to include an operation panel 2, a control unit 10, a recording unit 40, and a temporary storage unit 3 as shown in FIG.
  • The operation panel 2 is realized by a touch panel in which the display device and the input device are integrally formed.
  • FIG. 14 is a diagram showing an example of an operation screen displayed on the operation panel 2 during character input processing.
  • The display unit 12 displays a character input area 121 showing a keyboard with which the user inputs characters, a candidate character (string) display area 122 showing the characters (or character strings) listed as input candidates, and a confirmed character (string) display area 123 showing the characters whose input has already been confirmed.
  • The input unit 11 is used by the user to input pinyin (phonetic character string / phonetic information), the pronunciation notation of Chinese. When the user presses a letter key displayed in the character input area 121, the signal of the letter corresponding to the pressed position on the panel is input to the input / output control unit 20 of the character conversion device 1. The input unit 11 is also used by the user to select the character whose input is to be confirmed. When a kanji (ideographic character / converted character) displayed in the candidate character display area 122 is pressed, the signal of the kanji displayed at the pressed position is input to the input / output control unit 20, and the input of that kanji is confirmed.
  • The control unit 10 includes the input / output control unit 20 and the character input control unit 30.
  • The input / output control unit 20 controls the input and output of signals exchanged between the operation panel 2 and the control unit 10, and includes an input reception unit 21, a display control unit (display control means) 22, and a candidate selection reception unit 23.
  • The character input control unit 30 executes character input processing, and includes a candidate extraction unit (character candidate extraction unit / character string candidate extraction unit) 31, a candidate order determination unit (candidate order determination means) 32, and a character determination unit (character input confirmation means) 33.
  • each unit in the control unit 10 is realized by a central processing unit (CPU) of a computer and software executed by the CPU. This software is also stored in the above recording unit 40, read out to the temporary storage unit 3, and executed by the CPU.
  • the input receiving unit 21 of the input / output control unit 20 receives signals of characters (such as alphabets) input from the input unit 11 and supplies the signals to the candidate extracting unit 31.
  • The display control unit 22 controls the display unit 12 so that the information to be displayed appears in each of the character input area 121, the candidate character display area 122, and the confirmed character display area 123 of the display unit 12.
  • The candidate selection reception unit 23 receives, from the input unit 11, the signal of the candidate kanji selected in the candidate character display area 122 and the instruction signal for confirming the input of the selected candidate kanji, and supplies them to the character determination unit 33.
  • The candidate extraction unit 31 of the character input control unit 30 extracts candidate kanji (or words) from the respective dictionary units stored in the recording unit 40 based on the input characters.
  • the candidate extraction unit 31 extracts a kanji as a candidate from the kanji dictionary unit (character recording unit) 41 of the recording unit 40 based on the input alphabet (string).
  • First, the pinyin corresponding to the input letters (string) are identified; then the kanji corresponding to each identified pinyin are extracted as candidate kanji.
  • the candidate extraction unit 31 stores the extracted candidate kanji in an internal buffer (not shown), and instructs the candidate order determination unit 32 and the display control unit 22 to display the candidate kanji.
  • In the kanji dictionary unit 41, to which the candidate extraction unit 31 refers, letters (strings), pinyin, and kanji are recorded in association with one another. Details of the kanji dictionary unit 41 will be described later.
  • the candidate order determination unit 32 determines the display order of the candidate kanji extracted by the candidate extraction unit 31 based on a predetermined condition.
  • The display order is the order in which candidate kanji are displayed in the candidate character display area 122 as shown in FIG. If all extracted candidate kanji can be displayed at once in the candidate character display area 122, higher-ranked candidate kanji should be placed nearer the top of the area, where they are easier for the user to select. When the number of candidate kanji exceeds the number that can be displayed at one time, the page may be switched to display the lower-ranked candidate kanji.
  • The candidate order determination unit 32 determines the order, based on a predetermined condition, so that the extracted candidate kanji are displayed starting with those most likely to be selected by the user. The conditions used to determine the order of the candidate kanji are described in detail later.
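The ordering and page switching described above can be sketched as follows. The "predetermined condition" is not specified at this point in the text, so the sketch simply assumes a hypothetical per-kanji score (for example, a usage count); the function names and sample data are likewise illustrative.

```python
from typing import Dict, List


def order_candidates(candidates: List[str], score: Dict[str, int]) -> List[str]:
    """Sort candidates by descending score; ties keep their extraction order."""
    # sorted() is stable, so equal-score candidates stay in their original order.
    return sorted(candidates, key=lambda c: -score.get(c, 0))


def page_of(ordered: List[str], page_index: int, per_page: int) -> List[str]:
    """Return the candidates shown on one page of the candidate display area."""
    start = page_index * per_page
    return ordered[start:start + per_page]


# Usage: five extracted candidates, three slots per page.
extracted = ["马", "妈", "美", "每", "明"]
assumed_scores = {"美": 5, "明": 3}  # hypothetical likelihood values
ordered = order_candidates(extracted, assumed_scores)
first_page = page_of(ordered, 0, 3)
second_page = page_of(ordered, 1, 3)
```

Switching to the next page then corresponds to calling `page_of` with the next `page_index`, which returns an empty list once the candidates are exhausted.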
  • The character determination unit 33 receives, from the candidate selection reception unit 23, the signal of the candidate kanji selected by the user via the input unit 11 and the instruction signal for confirming the input of that kanji, and confirms the input of the selected candidate kanji. First, the selected candidate kanji is displayed in the confirmed character display area 123 with an underline indicating that it is selected but not yet confirmed. Then, once the character determination unit 33 confirms it in response to the instruction signal, the kanji is displayed in the confirmed character display area 123, via the display control unit 22, without the underline, as an input-confirmed character.
  • FIG. 15 is a diagram showing an example of a kanji dictionary recorded in the kanji dictionary unit 41, which is referred to when the candidate extraction unit 31 extracts candidate kanji.
  • The kanji dictionary 140 according to the present embodiment includes an alphabet-pinyin correspondence table (A/P correspondence table 141), in which the correspondence between the letters (strings) received by the input reception unit 21 and pinyin is recorded, and a pinyin-kanji correspondence table (P/C correspondence table 142), in which the correspondence between pinyin and kanji is recorded. This makes it possible to directly extract the candidate kanji corresponding to the input letters (string).
  • The candidate extraction unit 31 first identifies, according to the A/P correspondence table 141 of the kanji dictionary 140, the pinyin that prefix-match “m” (in the example of FIG. 15, “ma”, “mei”, and so on), and detects the addresses in the P/C correspondence table 142 at which the kanji corresponding to those pinyin are recorded (“cp-ml” and “cp-mi”).
  • The candidate extraction unit 31 then extracts the kanji corresponding to the pinyin that prefix-match “m”.
  • the extracted kanji is supplied to the candidate order determination unit 32 as a candidate kanji as described above.
  • The candidate extraction unit 31 narrows the candidates down to the kanji whose pinyin prefix-match “mei”, and extracts the candidate kanji from the kanji dictionary unit 41.
  • the candidate extraction unit 31 receives an alphabet input by the user, and specifies pinyin including the alphabet.
  • The candidate extraction unit 31 can extract, from the kanji dictionary unit 41, candidates for the kanji that the user desires to input based on the identified pinyin.
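The prefix-matching extraction described above can be sketched with two small tables. The table contents and the function name are assumptions made for illustration; the layout of the actual alphabet-pinyin and pinyin-kanji correspondence tables in FIG. 15 (including their address fields) is not reproduced here.

```python
from typing import Dict, List

# Hypothetical alphabet-pinyin table: first letter -> pinyin starting with it.
ALPHABET_TO_PINYIN: Dict[str, List[str]] = {
    "m": ["ma", "mei", "ming"],
}

# Hypothetical pinyin-kanji table: pinyin -> kanji read with that pinyin.
PINYIN_TO_KANJI: Dict[str, List[str]] = {
    "ma": ["马", "妈"],
    "mei": ["美", "每"],
    "ming": ["明"],
}


def candidate_kanji(typed: str) -> List[str]:
    """Collect kanji for every pinyin that the typed letters prefix-match."""
    if not typed:
        return []
    candidates: List[str] = []
    # Only the pinyin filed under the first typed letter are examined.
    for pinyin in ALPHABET_TO_PINYIN.get(typed[0], []):
        if pinyin.startswith(typed):
            candidates.extend(PINYIN_TO_KANJI.get(pinyin, []))
    return candidates
```

Typing more letters narrows the result: `candidate_kanji("m")` returns kanji for all three sample pinyin, while `candidate_kanji("mei")` returns only those read "mei".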
  • The candidate order determination unit 32 sorts the kanji candidates extracted by the candidate extraction unit 31 based on a predetermined condition and determines their display order.
  • The display control unit 22 controls the display unit 12 so that the kanji candidates are displayed in the display order determined by the candidate order determination unit 32.
  • the character determination unit 33 determines the input of the designated kanji.
  • The kanji extracted as candidates by the candidate extraction unit 31 may instead be stored in an external storage device (not shown). Kanji read from the external storage device in response to the user's letter input operation may then be used when displaying the candidates.
  • After a kanji is selected as an input-confirmed character, the character conversion device 1 displays, in the candidate character display area 122 (FIG. 14), kanji strings that begin with that kanji and consist of one or more kanji carrying a single meaning (hereinafter referred to as words (ideographic character strings)) as candidate words.
  • The user can thus find, in the candidate character display area 122, the kanji that form a word together with that kanji, and can obtain a desired kanji without separately inputting the pinyin of the following kanji. This simplifies the input operation for obtaining the desired kanji.
  • FIG. 16 is a block diagram showing the main configuration of the character conversion device 1 that outputs candidate words.
  • the reference numerals attached to the respective components in FIG. 16 correspond to the reference numerals attached to the respective components in FIG. 12, and the same reference numerals indicate the same components. Therefore, the description of the components already described in the above embodiments will not be repeated.
  • The character conversion device 1 shown in FIG. 16 differs from that shown in FIG. 12 in that a control signal is transmitted from the character determination unit 33 to the candidate extraction unit 31 and in that the word dictionary unit (character string recording unit) 42 is included.
  • FIG. 17 is a diagram showing an example of candidate words displayed on the display unit 12 of the operation panel 2.
  • a word that starts with “extra character 2” is displayed as a candidate word in the candidate character display area 122 of the display unit 12.
  • Kanji and words are recorded in association with each other in the word dictionary unit 42, to which the candidate extraction unit 31 refers.
  • FIG. 18 is a diagram showing an example of a word dictionary recorded by the word dictionary unit 42.
  • The word dictionary in this embodiment includes a kanji-word correspondence table (C/I correspondence table 143), in which the correspondences between the kanji recorded in the kanji dictionary unit 41 and the words starting with those kanji are recorded. This makes it possible to further extract the candidate words corresponding to the selected kanji.
  • The candidate extraction unit 31 detects the address of “extra character 2” in the P/C correspondence table 142 (FIG. 15) (“wp-mi+j”). Subsequently, referring to the word dictionary unit 42, it extracts the words starting with “extra character 2” (“extra character 2” month, “extra character 2” heaven, ...) based on the address “wp-mi+j”. The extracted words are supplied as candidate words to the candidate order determination unit 32, as in the configuration described above, and displayed in the candidate character display area 122 (FIG. 17).
  • The character that follows a given kanji is more likely to be a kanji that forms a word beginning with that kanji than a kanji with no relation to it. Therefore, if the words starting with a kanji whose input has been confirmed are displayed as candidates, the user can input the desired word simply by selecting it from among the word candidates, without separately inputting its pinyin. As a result, the user can input the desired kanji with fewer pinyin input operations.
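The address-based link from a confirmed kanji to its candidate words can be sketched as below. The address strings, the table contents, and the `KanjiEntry` type are hypothetical stand-ins chosen for illustration; they do not reproduce the address format used in FIG. 15 and FIG. 18.

```python
from typing import Dict, List, NamedTuple


class KanjiEntry(NamedTuple):
    kanji: str
    word_address: str  # hypothetical key into the kanji-word table


# Hypothetical pinyin-kanji table whose entries carry a word-table address.
PINYIN_TO_ENTRIES: Dict[str, List[KanjiEntry]] = {
    "ming": [KanjiEntry("明", "wp-ming-1"), KanjiEntry("名", "wp-ming-2")],
}

# Hypothetical kanji-word table: address -> words that start with the kanji.
WORDS_AT_ADDRESS: Dict[str, List[str]] = {
    "wp-ming-1": ["明天", "明白", "明月"],
    "wp-ming-2": ["名字"],
}


def candidate_words(pinyin: str, confirmed_kanji: str) -> List[str]:
    """Follow the confirmed kanji's address to the words that begin with it."""
    for entry in PINYIN_TO_ENTRIES.get(pinyin, []):
        if entry.kanji == confirmed_kanji:
            return list(WORDS_AT_ADDRESS.get(entry.word_address, []))
    return []
```

Because the word list is reached through a stored address rather than by scanning every word, no further pinyin input or reading comparison is needed once the first kanji is confirmed.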
  • The input reception unit 21 receives the input and supplies the signal of the letter “m” to the candidate extraction unit 31.
  • the candidate extraction unit 31 specifies pinyin corresponding to the input alphabet, and extracts a kanji corresponding to the specified pinyin from the kanji dictionary unit 41 as a candidate kanji (S202).
  • the candidate order determination unit 32 determines the display order of the above extracted candidate kanji based on predetermined conditions (details will be described later) (S203). Subsequently, the display control unit 22 determines the display position so as to display n candidate kanji in the display order determined by the candidate order determination unit 32, and outputs the candidate kanji to the display unit 12 to present it to the user. (S204).
  • When the user inputs an instruction to display the next candidate kanji beyond those first displayed in the candidate character display area 122 (for example, because the desired kanji “extra character 2” is not among those in FIG. 20 (a)), the process returns to S204, and the display control unit 22 displays a list of the next candidate kanji (see FIG. 20 (b)).
  • The character determination unit 33 instructs the display control unit 22 to display the selected “extra character 2”, and instructs the candidate extraction unit 31 to next extract the words starting with “extra character 2”.
  • the candidate extraction unit 31 subsequently extracts a word beginning with “extra character 2” as a candidate word from the word dictionary unit 42 (S207) ((c) in FIG. 20).
  • The candidate order determination unit 32 determines the display order of the extracted candidate words based on predetermined conditions (details will be described later) (S208). Subsequently, the display control unit 22 outputs the candidate words to the display unit 12 in the display order determined by the candidate order determination unit 32 and presents them to the user (S209).
  • When the user inputs an instruction to display the next candidate words (NO in S210, YES in S211), the process returns to S209, and the display control unit 22 displays a list of the next candidate words. If neither an instruction to display the next candidate words nor an instruction to select a candidate word is input (NO in S210 and S211) and a letter is newly input, the process returns to S202, and the candidate extraction unit 31 specifies the pinyin corresponding to the letters input so far and extracts the kanji corresponding to the specified pinyin as candidate kanji.
  • The candidate selection reception unit 23 receives the signal of the selected candidate word “extra character 2” heaven and supplies the signal to the character determination unit 33.
  • The character determination unit 33 instructs the display control unit 22 to display the selected candidate word, and the display control unit 22 displays the selected candidate word on the display unit 12 (S212) (FIG. 20 (d)). At the same time, it instructs the candidate extraction unit 31 to extract the words starting with that candidate word. If an instruction to end character input is not input (NO in S213), the process returns to S207, and the candidate extraction unit 31 extracts the words starting with the selected candidate word from the word dictionary unit 42 as candidate words.
  • The character conversion device 1 of the present invention may be provided with a configuration that records kanji (words) whose input has been determined by the user's selection.
  • With this configuration, the character conversion device 1 can display candidate kanji based on kanji input in the past, that is, kanji recorded as a result of the user's use. Kanji used in the past are likely to be used again by the user. Therefore, by using a list of candidate kanji that reflects this usage history, the user can obtain a desired kanji with a small number of operations.
  • FIG. 21 is a block diagram showing the main configuration of the character conversion device 1 having a learning function.
  • the reference numerals attached to the respective components in FIG. 21 correspond to the reference numerals attached to the respective components in FIG. 12, and the same reference numerals indicate the same components. Therefore, the description of the components already described in the above embodiments will not be repeated.
  • The character conversion device 1 shown in FIG. 21 differs from that in FIG. 12 in that the character input control unit 30 further includes a learning control unit (learning control means) 34, and the recording unit 40 further includes a learning dictionary recording unit (learning word recording unit) 43 and a pair learning dictionary recording unit (learning word recording unit) 44.
  • The learning control unit 34 controls the recording (learning) of kanji input to (used in) the character conversion device 1 in accordance with the user's instructions. Specifically, a kanji used by the user is stored in the learning dictionary recording unit 43 as a learning kanji (hereinafter referred to as the single learning function), or is stored in the pair learning dictionary recording unit 44 in association with the kanji input immediately before it (hereinafter referred to as the pair learning function). Since kanji that the user has input in the past are likely to be used again, using such learned kanji as candidate kanji increases the probability that the user obtains a desired kanji with fewer operations.
  • the learning dictionary recording unit 43 records kanji (input determined characters) used in the past by the user as single learning kanji.
  • In the learning dictionary recorded in the learning dictionary recording unit 43, information on the date and time at which each single learning kanji was recorded and information on its frequency of use (how many times it has been used in the past, how many times it has been used within a predetermined period, etc.) may be recorded in association with each kanji.
  • When receiving the signal of the candidate kanji selected by the user from the character determination unit 33, the learning control unit 34 writes the selected candidate kanji to the temporary storage unit 3.
  • When the character determination unit 33 determines the input in accordance with the determination instruction signal, the learning control unit 34 receives from the character determination unit 33 an instruction to perform the single learning process on the input determined character.
  • The learning control unit 34 reads the temporarily written candidate kanji from the temporary storage unit 3 into an internal buffer, and deletes the candidate kanji read from the temporary storage unit 3 or updates its contents. Then, the candidate kanji held in the internal buffer is stored in the learning dictionary recording unit 43 as a single learning kanji.
  • The pair learning dictionary recording unit 44 records, as a pair learning kanji, each kanji used in the past by the user in association with information (the immediately preceding kanji) indicating which kanji it followed. The immediately preceding kanji has already been recorded as a single learning kanji in the learning dictionary recording unit 43.
  • An immediately preceding kanji is recorded in association with each pair learning kanji. Information on the recording date and time and information on the frequency of use may also be recorded in association with each kanji.
  • the learning control unit 34 receives from the character determination unit 33 an instruction of a pair learning process of the input determined character.
  • the learning control unit 34 reads the input determined character and its immediately preceding kanji from the temporary storage unit 3 to the internal buffer, and deletes the character string read from the temporary storage unit 3 and updates the contents. Then, the input determined character stored in the internal buffer is associated with the immediately preceding kanji, and is stored in the pair learning dictionary recording unit 44 as a pair learning kanji.
  • Not only kanji but also words may be recorded in the dictionaries. That is, a word that has been used may be recorded in the learning dictionary recording unit 43 by the single learning function of the learning control unit 34. For example, when the word “water supply” is used, the word “water supply” is recorded in the learning dictionary recording unit 43 as a single learning kanji (word), in addition to the kanji “supply”.
  • Likewise, with the pair learning function of the learning control unit 34, when the words “university” and “professor” are input consecutively, the kanji “teaching” can be recorded as a pair learning kanji in association with the immediately preceding kanji (word). Furthermore, the word “professor” can be recorded as a pair learning word in association with the immediately preceding kanji and/or the immediately preceding word “university”.
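The single learning and pair learning functions described above can be sketched as two small dictionaries. This is only an illustrative sketch, not the recording format of the device: the names `LearnedEntry`, `LearningDictionaries`, and `learn` are hypothetical, and the `now` argument is an arbitrary increasing number standing in for the recording date and time.

```python
from dataclasses import dataclass, field


@dataclass
class LearnedEntry:
    recorded_at: int    # time of the most recent recording (hypothetical clock value)
    use_count: int = 1  # how many times the item has been used so far


@dataclass
class LearningDictionaries:
    # Stand-in for the learning dictionary recording unit 43: item -> entry.
    single: dict = field(default_factory=dict)
    # Stand-in for the pair learning dictionary recording unit 44:
    # (immediately preceding item, item) -> entry.
    pair: dict = field(default_factory=dict)

    @staticmethod
    def _record(table, key, now):
        entry = table.get(key)
        if entry is None:
            table[key] = LearnedEntry(recorded_at=now)
        else:
            entry.recorded_at = now
            entry.use_count += 1

    def learn(self, item, now, previous=None):
        """Record a confirmed kanji or word, plus the pair if something preceded it."""
        self._record(self.single, item, now)
        if previous is not None:
            self._record(self.pair, (previous, item), now)


dictionaries = LearningDictionaries()
dictionaries.learn("university", now=1)
dictionaries.learn("professor", now=2, previous="university")
dictionaries.learn("professor", now=5, previous="university")
```

After these calls, “professor” is held both as a single learning word and as a pair learning word following “university”, each with its latest recording time and a use count of 2.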
  • Next, the predetermined conditions (hereinafter referred to as candidate order determination elements) that the candidate order determination unit 32 (FIGS. 12, 16, and 21) of the character conversion device 1 uses to determine the display order of candidates (candidate kanji/candidate words) will be described.
  • It is preferable that the display unit 12 provided on the operation panel 2 can display more candidates at one time. In particular, when the operation panel 2 is realized by an integrated display/input device such as a touch panel, the number of operations the user needs to obtain a desired kanji can be reduced significantly.
  • On the other hand, as the number of displayed candidates grows, the burden of searching for the desired candidate increases. Therefore, when a limited number of candidates are displayed in a limited display area, it is desirable that they be displayed in descending order of the likelihood that the user desires (selects) each kanji (word).
  • In the present invention, the order of the candidates is determined so that they are displayed in a more accurate display order that matches the user's preference. This realizes a character conversion device capable of simplifying the input operation for obtaining a desired kanji.
  • The display order is determined in the same manner for both candidate kanji and candidate words.
  • The candidate order determination unit 32 determines the display order of candidates based on the following seven candidate order determination elements (elements A to G).
  • Element A (appearance frequency): The appearance frequency is an index of how often a character (string) appears in a given language. The higher the appearance frequency, the more often the character (string) is used in that language environment. Therefore, the higher the appearance frequency, the higher the display order. The appearance frequency may be fixed information based on known statistical data.
  • Element B (pinyin length): The display order is determined based on whether the number of characters in the input alphabet string matches the number of characters in the pinyin of the candidate. Candidates whose pinyin length matches the number of input characters are ranked higher.
  • Elements C and D (connectivity with preceding character strings): Candidates that are more likely to follow the input determined characters whose input has already been determined and/or the currently selected character (string) (continuity with the preceding character string) are ranked higher.
  • Element E (single learning): The display order is determined depending on whether the candidate is a single learning kanji (word) recorded in the learning dictionary recording unit 43. A kanji (word) that has been input (used) in the past is likely to be used again. Therefore, candidates recorded as single learning kanji (words) are ranked higher.
  • Element F (pair learning): The display order is determined depending on whether the candidate is recorded in the pair learning dictionary recording unit 44 as a pair learning kanji (word). A combination of kanji (words) that has been input (used) in the past is likely to be used again in that combination. Therefore, candidates recorded as pair learning kanji (words) are ranked higher.
  • Element G (learning time): The display order is determined based on the time at which the learning kanji (word) matching the candidate kanji (word) was recorded in the learning dictionary recording unit 43 and/or the pair learning dictionary recording unit 44.
  • By determining the display order based on these elements, candidates with a high possibility of being selected by the user can be displayed preferentially in the candidate character display area 122 (FIG. 14), and a desired kanji can be obtained with a small number of key operations.
  • A candidate order determination element is information that the candidate order determination unit 32 needs in order to determine the display order of candidate kanji/candidate words. More specifically, it is information that the candidate order determination unit 32 uses to calculate an evaluation value indicating how likely a candidate kanji/candidate word is to be selected by the user. Information on the candidate order determination elements is stored in the respective recording units of the recording unit 40 in association with each candidate (candidate kanji/candidate word). By referring to these units, the candidate order determination unit 32 can identify the candidate order determination elements for each candidate. Furthermore, the recording unit 40 may store an evaluation value calculation rule that defines how to calculate the evaluation value from the candidate order determination elements identified by the candidate order determination unit 32. The candidate order determination unit 32 can then obtain an evaluation value for each candidate from the identified elements and the evaluation value calculation rule, and determine the candidate display order. The evaluation value calculation method of the candidate order determination unit 32 is described later.
  • FIG. 22 is a block diagram showing a more detailed configuration of the candidate order determination unit 32.
  • the candidate order determination unit 32 includes a learning kanji determination unit 131, a pinyin length determination unit 132, a connectivity degree determination unit 133, an appearance frequency determination unit 134, and a learning time determination unit 135.
  • the recording unit 40 (FIGS. 12, 16, and 21) includes an appearance frequency recording unit 45 and a connectivity degree recording unit (association degree recording unit) 46 in addition to the above-described storage units.
  • Temporary storage unit 3 internally includes extraction candidate temporary storage unit 51, evaluation value temporary storage unit 52, and display order temporary storage unit 53.
  • The extraction candidate temporary storage unit 51 in the temporary storage unit 3 temporarily holds the kanji and/or words extracted as candidates by the candidate extraction unit 31 (hereinafter referred to as extraction candidates).
  • the candidate order determination unit 32 reads out the extraction candidates from the extraction candidate temporary storage unit 51, and performs processing of determining the display order of each extraction candidate.
  • the evaluation value temporary storage unit 52 temporarily holds the evaluation value given to each of the extraction candidates calculated based on the candidate order determination element and the evaluation value calculation rule.
  • the display order may be determined based on the finally calculated evaluation value.
  • the display order temporary storage unit 53 temporarily holds information of the display order determined for each of the extraction candidates.
  • the candidate order determination unit 32 stores the determined display order in the display order temporary storage unit 53.
  • Information on the display order of the display order temporary storage unit 53 is referred to by the display control unit 22 when displaying the extraction candidates on the display unit 12.
  • The learning kanji judgment unit 131 of the candidate order determination unit 32 judges whether an extraction candidate is a learning kanji (word) and, if so, whether it was recorded by single learning or by pair learning.
  • The learning kanji judgment unit 131 reflects the judgment result in the evaluation value held in the evaluation value temporary storage unit 52 or in the display order held in the display order temporary storage unit 53. This makes it possible to determine the display order based on element E and/or element F.
  • The pinyin length determination unit 132 determines whether the number of characters in the alphabet string input to the input reception unit 21 (FIGS. 12, 16, and 21) matches the number of characters in the pinyin of the corresponding candidate. For example, suppose that the user has typed “me” in order to obtain the kanji “extra character 2” (pinyin: mei). At this point, the number of input characters is 2, whereas the pinyin length of the corresponding candidate kanji “extra character 2” (mei) is 3, so the pinyin length determination unit 132 determines that the pinyin lengths do not match.
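The pinyin length check in the “me” versus “mei” example amounts to comparing two string lengths. The following is a minimal sketch under the assumption that each candidate is represented only by its pinyin; the function names are hypothetical and “men” is an extra illustrative candidate not taken from the text.

```python
def pinyin_length_matches(typed_letters: str, candidate_pinyin: str) -> bool:
    """True when the number of typed letters equals the candidate's pinyin length."""
    return len(typed_letters) == len(candidate_pinyin)


def rank_by_pinyin_length(typed_letters: str, candidate_pinyins: list) -> list:
    """Stable sort: exact-length matches first, original order otherwise preserved."""
    return sorted(
        candidate_pinyins,
        key=lambda pinyin: not pinyin_length_matches(typed_letters, pinyin),
    )


# Two letters typed so far; "mei" has three letters, so it does not match yet.
ranked = rank_by_pinyin_length("me", ["mei", "me", "men"])
```

Because the sort is stable, candidates that do not match keep their previous relative order, so this check can be layered on top of an ordering produced by other elements.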
  • The connectivity determination unit 133 determines the degree of connection between each candidate and the input determined characters whose input has already been determined and/or the currently selected character (kanji/word). In other words, it predicts, according to the degree of connection, which candidate the user is likely to want next.
  • The connectivity determination unit 133 refers to the connectivity table 150 recorded in the connectivity recording unit 46 to determine the degree of connection with the determined or selected forward character string and/or immediately preceding character string. The higher the degree of connection with the forward character string and/or the immediately preceding character string, the higher the probability that the candidate is the character desired by the user, and the higher its display order is made. The display order based on element C and/or element D can thereby be determined. Details of the connectivity table 150 of the connectivity recording unit 46 and of the connectivity determination unit 133 are described later.
  • The appearance frequency determination unit 134 determines the frequency with which candidate kanji (words) appear (are used), based on known statistical data recorded in the appearance frequency recording unit 45. A character that appears more frequently is used more often and is therefore more likely to be used by the user. Therefore, the higher the appearance frequency, the higher the display order. The display order based on element A can thereby be determined.
  • The learning time determination unit 135 determines the time at which the learning kanji (word) matching the candidate kanji (word) was recorded in the learning dictionary recording unit 43 and/or the pair learning dictionary recording unit 44. The more recently a learned kanji (word) was recorded, the more likely it is to be selected by the user. Therefore, the display order of the candidates is determined so that kanji (words) learned more recently are ranked higher. This makes it possible to determine the display order based on element G.
  • the candidate order determination unit 32 can determine the candidate display order based on the various conditions described above.
  • Next, the operation of the connectivity determination unit 133 will be described in more detail.
  • FIG. 23 is a diagram showing an example of the connectivity degree table 150 recorded in the connectivity degree recording unit 46 and showing the connectivity degree with the immediately preceding (forward) character string for each kanji (word).
  • The connectivity table 150 contains three tables: a connectivity correspondence table 151, which is stored for each kanji (word) and is used to look up the address of the continuity information with the immediately preceding character string and the address of the relevance information with the forward character string; a continuity table 152, which stores, for each kanji (word), the continuity with the immediately preceding character string; and a forward relevance table 153, which stores, for each kanji (word), the relevance to the forward character string.
  • For example, when the word “traffic” is selected by the user, the connectivity determination unit 133 searches the connectivity table 150 for kanji (words) that have a high probability of following “traffic” as the immediately preceding character string.
  • Specifically, the connectivity determination unit 133 refers to the connectivity correspondence table 151 and detects that the continuity information for the immediately preceding character string “traffic” is stored at the address “npi + l” of the continuity table 152.
  • Next, the connectivity determination unit 133 refers to the address “npi + l” of the continuity table 152 and detects that the continuity is high in the order “rule” (continuity 21), “university” (16), “tool” (15), and so on. As a result, the display order of these candidate words is determined to be “rule”, “university”, “tool”, and so on.
  • Similarly, the connectivity determination unit 133 searches the connectivity table 150 for kanji (words) that are likely to be input in association with “Shanghai” as a forward character string. The connectivity determination unit 133 refers to the connectivity correspondence table 151 and detects that the relevance information for the forward character string “Shanghai” is stored at the address “ppi + s” of the forward relevance table 153.
  • Next, the connectivity determination unit 133 refers to the address “ppi + s” of the forward relevance table 153 and detects that the relevance is high in the order “university” (relevance 10), “rule” (5), “police” (3), and so on. As a result, the display order of these candidate words is determined to be “university”, “rule”, “police”, and so on.
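The two-step lookup described here (correspondence table first, then the addressed table) can be sketched with plain dictionaries. The addresses and values are the ones quoted in the text for FIG. 23; the dictionary layout, the constant names, and the helper functions are assumptions made for illustration and say nothing about how the tables are actually stored.

```python
# Connectivity correspondence table 151: word -> addresses into the other two tables.
CORRESPONDENCE_151 = {
    "traffic": {"continuity": "npi + l"},
    "Shanghai": {"continuity": "npi + s", "relevance": "ppi + s"},
}

# Continuity table 152: address -> {following word: continuity}.
CONTINUITY_152 = {
    "npi + l": {"rule": 21, "university": 16, "tool": 15, "police": 14},
    "npi + s": {"university": 15, "traffic": 13, "police": 12, "rule": 8},
}

# Forward relevance table 153: address -> {candidate word: relevance}.
FORWARD_RELEVANCE_153 = {
    "ppi + s": {"university": 10, "rule": 5, "police": 3, "tool": 2},
}


def ranked_words(table: dict, address: str) -> list:
    """Words stored at the given address, highest value first."""
    entries = table.get(address, {})
    return sorted(entries, key=entries.get, reverse=True)


def candidates_following(previous_word: str) -> list:
    """Candidates ordered by continuity with the immediately preceding word."""
    address = CORRESPONDENCE_151[previous_word]["continuity"]
    return ranked_words(CONTINUITY_152, address)


def candidates_related_to(forward_word: str) -> list:
    """Candidates ordered by relevance to a word located further ahead."""
    address = CORRESPONDENCE_151[forward_word]["relevance"]
    return ranked_words(FORWARD_RELEVANCE_153, address)


after_traffic = candidates_following("traffic")
related_to_shanghai = candidates_related_to("Shanghai")
```

Keeping the addresses in a separate correspondence table means the continuity and relevance data can be shared or relocated without touching the per-word entries, which appears to be the point of the three-table split.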
  • In this way, the display order can be determined for candidate kanji as well as for words that the user often inputs, so candidate kanji/words can be displayed in a display order that reflects the user's preference. As a result, the number of key operations by the user can be reduced and the input operation for obtaining a desired kanji can be simplified.
  • The judgment result based on element D may be given priority.
  • In that case, the candidate word “rule” is not always displayed first; if “Shanghai” has been input ahead, the candidate word “university” can be displayed preferentially.
  • the word “Shanghai” is displayed on the determined character display area 123 (FIG. 14).
  • the candidate extraction unit 31 of the character conversion device 1 extracts candidate words input following the input determined character "Shanghai".
  • The candidate order determination unit 32 refers to the connectivity recording unit 46 in FIG. 22 to determine the display order of the candidate words so that words with a high probability of following “Shanghai” are displayed near the top.
  • The connectivity determination unit 133 refers to the address “npi + s” of the continuity table 152 based on the connectivity correspondence table 151 (FIG. 23), and arranges the candidate words in descending order of the probability that they directly follow the word “Shanghai”. That is, in the example shown in FIG. 23, the candidates are obtained in the order “university” (15), “traffic” (13), “police” (12), “rule” (8), and so on. Subsequently, the connectivity determination unit 133 rearranges the display order of the candidate words following “Shanghai” based on their relevance to the words located ahead of the input determined character “Shanghai”. Here, however, no kanji or words are located ahead of “Shanghai”, so the display order is not changed on the basis of relevance to a forward character string.
  • When “traffic” is selected by the user from among the candidate words displayed by the above procedure, the character conversion device 1 next determines the display order so that candidate words that directly follow the input determined character “traffic” are displayed near the top.
  • The connectivity determination unit 133 refers to the address “npi + l” of the continuity table 152 and, based on the continuity, obtains the candidates in the order “rule” (21), “university” (16), “tool” (15), “police” (14), and so on.
  • Next, the connectivity determination unit 133 refers to the address “ppi + s” of the forward relevance table 153 and, based on the relevance to the kanji and words located ahead of “traffic” (here, “Shanghai”), rearranges the display order of the candidate words that follow “traffic”. More specifically, in the example shown in FIG. 23, according to the forward relevance table 153, the relevance of the candidate words to “Shanghai” is, in descending order, “university” (10), “rule” (5), “police” (3), “tool” (2), and so on.
  • The connectivity determination unit 133 gives priority to the relevance of the candidate words and changes the display order to descending order of relevance.
  • As a result, the display order of the candidate words following “Shanghai traffic” is changed from “rule”, “university”, “tool”, “police”, and so on, to “university”, “rule”, “police”, “tool”, and so on.
  • Alternatively, the display order may be changed based on the sum of the continuity and the relevance.
  • The display order may also be changed based on the average value or the maximum value of the continuity and the relevance.
  • In these cases, an evaluation value calculation rule specifying, for example, that the sum of the continuity and the relevance is calculated as the evaluation value is stored in the recording unit 40. In accordance with this rule, the candidate order determination unit 32 calculates the sum of the continuity and the relevance as the evaluation value and determines the display order based on that evaluation value.
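The difference between giving priority to relevance and using the sum as the evaluation value can be checked with the values quoted for the “Shanghai traffic” example. This is a sketch only: the function names are hypothetical, and the tie-breaking behaviour (ties keep the continuity order) is an assumption of the sketch rather than something the text specifies.

```python
# Values quoted in the text for the example of FIG. 23.
CONTINUITY_AFTER_TRAFFIC = {"rule": 21, "university": 16, "tool": 15, "police": 14}
RELEVANCE_TO_SHANGHAI = {"university": 10, "rule": 5, "police": 3, "tool": 2}


def relevance_first(word: str) -> tuple:
    """Evaluation rule: relevance decides, continuity only breaks ties."""
    return (RELEVANCE_TO_SHANGHAI[word], CONTINUITY_AFTER_TRAFFIC[word])


def total(word: str) -> int:
    """Evaluation rule: sum of continuity and relevance."""
    return CONTINUITY_AFTER_TRAFFIC[word] + RELEVANCE_TO_SHANGHAI[word]


def order_candidates(rule) -> list:
    """Sort candidates by the given rule, highest evaluation value first.

    sorted() is stable even with reverse=True, so candidates with equal
    evaluation values keep the continuity order of the input dictionary.
    """
    return sorted(CONTINUITY_AFTER_TRAFFIC, key=rule, reverse=True)


by_relevance = order_candidates(relevance_first)
by_sum = order_candidates(total)
```

With these particular values the sum rule ties “rule” with “university” (26 each) and “tool” with “police” (17 each), so the result depends on how ties are resolved; an average or maximum rule could be swapped in by passing a different function to `order_candidates`.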
  • the above-mentioned candidate order determining elements are combined to determine the display order of the extracted candidates.
  • the present invention is not limited to the combination described below, and the display order can be determined by appropriately combining the above-described respective elements.
  • FIG. 24 is a diagram showing a classification procedure of candidates when the candidate order determination unit 32 determines the display order.
  • the candidate order determination unit 32 first classifies the candidates into several groups for each display order according to a predetermined condition (the above-mentioned elements A to G and their combination), and the final display order in that group. Decide.
  • The candidate words G extracted by the candidate extraction unit 31 are first stored in the extraction candidate temporary storage unit 51 (FIG. 22) of the temporary storage unit 3.
  • The learning kanji judgment unit 131 judges whether or not each candidate word is a learning word (S301).
  • The pinyin length determination unit 132 compares the pinyin length (number of alphabet characters) that had been input when the candidate words G were extracted with the pinyin length of each candidate word G (S302).
  • Among the candidate words G, those that are learning words and whose pinyin length is determined to match are classified into group A, which has the highest display order.
  • Candidate words that are learning words but whose pinyin length does not match are classified into group B, the next group below group A.
  • Candidate words that are not learning words but whose pinyin length is determined to match are classified into group C, below group B.
  • Candidate words that are not learning words and whose pinyin length does not match are classified into the lowest group D.
  • For the candidate words of group A, the learning kanji judgment unit 131 further judges whether each is a single learning word or a pair learning word (S303). In addition, the connectivity determination unit 133 determines whether or not there is continuity with the immediately preceding character string (S304).
  • Candidate words that are pair learning words and are determined to have continuity with the immediately preceding character string are classified into the topmost group A1.
  • Candidate words that are pair learning words but have no continuity with the immediately preceding character string are classified into the upper group A2.
  • Candidate words that are not pair learning words but have continuity with the immediately preceding character string are classified into the lower group A3.
  • Candidate words that are not pair learning words and have no continuity with the immediately preceding character string are classified into the lowest group A4.
  • The candidate words of group B are classified similarly.
  • For the candidate words of groups C and D, the connectivity determination unit 133 determines the presence or absence of continuity with the immediately preceding character string (S305) and classifies them into the upper group C1 (D1) and the lower group C2 (D2).
  • As a result, all candidate words G are divided into the small groups A1, A2, A3, A4, B1, B2, B3, B4, C1, C2, D1, and D2, in descending order of rank.
  • Within each small group of groups A and B, the order is further determined by the learning time (element G) (more recently learned kanji rank higher). Within each small group of groups C and D, the order is determined by the appearance frequency (element A) (more commonly used words rank higher).
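The classification into small groups A1 to D2, followed by ordering within each group, can be expressed as a single sort key. The sketch below assumes that each candidate already carries the judgment results of S301 to S305 as boolean fields; the `Candidate` class, its field names, and the sample words are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    word: str
    is_learning_word: bool  # S301: recorded by single or pair learning
    pinyin_matches: bool    # S302: pinyin length equals the typed length
    is_pair_learning: bool  # S303: recorded as a pair learning word
    is_continuous: bool     # S304/S305: continuity with the preceding string
    learned_at: int = 0     # element G, larger means more recent
    frequency: int = 0      # element A, larger means more common


def small_group(c: Candidate) -> str:
    """Return the small group name, from 'A1' (highest) to 'D2' (lowest)."""
    if c.is_learning_word:
        main = "A" if c.pinyin_matches else "B"
        sub = {(True, True): 1, (True, False): 2,
               (False, True): 3, (False, False): 4}[(c.is_pair_learning, c.is_continuous)]
    else:
        main = "C" if c.pinyin_matches else "D"
        sub = 1 if c.is_continuous else 2
    return f"{main}{sub}"


def sort_key(c: Candidate) -> tuple:
    group = small_group(c)
    # Groups A and B are ordered by learning time, C and D by appearance frequency.
    within = c.learned_at if group[0] in "AB" else c.frequency
    return (group, -within)


def display_order(candidates: list) -> list:
    return [c.word for c in sorted(candidates, key=sort_key)]


sample = [
    Candidate("w_d2", False, False, False, False, frequency=90),
    Candidate("w_c1_rare", False, True, False, True, frequency=10),
    Candidate("w_c1_common", False, True, False, True, frequency=50),
    Candidate("w_a3", True, True, False, True, learned_at=7),
    Candidate("w_a1_old", True, True, True, True, learned_at=3),
    Candidate("w_a1_new", True, True, True, True, learned_at=9),
    Candidate("w_b2", True, False, True, False, learned_at=8),
]
ordered = display_order(sample)
```

The group names sort alphabetically in the same order as their rank, which is why a plain tuple of group name and negated within-group value is enough as the key.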
  • In the above description, learned candidate kanji/words are rearranged under a predetermined condition (that is, most recently learned first). Alternatively, words learned in the past that contain an extracted candidate kanji may be arranged after that candidate kanji as candidate words.
  • Conversely, words learned in the past may be arranged so that they are displayed ahead of the candidate kanji. If multiple words used in the past contain the candidate kanji, they may be arranged in ascending order of the number of characters. Alternatively, they may be arranged in the order in which they were learned, or in the reverse of that order.
  • (Modification 3: Input of the Four Tones)
  • The A/P correspondence table 141 shown in FIG. 15 may be created so as to include tone codes (for example, as in PC input, the first to fourth tones are represented by 1 to 4 and the neutral tone by 5). More specifically, in the A/P correspondence table 141 and the P/C correspondence table 142 shown in FIG. 15, the pinyin “shang” is associated with the kanji “upper”.
  • The pinyin of “upper” including its tone is “shang” in the third tone.
  • Therefore, the entry “shang” corresponding to “cp— si” in the A/P correspondence table 141 should be written as “shang3”. The pinyin with its tone, “shang3”, can then be associated with “upper”.
  • The character conversion device 1 of the present invention may further be configured so that priority information is added to the candidate order determination elements described above and the user can arbitrarily set the conditions (evaluation value calculation rule) for determining the display order of candidate kanji (words). Because the user can set these conditions arbitrarily, the multifunction printer equipped with the character conversion device 1 can display candidates in a display order that matches the user's preference. As a result, the input operation for obtaining a desired kanji can be simplified.
  • In addition to the configuration of Embodiment 1 described above, the character conversion device 1 in the present embodiment includes a candidate order determination element setting unit (candidate order determination element setting means) 35 in the character input control unit 30, and further includes a candidate order determination element recording unit 47 in the recording unit 40 (FIG. 12).
  • The evaluation value calculated for each candidate is temporarily stored in the evaluation value temporary storage unit 52 shown in FIG. 22, and the display order is determined based on the evaluation values. The higher the priority of a candidate order determination element set by the candidate order determination element setting unit 35, the larger its influence on the calculation of the evaluation value.
  • The candidate order determination elements set by the candidate order determination element setting unit 35 and their priorities are recorded in the candidate order determination element recording unit 47.
  • FIG. 25 is a diagram showing an example of a data structure for storing evaluation values of display order calculated for each candidate.
  • The evaluation value for each candidate is expressed in 16 bits. That is, values of “0” or “1” indicating the characteristics (elements A to G) of a candidate kanji (word) are stored in the 16 areas of the data structure shown in the figure.
  • Each bit position corresponds to the magnitude of the evaluation value: the higher the bit, the higher-order digit of the evaluation value it represents. Therefore, the judgment result for a candidate order determination element with high priority is reflected in an upper bit.
  • The evaluation value calculation rule specifies how to calculate the value “0” or “1” from each candidate order determination element of a candidate kanji (word), and also defines the bit position of the evaluation value in which the calculated value is to be reflected.
  • the candidate order determination element recording section 47 is for recording priority information of candidate order determination elements for determining the display order of candidates.
  • (A) of FIG. 26 and (b) of FIG. 26 are diagrams showing an example of priority information recorded in the candidate order determining element recording section 47.
  • the priority information includes, for each candidate order determination element, bit position information indicating which bit position of the data structure of the evaluation value the determination result on the candidate order determination element is to be reflected.
  • For example, the determination result for element B is set to be stored in the area of bit position 12 shown in the figure. That is, when the pinyin length matches, “1” is stored in the area of bit position 12 as the evaluation value; otherwise, “0” is stored. A candidate order determination element that is assigned a higher bit position has a higher priority and a larger influence on the evaluation value of a candidate.
  • In this example, the evaluation value is expressed in 16 bits and is calculated using the area of bit positions 0 to 12, but the present invention is not limited to this example.
  • It is also possible to make the bit length sufficient to accommodate them.
  • For example, although the appearance frequency is expressed in 8 bits here, 16 bits may be secured to accentuate differences in appearance frequency, or the width may be reduced to, for example, 4 bits so as not to emphasize differences in frequency.
  • The empty bit positions 13 to 15 may also be used.
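  • The bit layout described above can be sketched in Python as follows. This is a minimal illustration, not the patented implementation: the function name `pack_evaluation` and the dictionary `FLAG_BITS` are hypothetical, and the layout (bits 0 to 7 for the appearance frequency or learning time, bits 8 to 12 for elements D, C, F, E, and B) is taken from the default assignment described for this embodiment.

```python
# Assumed default layout: bits 0-7 hold the appearance frequency (or the
# learning time); bits 8-12 hold one 0/1 determination result per element.
FLAG_BITS = {"D": 8, "C": 9, "F": 10, "E": 11, "B": 12}


def pack_evaluation(score: int, flags: dict) -> int:
    """Pack an 8-bit score and per-element 0/1 results into a 16-bit value."""
    if not 0 <= score <= 255:
        raise ValueError("score must fit in 8 bits")
    value = score
    for element, bit in FLAG_BITS.items():
        if flags.get(element, False):
            value |= 1 << bit  # set the bit assigned to this element
    return value & 0xFFFF
```

  • Because element B occupies the highest used bit, a candidate for which only element B holds outranks a candidate for which every lower element holds, regardless of the 8-bit score.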
  • The candidate order determination element setting unit 35 sets the priority of the candidate order determination elements as specified by the user via the input unit 11 (FIG. 12), and stores the priority information in the candidate order determination element recording unit 47.
  • The display unit 12 of the operation panel 2 displays an operation screen prompting the user to specify the priority of the candidate order determination elements. For example, along with a message such as “Specify the elements (A to G) in the order in which you want them prioritized”, a list of the candidate order determination elements whose priority can be set may be displayed.
  • A message asking whether to confirm the setting is then displayed. For example, a message such as "Please press the Enter key to make a change." may be displayed.
  • In accordance with the input instruction signal, the candidate order determination element setting unit 35 allocates bit positions in ascending order, starting from the lowest bit position for the candidate order determination element given the lowest priority by the user, and changes the bit position information indicating where the determination result of each candidate order determination element is stored.
  • For example, if the user specifies element F (pair learning kanji/word) as the candidate order determination element with the highest priority, followed in order by element E, element B, element C, element D, and element A/G, the priority information is changed as shown in (b) of FIG. 26. In the example shown in (b) of FIG. 26, the determination result of element F is stored in the area of bit position 12.
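  • The allocation rule described above (lowest priority receives the lowest bit position) can be sketched as follows. The function name `assign_bit_positions` and the parameter `lowest_bit` are hypothetical; the sketch assumes that element A/G keeps bits 0 to 7 and that only the one-bit elements are reassigned.

```python
def assign_bit_positions(priority_order, lowest_bit=8):
    """Map elements, listed from highest to lowest priority, to bit positions.

    The lowest-priority element receives lowest_bit, the next one
    lowest_bit + 1, and so on, so the highest priority ends up on top.
    """
    positions = {}
    for offset, element in enumerate(reversed(priority_order)):
        positions[element] = lowest_bit + offset
    return positions


# User-specified order from the example: F, E, B, C, D
# (element A/G is assumed to stay in bits 0-7).
layout = assign_bit_positions(["F", "E", "B", "C", "D"])
```

  • With this order, element F is mapped to bit position 12, which agrees with the example of (b) of FIG. 26.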
  • Each part (FIG. 22) of the candidate order determination unit 32 handles its determination result in accordance with the priority information of the candidate order determination elements recorded in the candidate order determination element recording unit 47.
  • That is, each determination result is stored at the designated bit position of the evaluation value in the evaluation value temporary storage unit 52.
  • As a result, the evaluation value for determining the display order can be calculated based on the candidate order determination elements that the user wants to prioritize, so the candidates can be displayed in an order that more closely matches the user's preference.
  • The priority information of the candidate order determination elements may be set separately for candidate kanji and candidate words, or a single setting may be applied to the display order of both candidate kanji and candidate words.
  • If the priority information can be set separately for candidate kanji and candidate words, a single call of the priority setting menu may let the user change the priorities for candidate kanji and then for candidate words in sequence. Alternatively, the user may select a priority setting menu for candidate kanji or for candidate words and change the priorities there.
  • A setting end key may be provided, and the priority information may be changed according to the content entered up to the time the setting end key is pressed. For example, when the setting end key is pressed partway through changing the priority of the candidate order determination elements, the device may reflect only the candidate order determination elements changed up to that point rather than the whole change in priority.
  • The candidates extracted by the candidate extraction unit 31 are temporarily stored in the extraction candidate temporary storage unit 51.
  • In the evaluation value temporary storage unit 52, a 16-bit area (FIG. 25) representing the evaluation value is secured for each stored candidate.
  • For each stored candidate, either the learning time determination unit 135 calculates the learning time or, otherwise, the appearance frequency determination unit 134 calculates the appearance frequency.
  • The appearance frequency is the frequency of use determined statistically for each character and word from a large volume of document character data such as newspapers and magazines; the higher the frequency, the more commonly used the character or word is assumed to be. In this embodiment, since 8 bits are secured for the appearance frequency, the evaluation value of a candidate based on the appearance frequency can be set between 0 and 255.
  • The learning time can likewise be set between 0 and 255 so that a more recently learned candidate has a higher evaluation value.
  • The learning time indicates when the candidate kanji (word) was learned. The evaluation value calculation rule is explained concretely as follows. For example, assuming that the learning time is X, the maximum number of learned entries is N, the learning time of the learned kanji or word is n, and the current learning time is m, the learning time X for any kanji or word is determined by
  • If the value of X in the above equation exceeds the bit length range (8 bits in this embodiment) secured for the evaluation value of the learning time, the value of X may be numerically converted to fit the bit length range, or the bit length may be changed.
  • Bit positions 8 to 12 store the determination results for elements D, C, F, E, and B, respectively.
  • These determination results are stored by the learning kanji determination unit 131, the pinyin length determination unit 132, and the connection degree determination unit 133.
  • The candidate order determination unit 32 rearranges the candidates in descending order of the evaluation values stored in the evaluation value temporary storage unit 52, and stores the final display order in the display order temporary storage unit 53. Finally, the display control unit 22 (FIG. 12) displays the candidates on the display unit 12 based on the display order stored in the display order temporary storage unit 53.
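  • The final ordering step can be sketched as a sort on the packed evaluation value. The names `Candidate` and `display_order` are hypothetical, and the sketch assumes that a larger evaluation value means an earlier display position, as implied by placing high-priority elements in the upper bits.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    text: str        # candidate kanji or word
    evaluation: int  # 16-bit evaluation value


def display_order(candidates):
    """Return candidate texts in descending order of evaluation value."""
    # sorted() is stable, so candidates with equal values keep extraction order.
    ranked = sorted(candidates, key=lambda c: c.evaluation, reverse=True)
    return [c.text for c in ranked]
```

  • Because the priority is encoded in the bit positions, a single integer comparison is enough; no per-element comparison logic is needed at sort time.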
  • The character conversion device 1 of the present invention is not limited to this.
  • The input unit 11 (FIG. 12) of the operation panel 2 may also have a function for inputting handwritten characters, with the input/output control unit 20 recognizing the handwritten characters, so as to constitute a character conversion device 1 for inputting Chinese characters.
  • The input unit (handwritten character input unit) 11 for inputting handwritten characters can be realized, for example, by displaying a frame for inputting handwritten characters in the display area of the display-integrated operation panel 2.
  • The character conversion device 1 includes, in the input/output control unit 20, a character recognition unit (character recognition means; not shown) that recognizes the writing information (coordinate data) of characters written by the user in the character input frame.
  • The recording unit 40 further includes a recognition dictionary (not shown) used for matching against the input character pattern recognized by the character recognition unit. The display control unit 22 is configured to output the recognition result of the input character.
  • The operation panel 2 detects the coordinate information 60 (FIG. 27) of the written character and stores it in the internal buffer of the input/output control unit 20.
  • The character recognition unit matches the coordinate information stored in the internal buffer against the dictionary patterns of the recognition dictionary (for example, by DP matching), and obtains the similarity (for example, a distance) between the input pattern and each dictionary pattern.
  • The character code corresponding to the dictionary pattern and the similarity are associated with each other and stored in the internal buffer.
  • After matching the input pattern against every pattern of the recognition dictionary, the character recognition unit rearranges the recognition results in the internal buffer in descending order of similarity. The recognition results are then supplied to the candidate extraction unit 31 as candidate kanji (words).
  • For example, the operation panel 2 detects the coordinate data of “external character 2” and stores it in the internal buffer (FIG. 27).
  • The character recognition unit performs DP matching between the coordinate data of “external character 2” in the internal buffer and each dictionary pattern of the recognition dictionary, calculates the distances, and rearranges the results in ascending order of distance (FIG. 29).
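  • The matching and ranking steps can be illustrated with a small dynamic-programming distance. The text does not specify the features, the local distance, or any path constraints of the DP matching, so this sketch substitutes a plain DTW-style alignment over raw (x, y) points with Euclidean local distance; `dp_distance` and `recognize` are hypothetical names, and both point sequences are assumed to be non-empty.

```python
import math


def dp_distance(a, b):
    """DTW-style dynamic-programming distance between two point sequences."""
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # local Euclidean distance
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def recognize(stroke, dictionary):
    """Return (character code, distance) pairs in ascending order of distance."""
    results = [(code, dp_distance(stroke, pattern)) for code, pattern in dictionary.items()]
    return sorted(results, key=lambda r: r[1])
```

  • Sorting by ascending distance is equivalent to sorting by descending similarity, so the first entry of the returned list is the most likely recognition candidate.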
  • The input/output control unit 20 transfers the recognition result to the display control unit 22.
  • The display control unit 22 determines the display positions of n characters (here, kanji) as recognized character candidates based on the recognition result, and displays the n recognized character candidates at those display positions on the display unit 12 of the operation panel 2 ((b) of FIG. 28). The operation panel 2 associates each displayed recognized character candidate with its display position and stores them in the internal buffer.
  • Although a multifunction printer has been taken as an example, the application of the character recognition unit of the character conversion device 1 for recognizing handwritten characters is not limited to this.
  • The above-described character recognition unit can be applied to any device provided with a display-integrated touch panel. Therefore, the character conversion device of the present invention, which allows a desired kanji or word to be input by handwriting, can be realized on any device other than the above-mentioned multifunction printer.
  • The character conversion device 1 of the present invention is not limited to the above configuration.
  • For example, the character conversion device 1 of the present invention may be configured as a character conversion device for inputting the traditional characters mainly used in Taiwan.
  • In the following, a character conversion device 1 is described that accepts zhuyin symbols (zhuyin zimu) as phonetic characters and presents traditional kanji, as ideographic characters, to the user as candidate kanji (words).
  • FIG. 30 is a block diagram showing the main configuration of the character conversion device 1 according to the present embodiment.
  • The character conversion device 1 shown in FIG. 30 differs from those in FIG. 12 and FIG. 22 in the following points: the input unit 11 is provided with keys for inputting zhuyin symbols, each key being associated with one zhuyin symbol;
  • a zhuyin symbol input acceptance unit 24 is provided for accepting the input zhuyin symbols and supplying them to the candidate extraction unit 31; and the kanji dictionary unit 41 contains a correspondence table that records the correspondence between zhuyin readings and the traditional characters having those readings.
  • FIG. 31 is a diagram showing an example of the operation screen during the phonetic character input process for traditional character input in the present embodiment.
  • The display-integrated operation panel 2 in the present embodiment may display zhuyin symbols in addition to the alphabet (character input area 121). For traditional Chinese input, it is common to enter zhuyin symbols instead of the alphabet.
  • The zhuyin symbols are assigned to the keys of the keyboard, so that the zhuyin reading corresponding to a traditional character can be input.
  • FIG. 32 is a diagram showing an example of a kanji dictionary (in Japanese) recorded in the kanji dictionary unit 41 (FIG. 30) according to the present embodiment.
  • FIG. 33 is a diagram showing an example of a word dictionary recorded in the word dictionary unit 42 in the present embodiment.
  • FIG. 34 is a diagram showing an example of the connectivity degree table 150a recorded in the connectivity degree recording unit 46 in the present embodiment.
  • the calculation of the learning order Y may be performed by the learning time determination unit 135 shown in FIG.
  • The learning number is a unique number assigned to each newly learned kanji; by recording this number together with the learned kanji as a learning counter, it is possible to know when the kanji was learned. Assuming that 500 learned kanji (words) can be recorded in the learning dictionary recording unit 43 (or the pair learning dictionary recording unit 44) of FIG. 21, learning numbers from 0 to 499 may be prepared.
  • The learning time determination unit 135 may round the learning order Y obtained for each candidate, in order to save the time and effort of rearranging the candidates.
  • The learning order can be rounded according to the following procedure.
  • Among the learned kanji recorded in the learning dictionary recording unit 43, search for a learned kanji whose learning counter is equal to the current learning number. Delete the kanji at the address ID of the retrieved learned kanji and set the new learned kanji there. If there is no learned kanji whose learning counter is equal to the current learning number, record both the learned kanji and the learning counter at the ID position. Then, add 1 to the learning number.
  • The learning control unit 34 overwrites “learning”, which is recorded together with learning counter 0, with the new learned kanji “large”. At this time, learning counter 0 is recorded in association with “large”. Subsequently, the learning number is updated to 2 as described above.
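  • The recording procedure above behaves like a fixed-capacity store in which each slot is identified by its learning counter. The following sketch illustrates that reading; the class `LearningDictionary` is hypothetical, and the wrap-around of the learning number at the capacity is an assumption inferred from the statement that learning numbers 0 to 499 are prepared for 500 entries.

```python
class LearningDictionary:
    """Fixed-capacity store of learned kanji keyed by a learning counter."""

    def __init__(self, capacity=500):
        self.capacity = capacity
        self.entries = []         # each entry is [kanji, learning_counter]
        self.learning_number = 0  # number to assign to the next learned kanji

    def learn(self, kanji):
        for entry in self.entries:
            if entry[1] == self.learning_number:
                entry[0] = kanji  # overwrite the entry that holds this counter
                break
        else:
            # No entry has this counter yet: record kanji and counter together.
            self.entries.append([kanji, self.learning_number])
        # Assumption: the learning number wraps around at the capacity.
        self.learning_number = (self.learning_number + 1) % self.capacity
```

  • Under this assumption, once all slots are filled, each newly learned kanji replaces the entry that was learned longest ago.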
  • A group identifier (GId value) generated from the character codes of the kanji constituting a word is assigned to each learned word recorded in the learning dictionary recording unit 43, and the learned word is recorded in association with its GId value. Using this GId value improves the efficiency of searching the learning dictionary recording unit 43 for a target learned word.
  • Both the kanji code for uniquely identifying a kanji and the GId value are represented by 16 bits.
  • The GId value is composed of the upper 4 bits, indicating the number of characters of the word, and the lower 12 bits, indicating a numerical value obtained by a calculation on the kanji codes of the kanji constituting the word.
  • The data structure of the GId value used in the character conversion device 1 of the present invention is not limited to this.
  • First, the learning control unit 34 generates the GId value of the word “Shanghai”.
  • The GId value generated as described above summarizes a word of any number of characters (if the number of characters is represented by 4 bits as in the above example, a word may have up to 8 characters) into a fixed length (for example, 16 bits). Since the search can therefore be executed after narrowing down the target words by the fixed-length GId value, the search efficiency for learned words can be improved.
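  • The structure of the GId value (4-bit character count plus a 12-bit value computed from the kanji codes) can be sketched as follows. The text does not state which calculation produces the lower 12 bits, so the XOR fold used here is purely an assumption for illustration, as is the use of Unicode code points in place of the device's 16-bit kanji codes; `make_gid` and `find_word` are hypothetical names.

```python
def make_gid(word: str) -> int:
    """Build a 16-bit group identifier: 4-bit length + 12-bit folded code."""
    count = len(word)
    if not 1 <= count <= 0xF:
        raise ValueError("character count must fit in the 4-bit field")
    folded = 0
    for ch in word:
        folded ^= ord(ch)  # assumed calculation: XOR of the character codes
    return (count << 12) | (folded & 0x0FFF)


def find_word(entries, word):
    """Search (gid, word) pairs, comparing strings only when the GId matches."""
    gid = make_gid(word)
    for entry_gid, entry_word in entries:
        if entry_gid == gid and entry_word == word:
            return entry_word
    return None
```

  • The GId value narrows the search but does not identify a word uniquely (with the XOR fold, words made of the same kanji in a different order share a GId value), which is why the full comparison is still performed on the remaining entries.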
  • The operation screen for character input processing displayed on the operation panel 2 is not limited to the example shown in FIG. 14 or FIG. Another example of the operation screen displayed on the operation panel 2 will be described.
  • The alphabet keys for inputting pinyin displayed on the operation panel 2 may use a key arrangement imitating a PC keyboard as shown in FIG. 35, or, as shown in FIG. 36, the input characters may be fitted in alphabetical order into an m × n key array.
  • A candidate character display area 122 for displaying candidate kanji may be superimposed on the determined character display area 123 and displayed ((a) to (c) in FIG. 37).
  • A second candidate character display area 122′ for displaying candidate words starting from the selected kanji may be superimposed on the candidate character display area 122 and the determined character display area 123 (FIG. 38 (a)), or may be displayed side by side with the candidate character display area 122 (FIG. 38 (b)).
  • One embodiment of the present invention will be described below with reference to FIGS. 1 to 11.
  • In the following, a device that accepts input of the reading (phonetic characters) of kanji (conversion characters, that is, characters after conversion) and determines a word containing the kanji is used as an example, but the present invention is not limited thereto. That is, the embodiments described below are merely examples of implementing the present invention and do not limit its implementation.
  • FIG. 1 is a block diagram showing a configuration of character conversion device 1 according to the present embodiment.
  • The character conversion device 1 includes an input unit 11, a dictionary search unit 310, a display control unit 22, a kanji dictionary unit (conversion character dictionary) 41, a word dictionary unit (word dictionary) 42, a temporary storage unit 3, a control unit 10, and a display unit 12.
  • The reference numerals attached to the constituent elements in FIG. 1 correspond to the reference numerals attached to the constituent elements in FIG. 12, FIG. 16, FIG. 21, FIG. 22 and FIG. Identical reference numerals denote identical elements.
  • The character conversion device 1 shown in FIG. 1 may also be equipped with the components shown in FIG. 12, FIG. 16, FIG. 21, FIG. 22, and FIG. 30 that are not shown in FIG. 1.
  • The input unit 11 includes a keyboard and the like, and is configured so that the user can input a desired input character string (phonetic characters), the selected kanji and selected words described later, and so on. The input unit 11 is also configured to allow the user to input various instructions to the character conversion device 1.
  • The input unit 11 and the display unit 12 may be combined as the operation panel 2 (touch panel).
  • The display control unit 22 displays the kanji, words, and their candidates found by the dictionary search unit 310 on the display unit 12 in a selectable manner, based on the reading input from the input unit 11.
  • the display unit 12 displays various data under the control of the display control unit 22.
  • the candidate extraction unit 31 includes a dictionary search unit 310.
  • The dictionary search unit 310 searches each dictionary unit (the kanji dictionary unit 41 and the word dictionary unit 42) stored in the storage unit 40, and detects candidate kanji (or words).
  • The dictionary search unit 310 further includes a kanji acquisition unit (head conversion character acquisition means) 311 and a word acquisition unit (word acquisition means) 312.
  • The kanji acquisition unit 311 takes as input the character string of the reading accepted by the input acceptance unit (acceptance means) 21, searches the kanji dictionary unit 41, and stores the kanji matching the desired conditions (or the kanji information described below) in the temporary storage unit 3 as a search result.
  • The word acquisition unit 312 takes as input the search result of the kanji acquisition unit 311 and the character string of the reading accepted by the input acceptance unit 21, searches the word dictionary unit 42, and stores the words matching the desired conditions (or the word information described below) in the temporary storage unit 3 as a search result. Details of kanji acquisition and word acquisition will be described later.
  • The kanji dictionary unit 41 holds kanji correspondence information 144 in which a reading and a kanji are associated with each other.
  • It also holds a kanji index (kanji index table) 145, in which the first character of a reading is associated with the address information of the storage destination, in the kanji correspondence information 144, of the kanji whose reading begins with that character.
  • The kanji index 145 is arranged in order of kanji reading.
  • The word dictionary unit 42 holds word correspondence information 162 in which a word and the reading of the word are associated with each other. It also holds a word index (word index table) 161, in which a head character that can be the head of a word is associated with the address information of the storage destination, in the word correspondence information 162, of the words having that head character. The word correspondence information 162 and the word index 161 are grouped by the first character of the word.
  • the kanji correspondence information 144 stores auxiliary information in association with kanji.
  • the word correspondence information 162 stores auxiliary information in association with a word.
  • The auxiliary information includes, for example, the number of characters of the reading (for example, the pinyin length (element B) in each of the embodiments described above) (search auxiliary information) and the appearance frequency (element A described above) (display auxiliary information).
  • the above auxiliary information is a numerical value obtained by converting the number of characters and the frequency of occurrence of reading into a predetermined range, for example, 1 to 100.
  • kanji information refers to kanji and information (reading and supplementary information) associated with the kanji.
  • word information refers to a word and information (reading and supplementary information) associated with the word.
  • auxiliary information is, of course, not limited to the above, and may be other information.
  • The candidate order determination elements (element C to element G) described in each of the above-described embodiments may be stored as auxiliary information in association with the candidate (kanji/word).
  • The control unit 10 centrally controls the operation of the various components of the character conversion device 1. For example, it counts the number of characters of the character string input from the input unit 11. It also controls the dictionary search unit 310 and the display control unit 22. In addition, it includes the input acceptance unit 21, which receives the character string input by the user via the input unit 11. Next, the processing operation of the character conversion device 1 will be described using the flowchart shown in FIG.
  • The control unit 10 counts the number of characters of the reading character string transferred from the input unit 11, and stores the reading character string and the number of characters in the temporary storage unit 3. Then, the control unit 10 instructs the dictionary search unit 310 to search for kanji (S101).
  • The kanji acquisition unit 311 of the dictionary search unit 310 reads the reading character string and its number of characters from the temporary storage unit 3 into an internal buffer (not shown) of the dictionary search unit 310.
  • The kanji acquisition unit 311 refers to the kanji index 145, determines the address (iNum) of the end position of the search target range in the kanji correspondence information 144 based on the first character of the reading character string, and holds it in the internal buffer (S102). It also obtains the address (iStart) of the start position of the search target range in the kanji correspondence information 144 and holds it in the internal buffer (S103).
  • The kanji acquisition unit 311 reads kanji information by referring to the kanji correspondence information 144 at the address held in the internal buffer of the dictionary search unit 310, and compares the reading in the kanji correspondence information 144 with the reading of the input character string (S104).
  • If the reading of the input character string matches the reading in the kanji correspondence information 144 (YES in S104), the kanji acquisition unit 311 stores the kanji information satisfying the reading condition in the temporary storage unit 3 (S105).
  • iStart is then updated (S106). Thereafter, the kanji acquisition unit 311 compares the iNum stored in the internal buffer with the iStart updated in S106 (S107). If, as a result of the comparison, iStart is greater than or equal to iNum (NO in S107), the kanji acquisition unit 311 notifies the control unit 10 that the kanji search is finished.
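  • Steps S104 to S107 amount to a linear scan over the address range bounded by iStart and iNum. The sketch below illustrates this under simplifying assumptions: the kanji correspondence information is modelled as a Python list of dictionaries, addresses are list indices, and the loop condition is tested before each read rather than after the update as in the flowchart. The function name `search_kanji` and the field names are hypothetical.

```python
def search_kanji(correspondence, i_start, i_num, reading):
    """Collect entries in [i_start, i_num) whose reading matches the input."""
    results = []
    while i_start < i_num:               # S107: finish once iStart >= iNum
        entry = correspondence[i_start]  # read kanji information at the address
        if entry["reading"] == reading:  # S104: compare readings
            results.append(entry)        # S105: keep the matching kanji information
        i_start += 1                     # S106: update iStart
    return results
```

  • Because the index limits the scan to entries sharing the first character of the reading, only a small part of the kanji correspondence information is examined for each input.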
  • When the control unit 10 receives the notification from the dictionary search unit 310 that the kanji search is completed, it rearranges the kanji search results stored in the temporary storage unit 3 in a predetermined order (S108), adds display position information, and stores them in the temporary storage unit 3.
  • In addition, the display control unit 22 displays the kanji search results on the display unit 12 (S109).
  • When sorting in the predetermined order, the results may be sorted in order of the appearance frequency in the auxiliary information, or, as described in the above embodiments, the candidate order determination unit 32 may calculate evaluation values based on the candidate order determination elements (and the evaluation value calculation rule) and the results may be rearranged based on those values.
  • The control unit 10 compares the position information received by the input acceptance unit 21 with the display position information of the kanji search results stored in the temporary storage unit 3, and detects the input kanji (S 11). The control unit 10 then transfers the kanji designated by the user to the dictionary search unit 310 and instructs it to perform a word search.
  • At this time, the candidate selection reception unit 23 may detect the kanji selected by the user, and the learning control means may store it as a learned kanji in the learning dictionary recording unit 43 or the pair learning dictionary recording unit 44.
  • When the word acquisition unit 312 of the dictionary search unit 310 receives a word search instruction, it refers to the word index 161 and, based on the identification information (character code) of the acquired kanji, determines the address (iNum) of the end position of the search target range in the word correspondence information 162 and holds it in the internal buffer (S112). It also obtains the address (iStart) of the start position of the search target range in the word correspondence information 162 and holds it in the internal buffer (S113).
  • The word acquisition unit 312 reads word information by referring to the word correspondence information 162, and stores the word information matching the predetermined condition in the temporary storage unit 3 (S114).
  • The word acquisition unit 312 updates iStart for the words to be searched (S114a), holds it in the internal buffer, and compares iNum with the iStart updated in S114a (S115). If, as a result of the comparison, iStart is greater than or equal to iNum (NO in S115), the word acquisition unit 312 notifies the control unit 10 that the word search has ended.
  • When the control unit 10 receives the notification from the dictionary search unit 310 that the word search is completed, it rearranges the word search results stored in the temporary storage unit 3 in a predetermined order (S116), adds display position information, and stores them in the temporary storage unit 3.
  • The display control unit 22 displays the word search results on the display unit 12 (S117). When sorting in the above-mentioned predetermined order, the results may be sorted in order of the appearance frequency in the auxiliary information, or, as described in each of the above-mentioned embodiments, the candidate order determination unit 32 may calculate evaluation values based on the candidate order determination elements (and the evaluation value calculation rule) and the results may be rearranged based on those values.
  • The user visually checks the word search results displayed on the display unit 12, selects the desired word, and designates the position where that word is displayed.
  • The input unit 11 transmits the position information of the input position to the input acceptance unit 21 of the control unit 10. At this time, the candidate selection reception unit 23 may detect the kanji selected by the user, and the learning control means may store it as a learned kanji in the learning dictionary recording unit 43 or the pair learning dictionary recording unit 44.
  • The control unit 10 compares the position information received by the input acceptance unit 21 with the display position information of the word search results stored in the temporary storage unit 3 to identify the word specified by the user. Then, the display control unit 22 displays the identified word on the display unit 12.
  • As described above, the kanji dictionary unit 41 and the word dictionary unit 42 are provided with indexes together with the kanji correspondence information 144 and the word correspondence information 162, and words with the same leading kanji are grouped together.
  • Auxiliary information is added to the kanji correspondence information and the word correspondence information, and the search time can be shortened by limiting the kanji and words to be searched based on the auxiliary information.
  • The display order of the retrieved kanji (words) is determined by the candidate order determination unit 32 based on the auxiliary information (candidate order determination elements) of the candidates, so that those more likely to be selected by the user are displayed with priority. Therefore, candidate kanji (words) rearranged in the order desired by the user can be displayed in a shorter time. As a result, user convenience can be improved.
  • The character conversion device 1 described above converts input pinyin (phonetic characters) into Chinese characters (conversion characters).
  • The kanji index 145 is as shown in FIG. 3.
  • In the kanji index 145, the first character of pinyin (the first phonetic character) is associated with the kanji address (storage destination information), in the kanji correspondence information 144, of the kanji whose pinyin (phonetic information) begins with that character.
  • The description on the left of the kanji index 145 (cidx-m1, cidx-m2, etc.) is an address in the kanji index 145.
  • For example, cidx-m1 is the address in the kanji index 145 for readings that begin with m and are in the first tone.
  • In the kanji correspondence information 144, a kanji and its pinyin are associated with auxiliary information (in FIG. 4, search conditions and frequency information).
  • The description (cp-m1, cp-m2, etc.) on the left of the kanji correspondence information 144 in FIG. 4 is an address in the kanji correspondence information 144.
  • The word index 161 has a primary index (word primary index table) 161a shown in FIG. 5 and a secondary index (word secondary index table) 161b shown in FIG. 6.
  • In the primary index 161a, a kanji and its code are associated with the address, in the secondary index 161b, of the words having that kanji at the head.
  • In the secondary index 161b, the head kanji and the first pinyin letter of the second kanji are associated with an address in the word correspondence information 162.
  • The description on the left of the secondary index 161b (widx-di, widx-gi, etc.) in FIG. 6 is an address in the secondary index 161b.
  • In the word correspondence information 162, pinyin is associated with a word as its reading.
  • The description (wp-di, wp-gi, etc.) on the left of the word correspondence information 162 in FIG. 7 is an address in the word correspondence information 162.
  • The primary index 161a is not necessarily required.
  • although the character conversion device 1 has been described above as a Chinese conversion device that converts pinyin to kanji, the invention is not limited thereto.
  • a hiragana or katakana
  • the apparatus may be a device that determines a word including
  • the kanji acquisition unit 311 of the dictionary search unit 310 reads the reading character string "mei" from the temporary storage unit 3 into the internal buffer of the dictionary search unit 310. Then, based on the first character "m" of the reading, the dictionary search unit 310 obtains the addresses covering the range of "m" in the kanji index 145 according to the following expression, and stores the kanji addresses described at those addresses in the internal buffer.
  • the address is a
  • when the kanji index (kanji index table) is arranged in alphabetical order of the first letters of pinyin, and entries with the same first letter are arranged in tone order from 1 to 5, the unit size of the kanji index is the byte length of a kanji address. In the device of this embodiment, for example, it is 4 bytes.
  • the end address is calculated as "(0x6d - 0x61 + 1) × unit size of kanji index × 5" (that is, cidx-n1 in FIG. 3), and the kanji addresses (cp-m1 and cp-n1) described in the kanji index entries to which these addresses point are stored in the internal buffer.
  • cidx-m1 in the figure represents the address in the kanji index 145 for first-tone readings beginning with "m".
  • the corresponding cp-m1 represents the address, in the kanji correspondence information 144, of the first-tone kanji whose reading begins with "m". cidx-m5 and cp-m5 correspond to readings that begin with "m" and have the neutral (light) tone.
  • the dictionary search unit 310 sets the address cpi to be searched to cp-m1. Next, the dictionary search unit 310 reads the kanji information at the address cpi from the kanji correspondence information 144 and evaluates the search condition based on the number of input pinyin characters. If the search condition is satisfied, the input reading is compared with the reading of the kanji information; if they match, the kanji information is regarded as a match and stored in the temporary storage unit 3.
  • cpi only.
  • the search condition is as follows.
  • the address range for the third tone of "m" is calculated by the above equation (1): the start address is "(0x6d - 0x61) × unit size of kanji index × 5 + (3 - 1) × unit size of kanji index" (that is, cidx-m3 in FIG. 3), and the end address is "(0x6d - 0x61) × unit size of kanji index × 5 + 3 × unit size of kanji index" (that is, cidx-m4 in FIG. 3). The kanji addresses (cp-m3 and cp-m4) described in the kanji index entries to which these addresses point are stored in the internal buffer.
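The address arithmetic quoted above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `index_range` is hypothetical, the index is assumed to hold five tone slots per letter at 4 bytes each as in the embodiment, and the start address for a whole letter is inferred from the form of the third-tone expression.

```python
UNIT_SIZE = 4   # bytes per kanji address (value given for the embodiment)
TONES = 5       # tones 1 to 4 plus the neutral tone

def index_range(initial, tone=None, unit=UNIT_SIZE):
    """Return (start, end) byte offsets into the kanji index.

    initial: first pinyin letter, 'a'..'z'
    tone:    1..5 to select one tone slot, or None for the whole letter
    """
    letter = ord(initial) - ord("a")      # 0x6d - 0x61 for "m"
    base = letter * unit * TONES          # offset of the letter's first tone slot
    if tone is None:
        # whole letter: from its first slot to the next letter's first slot
        return base, (letter + 1) * unit * TONES
    # single tone: from this tone's slot to the next tone's slot
    return base + (tone - 1) * unit, base + tone * unit
```

With these assumptions, `index_range("m")` spans the offsets labeled cidx-m1 to cidx-n1 in FIG. 3, and `index_range("m", 3)` spans cidx-m3 to cidx-m4.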
  • the user inputs the pin-in “111” of Chinese reading “extra digit 1,” the kanji search result is stored in the temporary storage unit 3, and the display control unit 22 displays it on the display device. Give instructions from “" and "”, and explain.
  • the primary index 161a of the word dictionary unit 42 is stored in ascending order of kanji code.
  • the dictionary search unit 310 searches the word index 161 for the entry that matches the transferred kanji code, and finds the search target range in the word correspondence information 162. Specifically, the dictionary search unit 310 performs a binary search of the primary index 161a for the transferred "extra character 2" and obtains the kanji code C3BF that matches "extra character 2".
  • the dictionary search unit 310 is configured to set the kanji code C3C0 next to C3BF in the primary index 161a (see FIG. 5).
  • the address (word address) "wp-mi+1" in the word correspondence information 162 of "head digit 6" is the first word, "star 2" is the head. It is stored in the internal buffer as the end address for the word correspondence information 162.
  • the word acquisition unit 312 of the dictionary search unit 310 reads out the word information from "wp-mi" to "wp-mi+1", and stores the kanji strings of the word information in the temporary storage unit 3 as the search result.
  • the candidate order determination unit 32 rearranges the word information (candidate words) in the temporary storage unit 3 in descending order of appearance frequency, which is supplementary information (not shown), and ends the word search process.
  • this rearrangement is a mere illustration and may be rearranged by other methods.
  • the display order may be determined based on the candidate order determination factors (element A to element G).
  • the retrieved and rearranged words are displayed on the display unit 12.
  • Embodiment 5-2 describes the operation when the user inputs word reading to input a word.
  • whereas in Embodiment 5-1 the reading of the first character of the character string to be input is entered, Embodiment 5-2 differs in that the reading of the whole word to be input is entered.
  • the process in which the input receiving unit 21 of the control unit 10 receives the reading character string input from the input unit 11 is the same as in Embodiment 5-1. Therefore, the present embodiment describes the processing after the user inputs the pinyin "meinian", the Chinese reading of "extra character 1", in order to input "extra character 1".
  • the control unit 10 has a function of dividing pinyin of the character string of the input reading, and is assumed to be a personal computer. The pinyin division process of the input read character string will be described later.
  • when the control unit 10 receives the reading character string "meinian" from the input unit 11, it divides the reading character string into pinyin syllables. As the division result, the syllables "mei" and "nian" and the number of divisions, 2, indicating that there are two syllables (or the number of division positions, indicating that there are two division positions) are stored in the temporary storage unit 3.
  • the kanji acquisition unit 311 of the dictionary search unit 310 reads the read character string “mei” from the temporary storage unit 3 and stores the read character string “mei” in the internal buffer of the dictionary search unit 310.
  • the kanji acquisition unit 311 refers to the kanji index 145 using the above equation (2), based on the initial letter "m" of "mei", to obtain the addresses covering the range of "m", and stores the kanji addresses described at those addresses in the internal buffer.
  • unit size of kanji index × 5" (that is, cidx-n1 in FIG. 3); the kanji index 145 of the kanji dictionary unit 41 is referenced, and the kanji addresses indicated by each address (cp-m1 and cp-n1) are stored in the internal buffer.
  • the kanji acquisition unit 311 of the dictionary search unit 310 sets the address cpi to be searched to cp-m1.
  • the kanji acquisition unit 311 reads the kanji information at the address cpi from the kanji correspondence information 144 and evaluates the search condition based on the number of input pinyin characters. If the search condition is satisfied, the input reading is compared with the reading of the kanji information.
  • if the input reading exactly matches the reading of the kanji information, the kanji information is stored in the temporary storage unit 3 in association with a value indicating an exact match (for example, 1), and cpi is updated.
  • if the input reading matches the beginning of the reading of the kanji information, it is also regarded as a match; the kanji information is stored in the temporary storage unit 3 in association with a value indicating a prefix match (for example, 0), and cpi is updated.
  • the candidate order determination unit 32 sorts the exactly matching kanji information stored in the temporary storage unit 3 by appearance frequency, then sorts the prefix-matching kanji information by appearance frequency so that it ranks below the exactly matching kanji information, and ends the kanji search processing.
  • the candidate order determination unit 32 may further determine the display order based on other auxiliary information (eg, candidate order determination factors (element A to element G)).
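The ordering rule described above (exact matches first, then prefix matches, each group by descending appearance frequency) can be sketched as a single sort. The tuple layout and the function name `rank_candidates` are assumptions made for illustration; the match values 1 and 0 follow the examples given in the text.

```python
def rank_candidates(hits):
    """Order kanji search hits for display.

    hits: list of (kanji, match_flag, frequency) tuples, where match_flag is
          1 for an exact match of the reading and 0 for a prefix match.
    Exact matches come first; within each group, higher frequency comes first.
    """
    return sorted(hits, key=lambda h: (-h[1], -h[2]))
```

Because Python's sort is stable, candidates with equal flag and frequency keep their dictionary order, which leaves room for the other candidate order determination factors to act as further tie-breakers.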
  • kanji information of “extra character 2” is stored as a search result in the temporary storage unit 3 as kanji information that completely matches “mei” for the read character string “mei”.
  • the following description assumes that the user specifies "extra character 2" from the kanji search candidates displayed after the kanji search described above.
  • the process of candidate selection in response to the instruction of “extra character 2” by the user is the same as the process described in the embodiment 5-1, and therefore the description thereof is omitted.
  • the primary index 161a, the secondary index 161b, and the word correspondence information 162 are arranged in ascending order of kanji code.
  • when the word acquisition unit 312 of the dictionary search unit 310 receives a word search instruction, it refers to the primary index of the word index 161 and searches for the first index information that matches the transferred kanji code.
  • the entries of the primary index are arranged in kanji-code order of the leading kanji of each word, and the word acquisition unit 312 finds, by binary search, the first index information that matches the transferred kanji code.
  • the first index information matching the transferred kanji code and the first index information of the next kanji code arranged in the primary index are stored in the internal buffer.
  • the secondary index address that matches the kanji code is widx-mi and the secondary index address of the next kanji code is widx-mi+1; both are stored in the internal buffer.
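As a rough sketch of the primary index lookup described above, the following Python uses the standard `bisect` module for the binary search. The list-of-tuples layout, the function name `secondary_range`, and the sample addresses are assumptions for illustration; only the kanji codes C3BF and C3C0 come from the text.

```python
from bisect import bisect_left

def secondary_range(primary, kanji_code):
    """Look up a leading kanji in the primary index.

    primary: list of (kanji_code, secondary_index_address) sorted by code.
    Returns (start, end): the address for this kanji and the address of the
    next kanji's entry, which bounds the range to scan. end is None for the
    last entry. Returns None if the code is not present.
    """
    codes = [code for code, _ in primary]
    i = bisect_left(codes, kanji_code)            # binary search by kanji code
    if i == len(codes) or codes[i] != kanji_code:
        return None
    start = primary[i][1]
    end = primary[i + 1][1] if i + 1 < len(primary) else None
    return start, end
```

For example, with the hypothetical table `[(0xC3BE, 0), (0xC3BF, 12), (0xC3C0, 20)]`, looking up `0xC3BF` yields the pair `(12, 20)`, playing the role of widx-mi and widx-mi+1.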
  • the word acquisition unit 312 of the dictionary search unit 310 reads the second and subsequent character strings of the divided read character string from the temporary storage unit 3 into the internal buffer.
  • to narrow the search range of the word correspondence information 162 using the leading kanji and the first pinyin character of the second divided reading, the dictionary search unit 310 uses the secondary index address attached to the primary index 161a to search the secondary index 161b for the entry that matches the first character "n" of the second divided reading. It then stores the second index information found and the next second index information arranged in the secondary index 161b.
  • the word acquisition unit 312 of the dictionary search unit 310 sets the address wpi to be searched to the word information address wp-mi attached to the second index information.
  • the word information from "wp-mi" to "wp-mi+k" is read out.
  • the reading in the word information of the word correspondence information 162 is written so that the portion corresponding to each kanji of the word can be identified: the first letter of the reading of each kanji is capitalized, as in "MeiNian".
  • the word acquisition unit 312 of the dictionary search unit 310 reads the word information at the address wpi from the word correspondence information 162, and compares the number of kanji in the word information with the number of divisions of the input reading. If the number of kanji in the word information is equal to or less than the number of divisions of the input reading, the reading corresponding to each kanji in the word information is compared with the divided readings from the beginning, and the number of kanji whose reading matches exactly and the number of kanji whose reading matches as a prefix are counted.
  • if the "number of kanji in the word information" equals the "number of kanji with exactly matching readings + number of kanji with prefix-matching readings", the temporary storage unit 3 stores the word information together with both counts, and wpi is updated. If the two do not match, only wpi is updated.
  • the dictionary search unit 310 ranks the word information stored in the temporary storage unit 3 under the following conditions.
  • the word search result is used. Alternatively, a word may also be included in the word search results when its number of kanji exceeds "the number of exactly matching kanji + the number of prefix-matching kanji". With this configuration, a desired word can be entered by typing the reading of only its first few characters.
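The per-kanji comparison of readings described above can be sketched as follows. This is a simplified illustration, not the patented implementation: the function names are hypothetical, the stored reading is assumed to use the capitalized form "MeiNian", and the sample readings in the usage note are chosen only to exercise each branch.

```python
import re

def split_reading(reading):
    """Split a stored reading such as "MeiNian" into ["mei", "nian"]."""
    return [s.lower() for s in re.findall(r"[A-Z][a-z]*", reading)]

def match_word(word_reading, parts):
    """Compare a word's per-kanji readings with the divided input reading.

    Returns (exact, prefix) counts when every kanji of the word matches its
    divided reading exactly or as a prefix; returns None otherwise.
    """
    syllables = split_reading(word_reading)
    if len(syllables) > len(parts):        # word has more kanji than divisions
        return None
    exact = prefix = 0
    for syllable, part in zip(syllables, parts):
        if syllable == part:
            exact += 1
        elif syllable.startswith(part):
            prefix += 1
    if exact + prefix != len(syllables):   # some kanji did not match: skip word
        return None
    return exact, prefix
```

Under these assumptions, `match_word("MeiNian", ["mei", "nian"])` counts two exact matches, while `match_word("MeiNian", ["mei", "n"])` counts one exact and one prefix match, which is what allows abbreviated input.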
  • the word search is started by the user selecting a desired kanji from the displayed kanji search results.
  • the word search may be started for the first ranked kanji candidate without displaying the kanji search result, and the search result may be displayed when the word search is completed.
  • the word search may be started for some or all of the candidates without displaying the kanji search results, and the search results may be displayed when the word search is completed.
  • the first through i-th characters are taken as the first syllable, and the longest pinyin starting from the (i+1)-th character that is valid as a final, an initial, or an initial plus a final is found and taken as the second syllable
  • the first syllable has more characters than the second syllable
  • the division processing of "meinian" is described below as an example. Whether part or all of the input pinyin is valid as pinyin is determined by comparing it with the pinyin registered in the pinyin dictionary; if a matching pinyin exists, it is regarded as valid pinyin.
  • in the pinyin dictionary, as shown in FIG. 9, valid pinyin of lengths 1 to 7 are registered.
  • input pinyin is a string input as pinyin
  • the position evaluation value calculated during division processing is a numerical value indicating how likely it is that a syllable boundary falls at a given position when determining the division position that splits the input pinyin into two syllables. This position evaluation value is calculated for each syllable break position. It is therefore distinct from the evaluation value calculated for each candidate kanji/candidate word (the evaluation value indicating how likely the candidate kanji/candidate word is to be selected by the user) used in the first to fourth embodiments described above.
  • the division position i of the input pinyin "meinian" is set to 1. The pinyin dictionary index (pinyin dictionary index table) shown in FIG. 8 is referenced with the first character "m", and the pinyin dictionary address pr-m1 for 1-character pinyin beginning with "m" and the address pr-m2 for 2-character pinyin beginning with "m" are read. The first syllable is then compared with the pinyin stored from pr-m1 to pr-m2 in the pinyin dictionary. In this case, since "m" exists, it is valid as pinyin.
  • next, "einian", the remainder excluding "m", is taken as the second syllable; the pinyin dictionary addresses pr-e6 and pr-e7 for pinyin beginning with "e" are read, and the second syllable is compared with the pinyin stored from pr-e6 to pr-e7. If there is a matching pinyin, the position evaluation value of division position 1 is 7. If there is no matching pinyin, one character is removed from the end of the second syllable to form a new second syllable, which is compared in the same way; if there is a matching pinyin, the length of the second syllable plus the length of the first syllable (1 in this case) becomes the position evaluation value of division position 1. If there is still no matching pinyin, one more character is removed from the end of the second syllable and compared with the pinyin dictionary, and this is repeated until the position evaluation value is obtained.
  • the pinyin dictionary index table is then referenced based on the first two characters "me", and "me" is compared with the two-character pinyin stored at pinyin dictionary addresses pr-m2 to pr-m3. If "me" is present, the first syllable is set to "me". Next, "inian", the remainder excluding "me", is taken as the second syllable, and the pinyin dictionary is searched in the same way for a matching pinyin.
  • the same processing is performed from the first division position up to division position 7, and the division position with the maximum evaluation value is determined. In FIG. 10, the maximum position evaluation value is 7 and only one division position has that value, so the final division position is determined to be 3.
  • the division number 2 for division position 3 means that "meinian" has its first division position between the third character i and the fourth character n. Since it is also determined that there is a second division position after the seventh character n (no character follows it), this indicates that there are two division positions.
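The position evaluation described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the pinyin dictionary is modeled as a plain set (the text uses an index keyed by first character and length), the sample entries are a hypothetical subset, and ties are broken by taking the earliest position rather than by the tie-break conditions of the original.

```python
def longest_prefix(s, pinyin_set, max_len=7):
    """Length of the longest prefix of s registered as pinyin (0 if none)."""
    for n in range(min(len(s), max_len), 0, -1):   # try the longest first
        if s[:n] in pinyin_set:
            return n
    return 0

def best_split(s, pinyin_set):
    """Return (division_position, position_evaluation_value) for input s.

    For each position i, the first i characters must be valid pinyin; the
    evaluation value is i plus the length of the longest valid pinyin that
    starts at character i + 1.
    """
    best = (0, 0)
    for i in range(1, len(s) + 1):
        if s[:i] not in pinyin_set:
            continue
        score = i + longest_prefix(s[i:], pinyin_set)
        if score > best[1]:
            best = (i, score)
    return best

# Hypothetical subset of the pinyin dictionary (initials and finals included)
SAMPLE_PINYIN = {"m", "me", "mei", "e", "ei", "n", "ni", "nian", "i", "in"}
```

With this sample set, `best_split("meinian", SAMPLE_PINYIN)` evaluates position 1 as 3 ("m" + "ei"), position 2 as 4 ("me" + "in"), and position 3 as 7 ("mei" + "nian"), so position 3 is chosen, in line with the result described for FIG. 10.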
  • for each pinyin initial character, the index is composed of seven entries, for lengths 1 to 7.
  • when the maximum length of zhuyin is 4, the index for each zhuyin initial character may be composed of four entries, for lengths 1 to 4.
  • the words of the same leading kanji are grouped, and the kanji dictionary and the word dictionary have information for restricting the search target according to the input pinyin.
  • the kanji dictionary has information as to whether or not it is a search target according to the pinyin length.
  • the word dictionary holds, in its index information, the first pinyin character of each word's second kanji. The input pinyin sequence (for n kanji) is divided, and the search target can be limited using the first kanji obtained by the kanji search and the first character of the reading of the second kanji. With such a configuration, only items that match the search conditions are searched.
  • the kanji (word) candidates obtained by the above-described search processing are displayed in order of priority, with candidates more likely to be selected by the user ranked higher, based on the auxiliary information (or candidate order determination factors) associated with each candidate. The response time until the candidates are displayed is thus shortened and the candidates appear in the order the user desires, which further improves user convenience.
  • each block of the character conversion device 1, in particular the candidate order determination unit 32, the dictionary search unit 310, the display control unit 22, and the control unit 10, may be configured by hardware logic, or may be realized by software using a CPU as follows. Specifically, the recording unit 40, the kanji dictionary unit 41, the word dictionary unit 42, and the temporary storage unit 3 are realized by a storage device such as a memory or a hard disk, and the candidate order determination unit 32, the dictionary search unit 310, the display control unit 22, and the control unit 10 are realized by a central processing unit (CPU) of a computer and software executed by the CPU. This software is also stored in the storage device, read out to the memory, and executed by the CPU.
  • the character conversion device 1 includes a CPU (central processing unit) that executes the instructions of a control program realizing each function, a ROM (read only memory) that stores the program, a RAM (random access memory) into which the program is expanded, and a storage device (recording medium) such as a memory that stores the program and various data.
  • the object of the present invention can also be achieved by supplying to the character conversion device 1 a recording medium on which the program code (executable program, intermediate code program, or source program) of the control program of the character conversion device 1, which is software realizing the functions described above, is recorded in a computer-readable manner, and by having the computer (or CPU or MPU) read out and execute the program code recorded on the recording medium.
  • Examples of the recording medium include tape systems such as magnetic tape and cassette tape; disk systems including magnetic disks such as floppy disks/hard disks and optical disks such as CD-ROM/MO/MD/DVD/CD-R; card systems such as IC cards (including memory cards)/optical cards; and semiconductor memory systems such as mask ROM/EPROM/EEPROM/flash ROM.
  • the character conversion device 1 may be configured to be connectable to a communication network, and the program code may be supplied via the communication network.
  • the communication network is not particularly limited.
  • for example, the Internet, an intranet, an extranet, a LAN, ISDN, a VAN, a CATV communication network, a virtual private network, a telephone network, a mobile communication network, or a satellite communication network can be used.
  • the transmission medium constituting the communication network is not particularly limited.
  • for example, wired media such as IEEE 1394, USB, power line carrier, cable TV lines, telephone lines, and ADSL lines, or wireless media such as infrared (IrDA or remote control), Bluetooth (registered trademark), 802.11 wireless, HDR, mobile phone networks, satellite links, and terrestrial digital networks can be used.
  • the present invention can also be realized in the form of a computer data signal embedded in a carrier wave, in which the program code is embodied by electronic transmission.
  • the character conversion device converts an input character string consisting of one or more input phonetic characters into a converted character, and can change the character character of the converted character. In the replacement
  • word correspondence information in which word identification information for identifying a word is associated with word phonetic information indicating the phonetic sound of the word; and (b-2) a head at the beginning of the word
  • the head conversion character identification information for identifying a conversion character, and the word storage destination information indicating the storage destination in the word correspondence information of the word having the head conversion character indicated by the head conversion character identification information A word dictionary having associated word indexes,
  • a word acquisition unit for acquiring the word having the conversion character indicated by the head conversion character identification information at the beginning by referring to the word dictionary based on the acquired head conversion character identification information.
  • the character conversion device may further include display control means that displays the head conversion characters indicated by the head conversion character identification information acquired by the head conversion character acquisition means on the display unit in a user-selectable manner and receives the head conversion character selected by the user, and the word acquisition means may perform the process of acquiring the word based on the head conversion character identification information of the selected head conversion character. According to the above configuration, it is possible to acquire a word based on the head conversion character selected by the user, that is, after narrowing down the head conversion characters.
  • display auxiliary information for the conversion character indicated by the conversion character identification information may further be associated with that identification information, and the display control means may rearrange the head conversion characters based on the display auxiliary information and display them on the display unit so that the user can select them.
  • the head conversion characters can be rearranged based on the display auxiliary information, and can be displayed on the display unit in a selectable manner by the user.
  • the display auxiliary information may be, for example, information on the appearance frequency of the kanji.
  • the display auxiliary information may be any other information as long as the display control means can rearrange the head conversion characters.
  • the word index of the word dictionary includes the head conversion character identification information for each head conversion character to be the head of the word;
  • the storage destination in the word correspondence information may be allocated to the combination with the second phonetic alphabet, which is the first phonetic alphabet of the letter.
  • the word acquisition means can search for the word using the combination of the head conversion character identification information and the first phonetic character of the second character of the word. Therefore, the search range can be narrowed further and a quicker response is possible.
  • acquisition auxiliary information for the conversion character indicated by the conversion character identification information may further be associated with that identification information, and the head conversion character acquisition means may acquire the conversion character identification information based on the acquisition auxiliary information.
  • the lead conversion character acquisition unit can acquire the conversion character identification information based on the acquisition auxiliary information on the conversion character indicated by the conversion character identification information.
  • the acquisition auxiliary information may be any information that assists the head conversion character acquisition means in searching the conversion character dictionary, for example, the length of the phonetic string of the head conversion character.
  • the acquisition auxiliary information is classified into the following four types.
  • the head conversion character acquisition means can acquire the head conversion character quickly and effectively.
  • those having the same conversion character at the beginning of a word may be grouped as one group.
  • the word acquisition means can search for words grouped as a group, and therefore can acquire words quickly and effectively.
  • the words may be arranged in a predetermined arrangement order of the first phonetic characters of the words.
  • since the words in each group are arranged in the predetermined order of their leading phonetic characters, the word acquisition means can acquire words more quickly.
  • the predetermined arrangement order of the phonetic characters at the beginning of the word is, for example, alphabetical order for pinyin and gojuon (fifty-sound) order for hiragana.
  • the converted characters may be kanji, and the kanji identification information (the above-mentioned converted character identification information) of kanji whose leading phonetic character and tone code are the same may be grouped as one group.
  • the head conversion character acquisition means need only search within the group for the head conversion character, so the response speed increases.
  • in the converted character correspondence information of the character conversion device used as a Chinese conversion device, the kanji identification information in each of the above groups may be arranged in a predetermined order of the leading phonetic characters of the kanji.
  • the leading converted character acquisition unit can acquire the leading converted character more quickly.
  • the first phonetic characters of the kanji may be arranged in a predetermined arrangement order, and information indicating the storage destination in the converted character correspondence information may be associated with the first phonetic character of the kanji as the converted character storage destination information.
  • the start conversion character acquisition means can narrow down the group from the storage destination in the conversion character correspondence information of the group, and can make a search with a faster response speed.
  • the kanji characters may be simplified characters and the phonetic sounds may be expressed by pinyin.
  • the above kanji characters may be traditional characters, and the above-mentioned phonetic sounds may be expressed by zhuyin.
  • the control method of the character conversion device converts an input character string consisting of one or more input phonetic characters into converted characters and determines a word consisting of the converted characters.
  • (a-1) conversion character correspondence in which conversion character identification information for identifying a conversion character and conversion character phonetic information indicating the phonetic character of the conversion character are associated Information (a-2) a storage destination of the converted character identification information having the phonogram input and the converted character identification information having the phonogram as the head of the converted character phonetic information, A converted character dictionary having a converted character index associated with the indicated converted character storage destination information, (b-1) word identification information for identifying a word, and word sound information indicating the phonetic sound of the word Word correspondence information in which and are associated, and (b 2) the point of the word Head conversion character identification information for identifying a head conversion character; word storage destination information indicating a storage destination in the word correspondence information of a word having a conversion character at the head indicated by the head conversion character
  • the conversion character identification is made by referring to the conversion character dictionary and having the first phonetic character of the input character string as the first character of the converted character phonetic information.
  • a leading conversion character acquisition step for acquiring information
  • the character conversion device may be realized by a computer.
  • a character conversion program that causes a computer to realize the character conversion device by operating the computer as each of the means of the character conversion device, and a computer-readable recording medium storing the character conversion program, also fall within the scope of the present invention.
  • a character conversion device is a character conversion device for converting an input phonetic character or phonetic character string into a corresponding ideographic character and inputting an ideographic character or ideographic character string selected by the user.
  • a character recording unit that records phonetic characters and the ideographic characters corresponding to phonetic character strings containing those phonetic characters
  • character candidate extraction means for receiving a phonetic character input by the user and extracting the ideographic characters associated with that phonetic character from the character recording unit as candidates for ideographic characters
  • Candidate order determination means for determining the display order of candidates extracted by the candidate extraction means based on a predetermined condition
  • display control means for outputting data so as to display the candidates based on the display order determined by the candidate order determination means.
  • the character candidate extraction unit receives the phonetic characters input by the user, and identifies a phonetic character string including the phonetic characters.
  • the phonetic strings correspond to ideographic characters.
  • the correspondence between the phonetic characters and the phonetic character strings, and the correspondence between the phonetic character strings and the ideographic characters, are recorded in the character recording unit. Therefore, the character candidate extraction means can extract candidates for the ideographic character that the user wishes to input, based on the phonetic characters received from the user.
  • the candidate order determination means rearranges the candidates of the ideographic character extracted by the character candidate extraction means on the basis of a predetermined condition, and determines the display order of the candidates.
  • the display control means outputs candidate data so that the ideographic character candidates are displayed in the display order determined by the candidate order determination means.
  • the user can input the ideographic character simply by designating the desired ideographic character from the displayed candidates. As a result, the user can obtain the desired ideographic character input with fewer operations for inputting phonetic characters.
  • for example, the phonetic characters are letters of the alphabet and the ideographic characters are Chinese kanji. That is, the phonetic character string is pinyin, and one pinyin reading corresponds to one or more kanji. The character recording unit records the correspondences between letters and pinyin, and between pinyin and kanji.
  • the character candidate extraction unit refers to the character recording unit, identifies the pinyin readings that start with the input letter, and extracts the one or more kanji corresponding to each identified pinyin reading as candidates.
  • the kanji extracted as the above candidate is presented to the user.
  • the user then only needs to specify the desired kanji from the presented candidates. That is, after inputting a letter, the user can specify and obtain the desired kanji without an operation to select the corresponding pinyin.
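The letter-to-pinyin-to-kanji lookup described above can be sketched as follows. This is a hedged illustration only: the table contents and the names `PINYIN_TO_KANJI`, `LETTER_TO_PINYIN` and `extract_kanji_candidates` are assumptions made for the example and do not appear in the source.

```python
# Hypothetical character recording unit: pinyin reading -> kanji with that reading.
PINYIN_TO_KANJI = {
    "shang": ["上", "商"],
    "shan": ["山"],
    "shi": ["是", "十"],
    "da": ["大"],
}


def build_letter_to_pinyin(pinyin_table):
    """Derive the letter -> pinyin correspondence from the pinyin table."""
    letter_map = {}
    for reading in pinyin_table:
        letter_map.setdefault(reading[0], []).append(reading)
    return letter_map


LETTER_TO_PINYIN = build_letter_to_pinyin(PINYIN_TO_KANJI)


def extract_kanji_candidates(letter):
    """Collect every kanji whose pinyin starts with the given letter."""
    candidates = []
    for reading in LETTER_TO_PINYIN.get(letter, []):
        candidates.extend(PINYIN_TO_KANJI[reading])
    return candidates
```

A single keystroke "s" yields kanji for "shang", "shan" and "shi" at once, so the user picks a kanji directly instead of first choosing among pinyin readings.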
  • the phonetic character may be a zhuyin (bopomofo) symbol and the ideographic character may be a traditional Chinese character. That is, the phonetic character string is a zhuyin reading, and one or more traditional characters correspond to one zhuyin reading.
  • in this case, the correspondences between zhuyin symbols and zhuyin readings, and between zhuyin readings and traditional characters, should be recorded in the character recording unit.
  • the character conversion apparatus may further include a character string recording unit that records an ideographic character in association with the ideographic character strings that include it, and
  • Character string candidate extraction means for receiving the ideographic character selected by the user and extracting the ideographic character strings associated with that ideographic character from the character string recording unit as ideographic character string candidates.
  • the character conversion device of the present invention converts the input phonetic character or phonetic character string into the corresponding ideographic character, and inputs the ideographic character selected by the user.
  • a character string recording unit for recording an ideographic character in association with the ideographic character strings that include it; character string candidate extraction means for receiving an ideographic character selected by the user and extracting the ideographic character strings associated with that ideographic character from the character string recording unit as ideographic character string candidates; candidate order determination means for determining the display order of the candidates extracted by the character string candidate extraction means based on a predetermined condition; and display control means for outputting data so as to display the candidates based on the display order determined by the candidate order determination means.
  • the character string candidate extraction unit extracts an ideographic character string associated with the ideographic character as a candidate from the character string recording unit in response to the selection of the ideographic character by the user.
  • when a kanji is input, the character that follows it (the kanji desired by the user) is more likely to be a kanji that forms an idiom starting with that kanji than an unrelated kanji. Therefore, if idioms that start with the kanji whose input has been confirmed are displayed as candidates, the user can input Chinese simply by selecting the desired idiom from the candidates, without separately entering pinyin.
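The idiom suggestion step can be sketched as a lookup keyed by the confirmed kanji. This is a minimal sketch under assumed data: the `IDIOMS_BY_HEAD` table and both function names are hypothetical.

```python
# Hypothetical character string recording unit: head kanji -> idioms starting with it.
IDIOMS_BY_HEAD = {
    "上": ["上海", "上学"],
    "大": ["大学", "大海"],
}


def extract_idiom_candidates(confirmed_kanji):
    """Return the idioms that start with the kanji the user just confirmed."""
    return list(IDIOMS_BY_HEAD.get(confirmed_kanji, []))


def remaining_text(idiom, confirmed_kanji):
    """Text still to be appended once the user picks an idiom candidate."""
    if not idiom.startswith(confirmed_kanji):
        raise ValueError("idiom does not start with the confirmed kanji")
    return idiom[len(confirmed_kanji):]
```

After "上" is confirmed, selecting the candidate "上海" only needs to append "海"; no pinyin for the second character is typed.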
  • the character conversion apparatus preferably further includes a handwriting input unit for inputting ideographic characters written by the user, and a character recognition unit that recognizes the ideographic characters input to the handwriting input unit and supplies information on the recognized ideographic characters to the character string candidate extraction unit.
  • preferably, when the character candidate extraction means of the character conversion device receives phonetic characters input by the user in succession, it extracts as candidates the ideographic characters corresponding to the phonetic character string received so far.
  • the character candidate extraction means can narrow down and extract the number of corresponding ideographic character candidates significantly, and the display control means can output the narrowed-down ideographic character candidates.
  • the character conversion device can thus be configured so that the probability that the displayed ideographic character candidates contain the ideographic character desired by the user increases each time the user inputs a phonetic character.
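The narrowing behaviour described above, where each additional phonetic character shrinks the candidate set, can be sketched as a small stateful class. The class name `CandidateNarrower`, its methods and the sample table are assumptions made for illustration.

```python
class CandidateNarrower:
    """Accumulates typed letters and returns kanji whose pinyin has that prefix."""

    def __init__(self, pinyin_to_kanji):
        self._table = pinyin_to_kanji
        self._buffer = ""

    def press(self, letter):
        """Append one letter and return the narrowed candidate list."""
        self._buffer += letter
        return [
            kanji
            for reading, kanji_list in self._table.items()
            if reading.startswith(self._buffer)
            for kanji in kanji_list
        ]

    def reset(self):
        """Clear the typed prefix, for example after a kanji is confirmed."""
        self._buffer = ""


# Hypothetical table used only for this example.
TABLE = {"shang": ["上"], "shan": ["山"], "shi": ["是"], "da": ["大"]}
```

With this table, typing "s", then "h", "a", "n", "g" reduces the list from three kanji to one, so the desired kanji becomes easier to find after every keystroke.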
  • a character conversion device capable of simplifying input operations of phonetic characters to obtain desired ideographic characters.
  • the candidate order determination means of the character conversion device preferably determines the display order so that the ideographic character string candidates extracted by the character string candidate extraction means are displayed in preference to the ideographic character candidates extracted by the character candidate extraction means.
  • that is, the candidate order determination means determines the display order so that the ideographic character string candidates extracted from the character string recording unit precede the ideographic character candidates extracted from the character recording unit.
  • the character conversion apparatus further includes a learning control unit that records the ideographic character selected by the user as a learning character in the learning word recording unit, and the candidate order determining unit is extracted by the character candidate extraction unit. If the selected candidate is a learning character recorded in the learning word recording unit, the display order may be determined such that the candidate is displayed with priority.
  • alternatively, learning control means for recording the ideographic character string selected by the user as a learning character string in the learning word recording unit may be provided, and when a candidate extracted by the character string candidate extraction means is a learning character string recorded in the learning word recording unit, the candidate order determination means may determine the display order so that the candidate is displayed with priority.
  • the learning control means causes the learning word recording unit to record the ideographic character or ideographic character string whose input has been determined (desired by the user) in the character conversion device as a learning character (string).
  • the candidate order determination means can judge those extracted candidates that are recorded in the learning word recording unit to be learning characters. A candidate that is a learning character is an ideographic character (string) that the user has wanted in the past, and is therefore more likely to be wanted by the user than a candidate that is not a learning character.
  • the character conversion device may further include learning control means that records the ideographic character selected by the user in the learning word recording unit as a pair learning character, in association with the immediately preceding character or immediately preceding character string input just before that ideographic character; and when a candidate extracted by the character candidate extraction means is a pair learning character recorded in the learning word recording unit in association with the immediately preceding character or immediately preceding character string, the candidate order determination means may determine the display order so that the candidate is displayed with priority.
  • likewise, learning control means may be provided that records the ideographic character string selected by the user in the learning word recording unit as a pair learning character string, in association with the immediately preceding character or immediately preceding character string input just before that ideographic character string; and when a candidate extracted by the character string candidate extraction means is such a pair learning character string, the candidate order determination means may determine the display order so that the candidate is displayed with priority.
  • the learning control means causes the learning word recording unit to record the ideographic character or ideographic character string whose input has been determined (desired by the user) in the character conversion device as a learning character (string), and additionally records it as a pair learning character (string) in association with the character (string) whose input was determined immediately before it (hereinafter referred to as the immediately preceding character (string)).
  • a pair learning character (string) is one that the user has previously chosen to follow the associated immediately preceding character (string). Therefore, when that immediately preceding character (string) is input, the pair learning character (string) is likely to be desired by the user.
  • the candidate order determination means changes the display order of the candidate to higher rank.
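The learning-based promotion described above can be sketched as a stable sort over three priority tiers. The relative ranking of pair learning over plain learning is an assumption made for this example (the source only says each is displayed with priority), and `order_with_learning` and its parameters are hypothetical names.

```python
def order_with_learning(candidates, learned, pair_learned, previous):
    """Stable-sort candidates: pair-learned first, then learned, then the rest.

    learned      -- set of characters/strings the user has selected before
    pair_learned -- set of (immediately preceding text, selected text) pairs
    previous     -- the text whose input was confirmed immediately before
    """
    def tier(candidate):
        if (previous, candidate) in pair_learned:
            return 0  # the user chose this right after `previous` before
        if candidate in learned:
            return 1  # the user chose this before, in some other context
        return 2      # never selected

    # sorted() is stable, so the dictionary order is kept within each tier.
    return sorted(candidates, key=tier)
```

For instance, with candidates `["海", "学", "午"]`, a learned set `{"午"}` and a recorded pair `("上", "学")`, the order after "上" becomes 学, 午, 海, while after any other character only 午 is promoted.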
  • preferably, the learning control means of the character conversion device generates a group identifier for classifying each learning word (a learning character or learning character string, or a pair learning character or pair learning character string), records the group identifier in the above-described learning word recording unit in association with the learning word, and generates the group identifier based on the character codes of predetermined length assigned in advance to the ideographic characters constituting the learning word.
  • the learning control means generates and assigns a group identifier for classifying the learning words (learning characters or learning character strings, or pair learning characters or pair learning character strings) recorded in the learning word recording unit into a plurality of groups. The learning control means generates the group identifier using a numerical value determined from the character code assigned in advance to each ideographic character constituting the learning word.
  • the group identifier generated in this way has a fixed length no matter what learning word is assigned. That is, as a specific example, even if it is a learning word consisting of a two-character idiom or a learning word consisting of a six-character idiom, group identifiers of the same size are assigned to them.
  • the data size increases as the number of learning words recorded in the learning word recording unit increases and, in the case of idioms, as the number of characters increases. As a result, searching the learning word recording unit for a matching learning word to determine whether a candidate is a learning word takes a great deal of time, and the processing efficiency of character input decreases.
  • when a fixed-length group identifier is assigned to each learning word, the time needed in the search to determine whether the input character (string) matches a learning word, or belongs to the same group, is uniform regardless of the number of characters in the learning word. This suppresses the proportional increase in search time as the number of learning words grows, and reduces the impact on processing efficiency.
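A fixed-length group identifier derived from character codes can be sketched as follows. The source does not give the formula, so the 16-bit polynomial fold used here is purely an assumption, as are the names `group_id` and `LearningStore`; any function that maps the character codes to a fixed-size value would serve the same purpose.

```python
def group_id(word):
    """Fold the code points of a word into a fixed 16-bit identifier."""
    gid = 0
    for ch in word:
        gid = (gid * 31 + ord(ch)) & 0xFFFF  # always stays within 2 bytes
    return gid


class LearningStore:
    """Learning words bucketed by group identifier."""

    def __init__(self):
        self._groups = {}

    def record(self, word):
        bucket = self._groups.setdefault(group_id(word), [])
        if word not in bucket:
            bucket.append(word)

    def is_learned(self, word):
        # Only the one bucket sharing the identifier is scanned, so the
        # lookup does not walk the whole learning word list.
        return word in self._groups.get(group_id(word), ())
```

A two-character idiom and a seven-character idiom both receive an identifier of the same size, so comparing identifiers costs the same for either.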
  • the above-mentioned character conversion device further associates the ideographic character or ideographic character string selected by the user with the forward character or the forward character string whose input is determined prior to the ideographic character or ideographic character string.
  • the candidate order determination unit determines the display order so that the candidate is displayed with higher precedence as the degree of association with the front character or the front character string is higher.
  • the candidate order determination means determines the display order of the candidates based on the degree of association with the forward character (string).
  • the degree of association indicates how likely an ideographic character or ideographic character string whose input is determined by the character input determination means is to follow a forward character or forward character string whose input was determined before it.
  • for example, considering the relationship between "Shanghai" and "University", the candidate order determination means moves "University" to the top of the candidates.
  • because the candidate order determination means determines the display order of the candidate ideographic characters (strings) in consideration of not only continuity with the immediately preceding character (string) but also the relationship with characters (strings) further ahead, it can present the candidates in an order that better reflects the user's wishes.
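Ordering by degree of association with earlier text can be sketched as below. The relevance table, its scores and the function name `order_by_relevance` are hypothetical; the source states only that candidates with a higher degree of association are displayed with priority.

```python
# Hypothetical relevance degree recording unit:
# (forward string, candidate) -> degree of association.
RELEVANCE = {
    ("上海", "大学"): 5,
    ("上海", "大海"): 1,
}


def order_by_relevance(candidates, forward_strings, relevance):
    """Sort candidates by their best association with any forward string."""
    def score(candidate):
        return max(
            (relevance.get((forward, candidate), 0) for forward in forward_strings),
            default=0,
        )

    # reverse=True keeps the original order among equal scores (stable sort).
    return sorted(candidates, key=score, reverse=True)
```

If "上海" (Shanghai) was confirmed earlier, even with another character such as "的" in between, the candidate "大学" (University) rises to the top of the list.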
  • the character conversion device preferably further includes candidate order determination element setting means for setting, as designated by the user, the predetermined conditions to be referred to when the candidate order determination means determines the display order of the candidates.
  • the candidate order determination means preferably determines the display order of the candidates based on the conditions set by the candidate order determination element setting means. According to the above configuration, when there are a plurality of predetermined conditions that the candidate order determination means can refer to when determining the display order of candidates, the candidate order determination element setting means sets, in accordance with the user's instruction, which of those conditions the display order is based on.
  • various conditions can be considered, such as: (1) whether the candidate is a character (kanji) or a character string (idiom); (2) whether it is a learning character; (3) whether it is a pair learning character; (4) when it was learned; (5) how often it is generally used; (6) whether the length of the phonetic characters entered as the current reading matches the candidate's phonetic character length (for example, when entering “technology”, the readings “giji” (2 characters) and “gijutsu” (4 characters) differ in phonetic character length); (7) whether it has continuity with the preceding string; (8) whether it is related to a forward string; and (9) how often it is used by the user.
  • the display order of the candidates determined by the candidate order determination means thus reflects the user's wishes more accurately.
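User-selectable ordering conditions can be sketched as a set of named scoring elements, of which only the enabled ones contribute to a candidate's evaluation value. The element names, the plain summation and the `ctx` dictionary layout are all assumptions chosen for this illustration; the source does not specify how the conditions are combined.

```python
# Hypothetical scoring elements; each returns a number for one candidate.
ELEMENTS = {
    "is_idiom": lambda cand, ctx: 1.0 if len(cand) > 1 else 0.0,
    "learned": lambda cand, ctx: 1.0 if cand in ctx["learned"] else 0.0,
    "user_frequency": lambda cand, ctx: float(ctx["user_counts"].get(cand, 0)),
}


def order_by_elements(candidates, enabled, ctx):
    """Order candidates by the sum of the elements the user has enabled."""
    def evaluation(candidate):
        return sum(ELEMENTS[name](candidate, ctx) for name in enabled)

    # Stable sort: ties keep their original (dictionary) order.
    return sorted(candidates, key=evaluation, reverse=True)
```

Enabling only `"is_idiom"` puts idioms first, while enabling `"user_frequency"` and `"learned"` together favours what this particular user has typed most often.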
  • the user can obtain the desired ideographic character input with a smaller number of operations relating to the input of the phonetic characters.
  • the copying machine includes the above-described character conversion device; a first input unit that has a key corresponding to each phonetic character and supplies the character conversion device with the phonetic character corresponding to the key pressed by the user; a second input unit for inputting the candidate selected by the user to the character conversion device; and a display unit that displays the candidates extracted by the character candidate extraction means or the character string candidate extraction means in accordance with the display order determined by the candidate order determination means, wherein the first input unit, the second input unit, and the display unit are an integrally formed touch panel or tablet.
  • the copying machine thus has the first and second input units, through which the phonetic characters and the ideographic characters selected by the user are input to the character conversion device, and a display unit that presents ideographic characters (strings) to the user as candidates.
  • the user can therefore obtain the desired ideographic character input with a smaller number of operations relating to the input of phonetic characters.
  • the copying machine further includes a writing character input unit for inputting ideographic characters written by the user, recognizes the ideographic characters input to the writing character input unit, and detects information on the ideographic characters. It is preferable to include character recognition means for supplying the above character conversion device.
  • the character input method converts an input phonetic character or phonetic character string into a corresponding ideographic character, and inputs an ideographic character or ideographic character string selected by the user.
  • in the first step, a phonetic character input by the user is accepted, and ideographic character candidates corresponding to a phonetic character string including that phonetic character are extracted.
  • in the second step, the display order of the extracted candidates is determined.
  • the display order may be determined based on, for example, the predetermined conditions as described above.
  • ideographic character candidates are output based on the determined display order.
  • the character conversion device and the copying machine may be realized by a computer.
  • in this case, a control program that realizes the character conversion device or the copying machine on a computer by operating the computer as each of the above means, and a computer-readable recording medium on which the control program is recorded, also fall within the scope of the present invention.
  • a character conversion device is a character conversion device for converting an input phonetic character or phonetic character string into a corresponding ideographic character and inputting an ideographic character or ideographic character string selected by the user.
  • a character recording unit for recording a phonetic character in association with the ideographic characters corresponding to a phonetic character string including that phonetic character; character candidate extraction means for receiving a phonetic character input by the user and extracting the ideographic characters associated with that phonetic character from the character recording unit as ideographic character candidates; candidate order determination means for determining the display order of the candidates extracted by the character candidate extraction means; and display control means for outputting data so as to display the candidates based on the display order determined by the candidate order determination means.
  • a degree of association, indicating how likely an ideographic character or ideographic character string input in the past is to be input after a forward character or forward character string that was input further ahead than immediately before it, is stored in a relevance degree recording unit in association with that ideographic character (string) and forward character (string). The candidate order determination means detects, from the relevance degree recording unit, the degree of association between each candidate and the forward character or forward character string input ahead of the candidate, and determines the display order so that candidates with a higher degree of association are displayed with priority.
  • the character conversion device of the present invention converts the input phonetic characters or phonetic character strings into corresponding ideographic characters, and inputs the ideographic characters selected by the user.
  • a character string recording unit that records an ideographic character in association with the ideographic character strings that include it; character string candidate extraction means that receives an ideographic character selected by the user and extracts the ideographic character strings associated with that ideographic character from the character string recording unit as ideographic character string candidates; candidate order determination means that determines the display order of the candidates extracted by the character string candidate extraction means; and display control means that outputs data so as to display the candidates based on the display order determined by the candidate order determination means.
  • a degree of association, indicating how likely an ideographic character or ideographic character string input in the past is to be input after a forward character or forward character string that was input further ahead than immediately before it, is stored in a relevance degree recording unit in association with that ideographic character (string) and forward character (string). The candidate order determination means detects the degree of association between each candidate and the forward character or forward character string input ahead of the candidate, and determines the display order so that candidates with a higher degree of association are displayed with priority.
  • the user can obtain desired ideographic character input with a smaller number of operations relating to the input of the phonetic characters.
  • preferably, a degree of continuity, indicating how likely an ideographic character or ideographic character string input in the past is to be input immediately after the immediately preceding character or immediately preceding character string, is stored in a continuity recording unit in association with that ideographic character (string) and the immediately preceding character (string), and the candidate order determination means determines the display order so that candidates with a higher degree of association and a higher degree of continuity with the immediately preceding character or immediately preceding character string are displayed preferentially.
  • preferably, each time the character candidate extraction unit of the character conversion device receives a phonetic character input by the user, it concatenates that phonetic character to the end of the phonetic character string already received, and extracts the ideographic characters corresponding to the resulting phonetic character string as candidates.
  • preferably, the learning control means of the character conversion device generates a group identifier for classifying each learning word to be recorded in the learning word recording unit (a learning character or learning character string, or a pair learning character or pair learning character string), records the group identifier in the learning word recording unit in association with the learning word, and generates the group identifier based on the character codes of predetermined length assigned in advance to the ideographic characters constituting the learning word.
  • the character conversion device preferably further includes candidate order determination element setting means that sets, as the predetermined condition to be referred to when the candidate order determination means determines the display order of the candidates, an element designated by the user from among a plurality of types of elements that can be used to calculate an evaluation value indicating how likely a candidate is to be selected by the user.
  • elements other than the degree of association are stored in an element recording unit, and the candidate order determination means preferably determines the display order of the candidates based on the degree of association and on the elements, stored in the element recording unit, that were set by the candidate order determination element setting means.
  • the character input method converts an input phonetic character or phonetic character string into a corresponding ideographic character, and inputs an ideographic character or ideographic character string selected by the user.
  • the character conversion device includes character candidate extraction means, candidate order determination means, and display control means. The method includes a first step in which the character candidate extraction means receives a phonetic character input by the user and extracts ideographic character candidates; a second step in which the candidate order determination means determines the display order of the candidates extracted in the first step; and a third step in which the display control means outputs data so as to display the candidates based on the display order determined in the second step.
  • the second step includes a step of detecting, from a relevance degree recording unit, the degree of association between each candidate and the forward character or forward character string input ahead of the candidate, and determining the display order so that candidates with a higher degree of association are displayed with priority. The relevance degree recording unit stores, in association with an ideographic character or ideographic character string input in the past and a forward character or forward character string input further ahead than immediately before it, a degree of association indicating how likely that ideographic character or ideographic character string is to be input after the forward character or forward character string.
  • a character input method in a character conversion device that converts an input phonetic character or phonetic character string into a corresponding ideographic character and inputs an ideographic character selected by the user
  • the character conversion device includes character string candidate extraction means, candidate order determination means, and display control means. The method includes a first step in which the character string candidate extraction means receives an ideographic character selected by the user and extracts, as ideographic character string candidates, the ideographic character strings associated with that ideographic character from a character string recording unit that records an ideographic character in association with the ideographic character strings that include it; a second step in which the candidate order determination means determines the display order of the candidates extracted in the first step; and a third step in which the display control means outputs data so as to display the candidates based on the display order determined in the second step.
  • the second step includes a step of detecting, from a relevance degree recording unit, the degree of relevance between each candidate and the forward character or forward character string input ahead of the candidate, and determining the display order so that candidates with a higher degree of relevance are displayed with priority. The relevance degree recording unit stores, in association with an ideographic character or ideographic character string input in the past and a forward character or forward character string input ahead of it, a degree of relevance indicating how likely that ideographic character or ideographic character string is to be input after the forward character or forward character string.
  • the present invention can be applied to various devices for converting readings (phonetic characters), in particular, converting readings into kanji.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention relates to a character conversion device (1) comprising: a kanji dictionary unit (41) containing kanji correspondence information (144), which includes kanji and the readings associated with them, and kanji indices (145), in which the first characters of pinyin are associated with address information on the storage locations, within the kanji correspondence information (144), of the kanji whose first parts are those first characters; a word dictionary unit (42) containing word correspondence information (162), which includes words and the readings associated with them, and word indices (161), in which the first characters of words are associated with address information on the storage locations, within the word correspondence information (162), of the words whose first parts are those first characters; and word acquisition means that refers to the kanji dictionary unit (41) and the word dictionary unit (42) according to the first character of the first pinyin of an input character string and acquires a word corresponding to the input character string. The time taken to search for a word corresponding to the input reading can be shortened.
PCT/JP2007/064209 2006-08-10 2007-07-18 Character conversion device and method for controlling character conversion device WO2008018274A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2008528762A JP4890551B2 (ja) 2006-08-10 2007-07-18 Character conversion device, control method of character conversion device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2006-218970 2006-08-10
JP2006218970 2006-08-10

Publications (1)

Publication Number Publication Date
WO2008018274A1 (fr) 2008-02-14

Family

ID=39032808

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2007/064209 WO2008018274A1 (fr) 2006-08-10 2007-07-18 Character conversion device and method for controlling character conversion device

Country Status (3)

Country Link
JP (1) JP4890551B2 (fr)
CN (1) CN101553810A (fr)
WO (1) WO2008018274A1 (fr)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2015528968A (ja) * 2012-07-30 2015-10-01 Microsoft Corporation Generating string predictions using contexts
WO2022038809A1 (fr) * 2020-08-18 2022-02-24 Hitachi, Ltd. Information expression creation support device, information expression creation support method, and computer program
US20230266875A1 (en) * 2020-08-31 2023-08-24 Kiyoshi Kasatani Display apparatus, input method, and program

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6491438B2 (ja) * 2014-08-29 2019-03-27 Hitachi Social Information Services, Ltd. Migration support device
CN105404677B (zh) * 2015-11-20 2018-12-18 四川神琥科技有限公司 Retrieval method based on a tree structure
CN105426490B (zh) * 2015-11-20 2019-03-26 四川神琥科技有限公司 Indexing method based on a tree structure

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
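JPH04225415A (ja) * 1990-12-26 1992-08-14 Mitsubishi Electric Corp Pinyin Kanji input system
JPH0540946U (ja) * 1991-10-29 1993-06-01 Casio Computer Co., Ltd. Word processor
JP2004005379A (ja) * 2002-04-16 2004-01-08 Fujitsu Ltd Information processing device, information processing method, information processing program, and recording medium
JP2006050160A (ja) * 2004-08-03 2006-02-16 Sharp Corp Chinese input device, Chinese input program, and Chinese input recording medium
JP2006155322A (ja) * 2004-11-30 2006-06-15 Nec Fielding Ltd Input prediction dictionary system and input prediction method
JP2007188449A (ja) * 2006-01-16 2007-07-26 Sharp Corp Character input device, copying machine provided with the character input device, character input method, control program, and recording medium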

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0388662A (ja) * 1989-08-31 1991-04-15 Toshiba Seiki Kk Form connecting device

Also Published As

Publication number Publication date
JPWO2008018274A1 (ja) 2009-12-24
CN101553810A (zh) 2009-10-07
JP4890551B2 (ja) 2012-03-07

Similar Documents

Publication Publication Date Title
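JP4413868B2 (ja) Character input device, copying machine provided with the character input device, character input method, control program, and recording medium
JP4829901B2 (ja) Method and apparatus for resolving manually entered ambiguous text input using speech input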
TWI293455B (en) System and method for disambiguating phonetic input
CN100555203C (zh) System and method for correcting entered text
US20070100619A1 (en) Key usage and text marking in the context of a combined predictive text and speech recognition system
US8977535B2 (en) Transliterating methods between character-based and phonetic symbol-based writing systems
US8199112B2 (en) Character input device
EP2153352A1 (fr) Recognition architecture for generating Asian characters
TW200538969A (en) Handwriting and voice input with automatic correction
CN1248333A (zh) Reduced keyboard disambiguating system
US20120296647A1 (en) Information processing apparatus
WO2008018274A1 (fr) Character conversion device and control method for character conversion device
TWI313425B (en) Method, system, and computer readable storage medium for processing user entry of an ideographic language phrase
CN101520693A (zh) Method and system for rapid input of batch information
US9171234B2 (en) Method of learning a context of a segment of text, and associated handheld electronic device
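JPH08314910A (ja) Heterogeneous code character string transcription device and electronic dictionary
JP2015040908A (ja) Information processing device, information update program, and information update method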
US20130073572A1 (en) Method of learning character segments from received text, and associated handheld electronic device
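JP2013214187A (ja) Character input device, control method for character input device, control program, and recording medium
JP5751537B2 (ja) Internationally compatible Japanese input system
JP5141130B2 (ja) Electronic device having dictionary function, and program
JP3803253B2 (ja) Method and apparatus for Kanji input
KR100910302B1 (ko) Apparatus and method for multimodal-based information retrieval
JP2000090193A (ja) Character recognition device and item classification method
TWI406139B (zh) Pinyin translation and pinyin lookup system with tones, and method thereof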

Legal Events

Date Code Title Description
WWE WIPO information: entry into national phase

Ref document number: 200780037603.7

Country of ref document: CN

121 EP: the EPO has been informed by WIPO that EP was designated in this application

Ref document number: 07790963

Country of ref document: EP

Kind code of ref document: A1

WWE WIPO information: entry into national phase

Ref document number: 2008528762

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

NENP Non-entry into the national phase

Ref country code: RU

122 EP: PCT application non-entry in European phase

Ref document number: 07790963

Country of ref document: EP

Kind code of ref document: A1