CN114546127B - Keyboard design method based on initial consonant and vowel layout - Google Patents

Keyboard design method based on initial consonant and vowel layout Download PDF

Info

Publication number
CN114546127B
CN114546127B CN202210176856.9A CN202210176856A CN114546127B CN 114546127 B CN114546127 B CN 114546127B CN 202210176856 A CN202210176856 A CN 202210176856A CN 114546127 B CN114546127 B CN 114546127B
Authority
CN
China
Prior art keywords
initial
keyboard
initials
final
finals
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210176856.9A
Other languages
Chinese (zh)
Other versions
CN114546127A (en
Inventor
张昱宽
杨毅
苏昱恺
张宇泓
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lanzhou University
Original Assignee
Lanzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lanzhou University filed Critical Lanzhou University
Priority to CN202210176856.9A priority Critical patent/CN114546127B/en
Publication of CN114546127A publication Critical patent/CN114546127A/en
Application granted granted Critical
Publication of CN114546127B publication Critical patent/CN114546127B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/0202Constructional details or processes of manufacture of the input device
    • G06F3/0219Special purpose keyboards

Abstract

The application discloses a keyboard design method based on initial consonant and vowel layout in the technical field of computer equipment, which comprises a keyboard, wherein initial consonant characters and punctuation characters are arranged on an input component of the keyboard; the scheme aims at the character editing workers using double spelling to simplify the typing speed, according to materials such as novels, news and the like of tens of thousands of words, and the relevance degree of the initials and the finals is judged by utilizing a relevance matrix, according to human engineering documents, the arrangement order of the initials and the finals is analyzed, an arrangement diagram of the initials and the finals is manufactured, after the relevance among the initials and the finals is calculated, according to the balance principle of left-right hand alternation rate in human engineering, the layout of the initials and the finals is reasonably distributed according to each finger of the size of working intensity, the layout of the design of the initial and the finals keyboard is applied to the existing keyboard, and the key positions of the existing keyboard are correspondingly changed into the initials and the finals, so that the Chinese input speed is improved, and the error rate, the fatigue degree and the left-right hand alternation rate of the code length are reduced.

Description

Keyboard design method based on initial consonant and vowel layout
Technical Field
The application relates to the technical field of computer equipment, in particular to a keyboard design method based on initial consonant and vowel layout.
Background
The qwerty keyboard layout based on the English input method of the traditional keyboard is designed for English input, is not efficient for Chinese input, has the main defects of low typing speed, designs an initial and final keyboard layout design scheme based on initial and final relevance aiming at Chinese input characteristics, and designs a keyboard layout for efficient Chinese input according to the scheme, and therefore, provides a keyboard design method based on the initial and final layout.
Disclosure of Invention
The application aims to provide a keyboard design method based on initial and final layout so as to solve the problems in the background technology.
In order to achieve the above purpose, the present application provides the following technical solutions: the keyboard design method based on the initial consonant and vowel layout comprises a keyboard, wherein initial consonant and vowel characters and punctuation characters are arranged on an input component of the keyboard;
the layout design steps of the initial consonant character and the punctuation mark character are as follows:
step one: target user data investigation;
step two: collecting an initial and final input data set and analyzing an algorithm;
step three: cleaning input data of initials and finals, and performing word segmentation operation to obtain a vocabulary sequence;
step four: extracting front and rear initials contained in the words, calculating the occurrence frequency of each different ordered binary group, drawing a correlation matrix according to statistics, and intuitively analyzing the correlation among the initials;
step five: investigation of data of the knocking difficulty of the initial and final input fingers, investigation of the load data born by the flexibility of the initial and final input fingers, and reasonable distribution of the initial and final layout according to the fingers of the working intensity according to the balance principle of left-right hand alternation rate in human engineering;
step six: the key position is correspondingly changed into initial consonant and final sound.
Preferably, the input component of the keyboard is a keycap.
Preferably, the target user in the first step is a text editing worker who uses double spelling to simplify and accelerate the typing speed.
Preferably, the data collection in the second step includes capturing near ten million word materials of three types of news, blogs and novels, counting the number of the initials and the finals, and judging the association degree of the initials and the finals by using a correlation matrix.
Preferably, the step three of cleaning the initial and final input data includes screening out data which do not meet the requirements, such as messy codes, wrongly written characters, english input and the like, and performing corresponding transcoding on the data meeting the requirements by using python to read the pronunciation rules of the pinyin dictionary to obtain the required initial and final.
Compared with the prior art, the application has the beneficial effects that: the scheme aims at the character editing workers using double spelling to simplify the typing speed, according to materials such as novels, news and the like of tens of thousands of words, and the relevance degree of the initials and the finals is judged by utilizing a relevance matrix, according to human engineering documents, the arrangement order of the initials and the finals is analyzed, an arrangement diagram of the initials and the finals is manufactured, after the relevance among the initials and the finals is calculated, according to the balance principle of left-right hand alternation rate in human engineering, the layout of the initials and the finals is reasonably distributed according to each finger of the size of working intensity, the layout of the design of the initial and the finals keyboard is applied to the existing keyboard, and the key positions of the existing keyboard are correspondingly changed into the initials and the finals, so that the Chinese input speed is improved, and the error rate, the fatigue degree and the left-right hand alternation rate of the code length are reduced.
Drawings
FIG. 1 is a schematic diagram of the structure of the present application;
FIG. 2 is a schematic diagram of the initial consonant layout of the present application;
FIG. 3 is a diagram of the statistical data of the investigation of the present application;
FIG. 4 is a diagram of the statistical data of the investigation of the present application;
FIG. 5 is a diagram of the statistical data of the investigation of the present application;
FIG. 6 is a diagram of a portion of code for data crawling in accordance with the present application;
FIG. 7 is a schematic diagram of a data cleansing portion of the present application;
FIG. 8 is a schematic diagram of a data cleansing portion of the present application;
FIG. 9 is a schematic diagram of the cleaning statistics of the initial consonant data of the present application;
FIG. 10 is a diagram showing the cleaning statistics of final data according to the present application;
FIG. 11 is a schematic diagram of the present application's review of the literature;
FIG. 12 is a schematic view of the present application in reference to the literature;
FIG. 13 is a partial code schematic diagram of a data analysis according to the present application;
FIG. 14 is a schematic diagram of the initial consonant layout statistics of the present application;
FIG. 15 is a schematic diagram of the initial consonant layout statistics of the present application;
FIG. 16 is a schematic view of the present application in reference to the literature;
fig. 17 is a schematic diagram of the present application in reference to the literature.
In the figure: 1. and a keyboard.
Detailed Description
Example 1
Referring to fig. 1-17, the present application provides a technical solution:
the novel keyboard appearance proposed by the scheme is shown in the attached figure 1 of the specification, and the keyboard 1 is convenient for Chinese input by arranging initial and final characters and punctuation characters on the key caps of the keyboard 1;
according to the scheme, english characters on the surface of the key cap of the traditional keyboard are replaced by initial consonant characters and punctuation characters through modification on the keyboard of the QWERT keyboard layout which is common in the market, so that the keyboard design of the scheme is more convenient for Chinese input.
The letter layout on the keyboard on the market is designed aiming at the English input method, no layout specially designed for Chinese input exists so far, the project abandons a single-letter spelling mode of English input, an initial consonant-final input mode is designed according to Chinese spelling habit and combining the character of Chinese input as the main word, complex final characters are added, special symbols of English input are removed, the weight of each initial consonant is calculated by utilizing an algorithm, and the Chinese keyboard layout is designed by combining ergonomics.
After investigation and research, it is found that the majority of word editors using double spelling to simplify and accelerate the typing speed are involved, and our keyboard can accelerate the typing speed of double spelling and increase the Chinese input speed.
Meanwhile, we grasp news, blogs and novel three kinds of data, namely, ten-thousand words, count the number of each initial consonant and vowel, judge the association degree of the initial consonant and vowel by utilizing a correlation matrix, analyze the arrangement order of the initial consonant and vowel according to human engineering documents, and manufacture an arrangement diagram of the initial consonant and vowel.
Firstly, a questionnaire is analyzed aiming at specific users and markets, namely, people (social newspaper editor, chinese novel author) needing to use a large amount of Chinese to perform typing work, and the questionnaire display results are shown in the accompanying figures 3-5 of the specification. The questionnaire shows that, in specific users, 78.85% of people consider that the existing english layout keyboard has low efficiency of inputting Chinese characters, people are generally familiar with the layout of the english keyboard but cannot better improve the efficiency, and for the group, we issue a second questionnaire to explore whether they are willing to accept the chinese keyboard layout, wherein 63.83% of users consider that only can improve the efficiency of the code words, they are willing to accept the chinese keyboard layout, most of word editing workers use double spelling, and we draw conclusions from the investigation report: the project has good market prospect.
Then we continue to collect the data set (related works, manuscripts, etc.) for the specific target user, crawl the text data using the beautfulsource module, we crawl according to the order or keyword information. By specifying the number of pages traversed, the content of the specified web site can be crawled and written locally.
Part of the code is shown in figure 6 of the specification.
And then the obtained data set is cleaned, data which do not meet the requirements such as messy codes, mispronounced words, english input and the like are screened out, and the data meeting the requirements are correspondingly transcoded by utilizing python to read the pronunciation rules of the pinyin dictionary to obtain the required initial consonant and vowel, wherein part of codes are shown in the accompanying drawings 7-8 of the specification.
The results are shown in the accompanying figures 9-10 of the specification.
Then we consult the related literature (the article number of the improved keyboard layout for English input: 1000-7024 (2004) 01-015304-04) and record that the letter area is bilaterally symmetrical, so that only one side needs to be studied, the other side can correspondingly push out the 4 types of conditions of the right side selected according to the movement of the fingers and the palm during the keystroke according to the characteristics 1 and 2 of the fingers during the keystroke, the letter area of the natural keyboard can be firstly distinguished into some basic areas, and the basic key positions 4 RMZ, RM3, RM4 and RMS (namely the key positions of the right hand fingers initially placed during the keystroke) can be obtained for the 1 st type of conditions (see the middle-row light gray area on the right side of the figure 4 in the figure 11 of the description); for the case of type 2, the combination characteristic 35 can obtain one key position RDZ of index finger back striking and two key positions RU3 and RU4 of middle finger and ring finger extension striking for a total of 3 key positions, so that an extended basic key position area consisting of 7 extended basic key positions (see the whole light gray area on the right side of fig. 4 in fig. 11 of the specification) including 4 basic key positions can be obtained; the corresponding most difficult to use key region can be obtained for the class 4 case (see the upper left dark grey key on the right side of fig. 3 in fig. 11 of the specification); finally, the common key area corresponding to the 3 RD type situation, namely, the area consisting of the rest 5 key positions RUZRMIRDIRD3 and RD4 except the extended basic key area and the most difficult to use key area, is required to be adjusted to be more easily compared with the use difficulty sequence of the key positions of the natural keyboard because the natural angle of the left side of the letter area is not in bilateral symmetry when the letter area is not in line with the left hand key, for example, the extended basic key area on the left side of the letter area is used for retracting and beating by a left index finger and does not contain the key position of the wrist, one extended basic key position is reduced (see the whole light gray area on the left side of fig. 6 in the specification), the key position numbers of the extended key area of the natural keyboard are sequentially increased according to the sequence of index finger ring fingers and small fingers in charge of the letter area, because the key position numbers of the non-basic key area are respectively larger than the key position numbers of the corresponding basic key area (respectively refer to the key fingers in charge of the same fingers), the key position of the non-basic key area on the left side of the natural keyboard is not in charge of the letter area, for easily striking by the same fingers in charge of the case 2, the key position of the key area is respectively most difficult to be pulled by the index fingers in the left finger and the key area, and the key position of the key area is respectively most difficult to be similar to the key position of the key position on the left finger in the specification 1 and the specification can be similarly obtained by using the key position of the key position in the left finger 11 in the specification to the specification 1.
Based on the difficulty of using each key in the key area in the literature, the initial consonants obtained after analysis are placed in the places where the keys are easy, because based on the project and based on the word input, most of the final is completed only by pressing tab keys.
In the key position design arrangement, the load amount born by the flexibility degree of each finger in the human engineering is also considered, and the reference document is the document number of the exploration of the letter layout of the keyboard of the Chinese pinyin input method, namely 1003-0077 (2010) 06-0108-06, which is recorded in the specification, namely index finger, middle finger, ring finger and little finger cloth according to the flexibility degree and the born load and the frequency of occurrence of each finger; according to engineering psychology experimental data, the interval of the same finger striking keyboard motion is 0.09s on average, the interval of striking motions among different fingers of the same hand is 0.03s on average, the interval of striking motions among fingers of different hands is 0.02s on average, the efficiency of alternate striking the keyboard by the left hand and the right hand is higher than that of continuous striking the keyboard by the single hand, and the fingers are less prone to fatigue in the condition, so that striking motions are alternately performed in the left hand and the right hand as much as possible, and recorded data are shown in figure 12 of the specification.
Secondly, when considering the common initial consonant statistics, in order to discharge the initial consonants of more Chinese characters to a more suitable position, the value of the alternation rate needs to be as close as possible to 1, the reference document is "exploration about the letter layout of a keyboard of a Chinese pinyin input method". If the chapter number of 1003-0077 (2010) 06-0108-06, wherein the "left key area is set as an A area, the right key area is set as a B area, the pinyin of a certain Chinese character is a1a2 … ai … an, wherein ai is { a, B, c, …, z }, the pinyin length a1a2 … ai … an <7,i =1, 2, …, N, N is E N +, if ai appears in the A area, ai+1 appears in the B area, the ratio of the number p of the Chinese characters alternately performed by left and right hands and the total Chinese character number q is called the alternation rate. Here we refer to the alternation rate, considering that the initial consonant combination can drive more words, i.e. more Chinese characters under the condition of collocation of left and right hands, so as to meet the value of the alternation rate as close to 1 as possible, we analyze a large number of data sets through python, and part of codes are shown in fig. 13 of the specification.
And counting the initial and final relevance of all the two words and constructing a relevance matrix. And the keyboard layout is primarily designed according to the statistical result, the display result is shown in the attached drawings 14-15 of the specification, the darker areas in the drawing are the consonants, more words can appear in the combination of the consonants, and the consonants can be placed in more reasonable places.
Continuing to further study the ergonomics, reference is made to Chapter number 1003-0077 (2010) 06-0108-06 of "exploration of the layout of the keyboard letters for Chinese Pinyin input method", wherein the description is that "because on the load that can be borne, the right hand > left hand should theoretically work the right keypad is larger than the left keypad, but the number of keys on which the Pinyin letters can be arranged on the right keypad is 11, the number of keys on the left keypad is 15, and the static workload of the left keypad and the right keypad calculated according to the arrangement program is represented by table 5 and table 6 in fig. 16 of the specification, the static and dynamic workload of the upper row and the lower row of the middle row is calculated by the program, and the details are shown in table 7 and table 8 in fig. 17 of the specification. "it is reasonable to show from the map data that the left keypad is larger than the right keypad in terms of workload, but should not be too much exceeded; the middle row is reasonable in terms of workload, the upper row is reasonable in terms of workload, and the lower row is reasonable in terms of workload. We also consider this as one of the important factors of the layout in the keyboard layout.
So the initial and final character layout in figure 1 of the specification is designed at present.
In summary, the design idea of the scheme is as follows:
1. specific user groups are determined, corpus data are collected on regular websites such as national statistical networks and CNKI and the like and used as a data set for empirical statistical analysis, and through technical means such as data cleaning, word segmentation and the like, the word frequencies of two-word words and multi-word words are respectively counted, a dictionary from Chinese characters to pinyin is built, and the Chinese characters are mapped to pinyin;
2. the correlation between initial consonants and vowels in the vocabulary is analyzed by utilizing a statistical principle, and the importance of the corresponding key positions is determined according to the strength and the occurrence frequency of the dependency relationship;
3. by combining the difficulty level of each key position of the traditional keyboard layout, the flexibility level of each finger and the corresponding load bearing capacity in human engineering, the theory is comprehensively designed to ensure that the fingers are less prone to fatigue, and the keyboard layout based on initial and final key positions has higher alternation rate;
4. analysis algorithm: the text data is collected for a group of people needing layout design and mapped to a corresponding pinyin sequence and an vowel sequence. The pinyin sequence is then converted to a directed weight graph using the TextRank algorithm. The important components in the text are ordered by using a voting mechanism, and the ordering of the keywords can be realized by using the information of the current document. The key words which are not finally extracted are needed, and the undirected weight graph learned by the model in the iterative training process is extracted to be used as a reference of the keyboard layout;
5. in the concrete implementation, according to each piece of original text data, word segmentation and part-of-speech tagging are carried out, stop words are filtered, transition to a pinyin sequence is completed, and sorting based on the correlation degree between initial consonants and vowels in pinyin is completed. The input of the sequence further refines the analysis of the original sentence level of the TextRank, n continuous initial consonant and vowel combinations in the pinyin sequence are used as metadata, and the co-occurrence degree between the initial consonants and the vowels is inspected. Firstly, a mapped pinyin sequence is used as a model to be input, sequence labeling is carried out, an undirected graph G= (V, E) of related initials and finals is constructed, wherein V is a node set and consists of all initials and finals, and E is an edge set and represents the correlation degree between connected nodes. And then adopting co-occurrence relation (co-current) to iteratively calculate the edge between two points, wherein any two nodes have a relation, and the model is to learn the relation strength between the nodes through the input of the Pinyin sequence. The setting of the size K of the collinear window is evaluated by experiments as a super parameter. According to the formula, iteratively propagating the weights of all nodes and the weights of edges. On the other hand, we break away from the correlation calculation at sentence level, directly calculate the correlation of different initials on global context;
6. in implementation, we still get the vocabulary sequence through data cleaning and word segmentation operation. And then extracting front and rear initials contained in the words, calculating the occurrence frequency of each different ordered binary group, drawing a correlation matrix according to statistics, and intuitively analyzing the correlation between the initials.
7. The basic use rule of the input method is that after the correlation between the initials is calculated by us, according to the balance principle of left-right hand alternation rate in human engineering, we hope that after the initials are input by the left hand of the user, the initials are input by the right hand, or after the initials are input by the right hand, the initials are input by the left hand, according to the balance principle of the left-right hand workload, the left-right manual workload is reasonably designed, so that the left-right manual workload is approximately the same, and all fingers are reasonably distributed according to the working intensity. Meanwhile, the work intensity of each finger is reasonably distributed on the layout.
8. Letter key position arrangement design idea: our input is divided into initials and finals, and correspondingly, the key positions are changed into initials and finals. The initial consonant is integrally positioned at the upper part of the keyboard. The final is input by directly inputting the initial consonant and using the full key to complement.
9. Based on the layout design thought, the keyboard layout design mode is compared with the full spelling mode and the double spelling mode:
1. contrast double spelling
The special vowel position mapping in the double spelling is not needed to be memorized, and the vowels are intuitively represented by different key positions in the keyboard layout;
2. contrast full spelling
The designed keyboard based on initial and final layout is superior to full spelling in terms of code length error rate (code length is the number of times that a word needs to be input, for example, zhang full spelling code length is 5, and the novel keyboard in the application only needs zh and ang, the code length is 2), fatigue degree and left-right hand alternation rate. Through logic analysis, the main reason for the difference is that the average code length is greatly reduced, the same content is input, the number of the fully-spliced key is more than that based on the vowels, the fully-spliced input is more slow on the premise of a certain key speed, and the possibility of error modification is also higher.

Claims (5)

1. The keyboard design method based on initial and final layout comprises a keyboard (1) and is characterized in that: the input assembly of the keyboard (1) is provided with initial consonant characters and punctuation characters;
the layout design steps of the initial consonant character and the punctuation mark character are as follows:
step one: target user data investigation;
step two: collecting an initial and final input data set and analyzing an algorithm;
step three: cleaning input data of initials and finals, and performing word segmentation operation to obtain a vocabulary sequence;
step four: extracting front and rear initials contained in the words, calculating the occurrence frequency of each different ordered binary group, drawing a correlation matrix according to statistics, and intuitively analyzing the correlation among the initials;
step five: investigation of data of the knocking difficulty of the initial and final input fingers, investigation of the load data born by the flexibility of the initial and final input fingers, and reasonable distribution of the initial and final layout according to the fingers of the working intensity according to the balance principle of left-right hand alternation rate in human engineering;
step six: the key position is correspondingly changed into initial consonant and final sound.
2. The keyboard design method based on initial and final layout according to claim 1, wherein: the input component of the keyboard (1) is a keycap.
3. The keyboard design method based on initial and final layout according to claim 1, wherein: the target user in the first step is a word editor who uses double spelling simplification to accelerate the typing speed.
4. The keyboard design method based on initial and final layout according to claim 1, wherein: and step two, collecting data, namely capturing three types of data of news, blogs and novels, counting the number of the initial consonants and the vowels, and judging the association degree of the initial consonants and the vowels by utilizing a correlation matrix.
5. The keyboard design method based on initial and final layout according to claim 1, wherein: and in the third step, the cleaning of the initial and final input data comprises the steps of screening out the data which does not meet the requirements, and carrying out corresponding transcoding on the data meeting the conditions by using python to read the pronunciation rules of the pinyin dictionary so as to obtain the required initial and final.
CN202210176856.9A 2022-02-25 2022-02-25 Keyboard design method based on initial consonant and vowel layout Active CN114546127B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210176856.9A CN114546127B (en) 2022-02-25 2022-02-25 Keyboard design method based on initial consonant and vowel layout

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210176856.9A CN114546127B (en) 2022-02-25 2022-02-25 Keyboard design method based on initial consonant and vowel layout

Publications (2)

Publication Number Publication Date
CN114546127A CN114546127A (en) 2022-05-27
CN114546127B true CN114546127B (en) 2023-11-24

Family

ID=81679337

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210176856.9A Active CN114546127B (en) 2022-02-25 2022-02-25 Keyboard design method based on initial consonant and vowel layout

Country Status (1)

Country Link
CN (1) CN114546127B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1556457A (en) * 2004-01-05 2004-12-22 郑 方 Chinese whole sentence input method based on key selection double spelling and its keyboard arrangement
CN101174182A (en) * 2006-11-02 2008-05-07 尚晓 Chinese character input method
CN101349946A (en) * 2007-07-20 2009-01-21 无锡职业技术学院 Digital shape-pronunciation code Chinese input method and keyboard
CN104765468A (en) * 2014-01-05 2015-07-08 张刚 Syllable initial and syllable rime double-keyboard sliding input method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1556457A (en) * 2004-01-05 2004-12-22 郑 方 Chinese whole sentence input method based on key selection double spelling and its keyboard arrangement
CN101174182A (en) * 2006-11-02 2008-05-07 尚晓 Chinese character input method
CN101349946A (en) * 2007-07-20 2009-01-21 无锡职业技术学院 Digital shape-pronunciation code Chinese input method and keyboard
CN104765468A (en) * 2014-01-05 2015-07-08 张刚 Syllable initial and syllable rime double-keyboard sliding input method

Also Published As

Publication number Publication date
CN114546127A (en) 2022-05-27

Similar Documents

Publication Publication Date Title
Elbarougy et al. Extractive Arabic text summarization using modified PageRank algorithm
Buttcher et al. Information retrieval: Implementing and evaluating search engines
Pal et al. An approach to automatic text summarization using WordNet
Lin et al. Emotion classification of online news articles from the reader's perspective
US20150347385A1 (en) Systems and Methods for Determining Lexical Associations Among Words in a Corpus
CN104834735A (en) Automatic document summarization extraction method based on term vectors
CN105426360A (en) Keyword extracting method and device
Kogilavani et al. Clustering and feature specific sentence extraction based summarization of multiple documents
Odeh et al. Arabic text categorization algorithm using vector evaluation method
Chandra et al. A statistical approach for automatic text summarization by extraction
Renouf et al. Filling the gaps: Using the WebCorp Linguist’s Search Engine to supplement existing text resources
Al-Saqqa et al. Stemming effects on sentiment analysis using large arabic multi-domain resources
CN102760140A (en) Incident body-based method for expanding searches
Ghanem et al. Stemming effectiveness in clustering of Arabic documents
Patel et al. Approaches of anonymisation of an SMS corpus
Fatima et al. New graph-based text summarization method
CN114546127B (en) Keyboard design method based on initial consonant and vowel layout
Akther et al. Compilation, analysis and application of a comprehensive Bangla Corpus KUMono
Shah et al. An automatic text summarization on Naive Bayes classifier using latent semantic analysis
Patel et al. Influence of Gujarati STEmmeR in supervised learning of web page categorization
Long et al. Multi-document summarization by information distance
BAZRFKAN et al. Using machine learning methods to summarize persian texts
Maria et al. A new model for Arabic multi-document text summarization
Hamza et al. Text mining: A survey of Arabic root extraction algorithms
Wang et al. Investigating differences in gaze and typing behavior across age groups and writing genres

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant