KR20110128405A - Apparatus for generating the rank of frequency, education system, and method using thereof - Google Patents
Apparatus for generating the rank of frequency, education system, and method using thereof Download PDFInfo
- Publication number
- KR20110128405A KR20110128405A KR1020100047831A KR20100047831A KR20110128405A KR 20110128405 A KR20110128405 A KR 20110128405A KR 1020100047831 A KR1020100047831 A KR 1020100047831A KR 20100047831 A KR20100047831 A KR 20100047831A KR 20110128405 A KR20110128405 A KR 20110128405A
- Authority
- KR
- South Korea
- Prior art keywords
- word
- words
- basic
- frequency sequence
- frequency
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B19/00—Teaching not covered by other main groups of this subclass
- G09B19/06—Foreign languages
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/02—Electrically-operated educational appliances with visual presentation of the material to be studied, e.g. using film strip
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Physics & Mathematics (AREA)
- Educational Administration (AREA)
- Educational Technology (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Entrepreneurship & Innovation (AREA)
- Machine Translation (AREA)
Abstract
The present invention is a word extraction unit for generating a first extraction word to the n-th extraction word by extracting all of the words sequentially from the book files consisting of text; A word DB storing words in a basic form expressed in the present tense and singular form and a derivative derived from the basic form for various words; A basic word generator for searching for a basic type for the first to nth extracted words in the word DB and generating first to nth basic words formed of the basic type; Analyze the first to n-th basic words to generate a basic word set omitting duplicate words, and generate frequency sequence information sequencing each word of the basic word set based on the number of book files and the total frequency of appearance. And a frequency sequence generator for generating the frequency sequence information, wherein the frequency sequence information is information in which each word of the basic word set is arranged based on the number of book files that appear, and the total frequency of appearances when the number of book files that appear is the same. The present invention relates to a frequency sequence generating device, a learning system using the same, and a method, wherein each word of the basic word set is sorted information.
According to the present invention, a learning effect can be enhanced by providing a learner with a list of important words with high frequency of use using frequency sequence information that is sequenced by the number of book files in which words appear and the total frequency in which words appear.
Description
The present invention relates to an apparatus for generating a frequency sequence of words necessary for each learning process, a learning system using the frequency sequence generating apparatus, and a method thereof.
Memorization of words is essential for language learning. To do this, the vocabulary, which summarizes the words necessary for each learning process, was used to intensively memorize the words for each learning process.
However, each word listed in the vocabulary can determine the importance of the word depending on its frequency of use. For example, when talking in English or reading a book in English, the be verb is much more frequently used than the move verb. That is, the importance of words with a high frequency of everyday use is more important.
Therefore, by investigating the frequency of use of words used in each learning process, extracting high frequency words, and learning important words with high frequency of use, the learning effect can be further enhanced.
Since learning the words of high importance first can increase the learning effect, there is an increasing demand for a method of extracting and ordering the frequency of use of words for use in language learning.
The problem to be solved by the present invention is a frequency sequence generating device that can provide a list of important words with high frequency of use by generating frequency sequence information that is sequenced by the number of book files in which the word appeared and the total frequency of the word appeared, frequency To provide a learning system and method using the sequence.
According to an aspect of the present invention, there is provided a frequency sequence generating device comprising: a word extracting unit configured to extract all words sequentially from book files made of text to generate first to nth extracting words; A word DB storing words in a basic form expressed in the present tense and singular form and a derivative derived from the basic form for various words; A basic word generator for searching for a basic type for the first to nth extracted words in the word DB and generating first to nth basic words formed of the basic type; Analyze the first to n-th basic words to generate a basic word set omitting duplicate words, and generate frequency sequence information sequencing each word of the basic word set based on the number of book files and the total frequency of appearance. And a frequency sequence generator for generating the frequency sequence information, wherein the frequency sequence information is information in which each word of the basic word set is arranged based on the number of book files that appear, and the total frequency of appearances when the number of book files that appear is the same. As a reference, each word of the basic word set is characterized by sorted information.
In order to solve the above problems, a learning system using frequency sequence information in which words contained in book files according to the present invention are sequenced is a problem extraction unit that extracts a word group having a high alignment order using frequency sequence information. ; And a problem providing unit for providing the extracted word group to the user in the form of a problem, wherein the frequency sequence information is information in which the words in the book files are sorted based on the number of the book files that appeared. In the case of the same words, the total frequency of appearance is information arranged on a second basis.
Frequency sequence generation method according to the present invention for solving the above problems, (a) collecting a book file (book) consisting of text; (b) generating first to extracted words to nth extracted words by sequentially extracting all words from the collected book files; (c) searching for a basic type for the first to nth extracted words in a word DB that stores words in a basic form expressed in the present tense and singular form, and a derivative derived from the basic form, for a variety of words, Generating a basic word to an nth basic word; (d) analyzing the first to nth basic words to generate a basic word set in which duplicate words are omitted; And (e) generating frequency sequence information sequencing each word of the basic word set based on the number of book files and the total frequency of appearances.
According to the present invention, a learning effect can be enhanced by providing a learner with a list of important words with high frequency of use using frequency sequence information that is sequenced by the number of book files in which words appear and the total frequency in which words appear.
1 is a block diagram illustrating an apparatus for generating a frequency sequence according to the present invention.
2 illustrates frequency sequence information generated according to the present invention.
2 is a block diagram illustrating a learning system in accordance with the present invention.
4 is a flowchart illustrating a frequency sequence generation method according to the present invention.
5 is a flowchart illustrating a learning method using frequency sequence information according to the present invention.
Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. Like reference numerals in the drawings denote like elements.
1 is a block diagram illustrating an apparatus for generating a frequency sequence according to the present invention.
Frequency
The
On the other hand, a plurality of book files may be collected by grouping each course or by each field, and book files covering all courses and all fields may be collected without being grouped.
For example, since the books of the first year middle school and the second year middle school are different, the
The storage unit 70 stores the collected book files. When the book files are collected by grouping each learning process, the storage unit 70 stores the book files by grouping each learning process. Meanwhile, the storage unit 70 may store first to n th extraction words and first to n th basic words to be described later.
FIG. 1 illustrates a case in which book files are grouped and collected. The first group Ga includes a plurality of book files A 1 to A n , and the x group Gx includes a plurality of book files. (X 1 to X n ). Although not shown in FIG. 1, the
The
The book file consists of text, and the text consists of various punctuation marks, spaces, and words. Since words and words are separated by spaces and punctuation marks, words can be sequentially extracted from book files using spaces and punctuation marks. To this end, the
The
The comparing unit 32 determines whether each of the first extracted word to the nth extracted word is a basic type, and the conversion unit 34 converts the first extracted word to the nth extracted word into a basic type when it is not the basic type. It plays a role.
Word is divided into basic type and derived type, and basic word means basic type word. Basic words are words expressed in the present tense and singular. For example, in English, the basic form of the verb means the present tense verb, and the basic form of the noun means the singular noun. Past verbs, present participle, past participle, and plural nouns are all derivatives.
The word DB 150 is a database that stores words in a basic form expressed in the present tense and singular form and a derivative form derived from the basic form for various words.
As described above, the
The frequency
The frequency sequence information is information in which the frequency sequence information is information in which each word of the basic word set is arranged based on the number of book files that appeared, and the basic word set based on the total frequency that appeared when the number of book files appeared is the same. Each word in is sorted information.
The
First, the frequency
Meanwhile, the
The
The
The
The
If the word read according to the determination result of the
When the above process is performed from the first basic word to the nth basic word, information on the number of book files in which the basic word appears and the total frequency in which the basic word appears for each basic word can be obtained. When all the words in the basic word field are extracted from the
Meanwhile, when book files are classified into groups according to each learning process, the
When the
Dealing with a specific topic in a particular book file can lead to an unusually high frequency of erratic words. Therefore, if the number of book files is increased by sorting the number of book files in which the basic word appears by the first sorting criteria, the occurrence of such a problem may be prevented.
Meanwhile, even after the frequency sequence information is generated, the frequency sequence information may be newly updated by continuously collecting new book files through the
2 is a diagram illustrating frequency sequence information generated according to the present invention.
Referring to FIG. 2, in the frequency sequence information, the basic words are sorted based on the number of book files that appear, and the basic words are sorted based on the total frequency of appearances when the number of book files appears is the same. can confirm.
3 is a block diagram illustrating a learning system according to the present invention.
The
As described above, the frequency sequence information is information in which each word is sorted on the basis of the number of book files that appear, and the information in which each word is sorted on the basis of the total frequency of appearance when the number of book files that appear is the same. to be.
As described above, the frequency
The learning
The
The
The correct
The database (DB, DB, 85) may include a
Hereinafter, a frequency sequence generation method according to the present invention will be described in sequence. The frequency sequence generating method according to the present invention is a method performed by the frequency
4 is a flowchart illustrating a frequency sequence generation method according to the present invention.
First, the frequency
Then, the frequency
To this end, the frequency
The frequency
Then, the frequency
5 is a flowchart illustrating a learning method using frequency sequence information according to the present invention. The learning method using frequency sequence information according to the present invention is a method performed in a learning system using frequency sequence information.
First, the
The
The present invention can also be embodied as computer-readable codes on a computer-readable recording medium. The computer-readable recording medium includes all kinds of recording devices in which data that can be read by a computer system is stored. Examples of the computer-readable recording medium include a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like, and may be implemented in the form of a carrier wave (for example, transmission via the Internet) . The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.
While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed embodiments, but, on the contrary, is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the invention. Accordingly, the scope of protection of the present invention should be construed in accordance with the following claims, and all technical ideas within the scope of equivalents and equivalents thereof should be construed as being covered by the scope of the present invention.
Learning Server (80) Collector (110)
Frequency
Word Database (150) Frequency Sequence Database (160)
Claims (5)
A word DB storing words in a basic form expressed in the present tense and singular form and a derivative derived from the basic form for various words;
A basic word generator configured to search the basic type for the first to nth extracted words in the word DB and generate first to nth basic words formed of the basic type; And
A frequency sequence in which each word of the basic word set is sequenced based on the first to nth basic words is analyzed to generate a basic word set omitting duplicate words, and based on the number of book files and the total frequency of appearances. It includes a frequency sequence generator for generating information,
The frequency sequence information is information in which each word of the set of basic words is arranged based on the number of appeared book files as a first criterion, and when the number of appeared book files is the same, the basic frequency based on the total frequency of appearances as a second criterion is used. Frequency sequence generator, characterized in that each word of the word set is sorted information.
The book files are classified into groups according to each learning process.
The frequency sequence generator generates frequency sequence information for each group according to the learning process.
And a storage unit for storing the book files collected by the reception unit.
(a) collecting book files of text;
(b) generating first to extracted words to n-th extracted words by sequentially extracting all words from the collected book files;
(c) searching for a basic form for the first to nth extracted words in a word DB that stores words in a basic form expressed in the present tense and singular form and a derivative derived from the basic form for various words, Generating a first basic word to an nth basic word;
(d) analyzing the first to nth basic words to generate a basic word set in which duplicate words are omitted; And
(e) generating frequency sequence information sequencing each word of the set of basic words based on the number of book files and the total frequency of appearances;
The frequency sequence information is information in which each word of the set of basic words is arranged based on the number of appeared book files as a first criterion, and when the number of appeared book files is the same, the basic frequency based on the total frequency of appearances as a second criterion is used. Frequency sequence generation method characterized in that each word of the word set is sorted information.
The frequency sequence information generating method of the frequency sequence, characterized in that generated for each group according to the learning process.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020100047831A KR20110128405A (en) | 2010-05-24 | 2010-05-24 | Apparatus for generating the rank of frequency, education system, and method using thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020100047831A KR20110128405A (en) | 2010-05-24 | 2010-05-24 | Apparatus for generating the rank of frequency, education system, and method using thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20110128405A true KR20110128405A (en) | 2011-11-30 |
Family
ID=45396589
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020100047831A KR20110128405A (en) | 2010-05-24 | 2010-05-24 | Apparatus for generating the rank of frequency, education system, and method using thereof |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20110128405A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20160000384A (en) * | 2014-06-24 | 2016-01-04 | 안인숙 | Making-Method of Materials for Learning Sino-Korean Word using Database |
KR20210115879A (en) * | 2020-03-16 | 2021-09-27 | 주식회사 이드웨어 | Method and apparatus for training language skills for older people using speech recognition model |
-
2010
- 2010-05-24 KR KR1020100047831A patent/KR20110128405A/en not_active Application Discontinuation
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20160000384A (en) * | 2014-06-24 | 2016-01-04 | 안인숙 | Making-Method of Materials for Learning Sino-Korean Word using Database |
KR20210115879A (en) * | 2020-03-16 | 2021-09-27 | 주식회사 이드웨어 | Method and apparatus for training language skills for older people using speech recognition model |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104504150B (en) | News public sentiment monitoring system | |
Dalvi et al. | Websets: Extracting sets of entities from the web using unsupervised information extraction | |
KR102094934B1 (en) | Natural Language Question-Answering System and method | |
JP6813591B2 (en) | Modeling device, text search device, model creation method, text search method, and program | |
CN102262634B (en) | Automatic questioning and answering method and system | |
US20040249808A1 (en) | Query expansion using query logs | |
US20050251384A1 (en) | Word extraction method and system for use in word-breaking | |
CN109947952B (en) | Retrieval method, device, equipment and storage medium based on English knowledge graph | |
CN104899335A (en) | Method for performing sentiment classification on network public sentiment of information | |
Kunneman et al. | Open-domain extraction of future events from Twitter | |
Korn et al. | Automatically generating interesting facts from wikipedia tables | |
KR20070007001A (en) | Method and apparatus for searching information using automatic query creation | |
CN116070599A (en) | Intelligent question bank generation and auxiliary management system | |
Quintard et al. | Question Answering on web data: the QA evaluation in Quæro | |
Boschetti et al. | Computational analysis of historical documents: An application to italian war bulletins in world war i and ii | |
Perea-Ortega et al. | Application of text summarization techniques to the geographical information retrieval task | |
Campbell et al. | Content+ context networks for user classification in twitter | |
JP6942759B2 (en) | Information processing equipment, programs and information processing methods | |
KR20110128405A (en) | Apparatus for generating the rank of frequency, education system, and method using thereof | |
Hamoud et al. | Evaluation corpus for restricted-domain question-answering systems for the holy Quran | |
van Schooten et al. | Handling speech input in the ritel QA dialogue system. | |
Goh | Using named entity recognition for automatic indexing | |
CN113392647B (en) | Corpus generation method, related device, computer equipment and storage medium | |
Jijkoun et al. | Preprocessing documents to answer Dutch questions | |
CN113934910A (en) | Automatic optimization and updating theme library construction method and hot event real-time updating method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E902 | Notification of reason for refusal | ||
E601 | Decision to refuse application |