WO2016076622A1 - 문서 선택에 따른 가이드라인 제공방법, 이를 수행하기 위한 프로그램이 기록된 컴퓨터 판독가능한 기록 매체 및 매체에 저장된 단말장치용 어플리케이션 - Google Patents
문서 선택에 따른 가이드라인 제공방법, 이를 수행하기 위한 프로그램이 기록된 컴퓨터 판독가능한 기록 매체 및 매체에 저장된 단말장치용 어플리케이션 Download PDFInfo
- Publication number
- WO2016076622A1 WO2016076622A1 PCT/KR2015/012096 KR2015012096W WO2016076622A1 WO 2016076622 A1 WO2016076622 A1 WO 2016076622A1 KR 2015012096 W KR2015012096 W KR 2015012096W WO 2016076622 A1 WO2016076622 A1 WO 2016076622A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- word
- document
- words
- information
- test
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Definitions
- the present invention relates to a method for providing a guideline for easily reading a foreign language document, and more particularly, to determine whether a user's selected document is suitable for reading using a word included in the foreign language document, and to include it in the selected foreign language document.
- the present invention relates to a method for providing a guideline according to document selection, by providing an optimized wordbook optimized for a word, which makes it easier to read a foreign language document.
- Korean Patent Publication No. 2011-0024419 extracts a word from a sentence and provides the user with the meaning of the extracted word so that the user can easily translate the sentence. Furthermore, the word is provided in consideration of the user's vocabulary level. It suggests how to recommend.
- the existing technology determines the user's vocabulary level based on the dictionary difficulty of the word selected by the user, and can be determined only when certain data is accumulated for the user.
- the present invention also relates to a method for providing a guideline according to a document selection, by which a word included in a selected foreign language document can be extracted to customize a word book by a user's selection.
- the present invention relates to a method for providing a guideline according to a document selection when displaying a document selected by a user on a client terminal device together with displaying semantic information of a word included in the displayed document based on the customized wordbook.
- a guideline providing method if a document written in a first language is selected, the frequency of use is extracted by extracting a word included in the selected document And calculating a difficulty level, displaying a test word based on the frequency of use and difficulty level, and determining a suitability for the selected document based on a test result for the test word.
- the words included in the document may be selected and displayed according to the frequency of use or the difficulty in the order of low or high, or may be selectively displayed for each number of difficulty.
- the method may further include providing a goodness of fit for the document based on the test result.
- the test result may be calculated by grouping the difficulty levels of the test word, and the providing of the fitness may provide learning word information for reading the document based on the test result calculated by grouping the difficulty levels. .
- the method may further include receiving memorization target word information from the learning word information, and generating a vocabulary from the memorization target word information.
- the method may further include generating a first wordbook from the memorization target word information of the word group extracted based on the frequency of use or the difficulty.
- the generated first vocabulary includes at least one of dictionary meaning information, part-of-speech information, word circular information, semantic information and sentence sentence information used in the document, memorization hint information, difficulty level, frequency of use, and a learning problem. Can be provided.
- the generated first vocabulary may be managed by dividing each memorized target word into a word to be memorized, a known word or a completed word, or separately storing the words.
- the method may further include displaying the document, and displaying semantic information with respect to a word included in the selected wordbook among words included in the displayed document.
- Receiving a search word for the document and providing a document list including documents matching the search word, wherein the number of words to be memorized, the share of use of the words to be memorized, the number of words to be known,
- the document list may be sorted according to the use of the word, sales volume, rating order, release date or price.
- the method may further include displaying the document, wherein the display of the semantic information may be omitted for a word included in a preset exception word list among words included in the displayed document.
- the meaning of the words contained in the wordbook among the words included in the displayed portion can be displayed together so that it can be easily read without a separate search.
- FIG. 1 is an exemplary view schematically showing a guideline providing method according to document selection according to an embodiment of the present invention.
- FIG. 2 is a flowchart illustrating a guideline providing method according to document selection according to an embodiment of the present invention.
- 3 to 8 are flowcharts illustrating a guideline providing method according to document selection according to another embodiment of the present invention.
- FIG. 9 is a block diagram of a guideline providing system according to document selection according to an embodiment of the present invention.
- FIG. 10 is a block diagram of a guideline providing system according to a document selection according to another embodiment of the present invention.
- FIG. 11 is an exemplary diagram illustrating data and information flow of a method for providing a guideline according to document selection according to an embodiment of the present invention.
- FIG. 12 is an exemplary view showing data and information flow of a guideline providing method according to document selection according to another embodiment of the present invention.
- FIG. 13 is an exemplary view schematically showing a word group generated by a method for providing a guideline according to a document selection according to an embodiment of the present invention.
- FIG 14 is an exemplary view showing a document displayed on the client terminal by the guideline providing method according to the document selection according to another embodiment of the present invention.
- a guideline providing method according to document selection according to an embodiment of the present invention is shown.
- the frequency of use and difficulty may be calculated by extracting a word included in a document written in each first language (S10).
- FIG. 1 shows a connection relationship for service provision by a guideline providing system (hereinafter referred to as a providing system) according to document selection according to an embodiment of the present invention
- FIG. 7 is a configuration diagram of the providing system according to the present embodiment.
- server 10 the providing system may be implemented as a computer system capable of executing a program for providing a guideline according to document selection.
- a user who wants to select and read a document written in a foreign language can access a server 10 in which various foreign language documents are registered through a network using a client terminal device (not shown).
- a client terminal device not shown
- a case where a foreign language document is registered in the server 10 has been described as an example.
- the present invention is not limited thereto, and the foreign language document may be stored and managed in the client terminal device. Therefore, a service may be provided in a form of directly selecting a foreign language document from a client terminal without access to a separate server 10.
- the term "document” may be a concept including both a document file in text or binary form or an image file including text information of an image of the document.
- the server 10 may be hardware in the same configuration as a typical web server, a web application server, or a web server, or a WAP server.
- a web application server or a web server
- a WAP server or a WAP server.
- C C
- C ++ Java You can include program modules that can be implemented in any language, including PHP, .Net, Python, Ruby, and more.
- the server 10 may be connected to a client terminal device such as a computer, a mobile terminal, or another server 10 through a network, whereby the server 10 may work on the client terminal device or another server 10. It may refer to a computer system that receives a request for performance and derives and provides a work result thereof, or computer software (server program) installed for the computer system.
- the server 10 has a broad concept including a series of application programs that operate on the server 10 in addition to the above-described server program, and, in some cases, various databases built internally or externally. It should be understood that content, various information and data can be stored and managed in a database.
- the database may be implemented inside or outside the server 10.
- the client terminal device refers to a machine and a device for performing a computing operation including not only a general PC such as a desktop or a notebook, but also a mobile terminal such as a smartphone, a tablet PC, a PDA, and the like.
- the network is a network connecting the server 10 and the client terminal device, and may be a closed network such as a local area network (LAN), a wide area network (WAN), or an open type network such as the Internet. It may be a network.
- the Internet may mean a global open computer network structure that provides various services existing in the TCP / IP protocol and its upper layer, and includes a mobile terminal such as a smart phone, a tablet PC, a PDA, and a smart phone.
- the network may further include a wireless access network such as a mobile communication network or a Wi-Fi network.
- various documents written in a foreign language may be registered in the server 10.
- various documents written in a foreign language may be stored and managed in the client terminal device.
- These documents include electronically documented books, articles and articles, as well as documents in all fields, including novels, magazines, technical books, poetry, fairy tales, educational texts, periodicals, newspapers, papers, articles, etc., distributed online or offline. It may be included, and may be a file of various formats such as DOC, HWP, ePub, BeBB, AZW, PDF, PPT, XLS, TXT, RTF.
- identification information that can be directly identified or inferred from the document, such as the title of the document, the author, the publisher, the release date, the price, the sales amount, the rating or a set of randomly assigned symbols and numbers. Can be.
- the document may be stored and managed in a database in association with the identification information.
- the first language is a specific language typically used in the preparation of the document, and may be any kind of language such as English, Japanese, Chinese, German, French, Italian, and Arabic.
- English becomes the first language.
- the words included in the registered document are extracted from the word analysis module 100 and classified into a plurality of grades according to a predetermined difficulty classification method, and the number of times the words belonging to each of the plurality of grades are used in the document.
- the frequency of use can be calculated.
- the predetermined difficulty classification method may be a method of classifying English words in a specific dictionary according to a difficulty level of the dictionary, and the predetermined difficulty classification criteria may be stored in the database in advance.
- the scope of the present invention is not limited thereto, and may be a difficulty classification standard arbitrarily set by an administrator of the providing system.
- the difficulty classification method may be regularly updated by the providing system administrator or automatically by the providing system at regular intervals.
- the word analysis module 100 may determine the number of words corresponding to each of the plurality of grades, the utilization rate of words included in each of the plurality of grades, and the plurality of grades, respectively. Word usage status information including information on the number of use of the included words may be provided.
- the difficulty of the words included in the document and the frequency of use may be stored in the database in association with the identification information of the document, and may be provided to the client terminal device through the transmission / reception module 130.
- the word analysis module 100 may not be provided in the server 10 but may be disposed in the client terminal device. In this case, in performing the word analysis task in the word analysis module 100, communication with the server 10 may be omitted.
- a document list including the document may be provided (S20).
- the display module 120 may be provided in the form of a list or icon platform so that the registered document can be selected on the screen of the client terminal device accessing the server 10, but is not limited thereto.
- the document list may be generated directly in the client terminal device without accessing the server 10.
- the display module 120 may be provided at the client side to provide a service.
- the platform can be implemented in various languages, plug-ins, and graphic technologies such as html, Css, javaScript, Visualbasic Script, PHP, JSP, ASP, ASP.net, Document Object Model, CGI, SVG, Canvas API, Flash, Shockwave, Java, etc. .
- the identification information of the user who accesses the service can be received and stored in a database.
- the identification information is information that can be directly identified or inferred from the user.
- the name, ID, password, age, gender, address, and telephone number An e-mail address, an SNS address, a homepage address, a credit card number, a social security number, or a set of randomly assigned symbols and numbers.
- information on the selected document may be directly provided by the client terminal device or received from the server 10 (S30).
- a test word may be displayed based on the difficulty and the frequency of use among words included in the selected document (S40).
- test word is selected according to the order of low or high difficulty among words in a first language included in the selected document.
- the frequency of use is selected in the order of low or high, it may be displayed on the client terminal device through the display module 120.
- the test word may be selected and displayed by a predetermined number for each difficulty level.
- test words may be selected according to selection criteria arbitrarily set by an administrator of the providing system.
- the selected test word may extract a sentence actually applied to the selected document to make the extracted sentence itself a test word.
- the user may input the meaning of the test word displayed on the client terminal device in the second language.
- the user may directly input semantic information, and may select and input any one of a plurality of views.
- the second language is a language that the user wants to interpret the first language of the document into another language, and may be any kind of language such as English, Japanese, Chinese, German, Japanese, French, Italian, and Arabic.
- a person who wants to read a document written in English in Korean can input the meaning in Korean in a test word composed of English words. In this case, English becomes the first language and Korean becomes the second language.
- the semantic information input through the client terminal device may be received by the server 10, and the semantic information of the test word input in the received second language by the test module 110 is defined in advance. Can be determined whether or not.
- another search server 10 capable of comparing or searching a dictionary or translating a language by using the database in which the providing system administrator previously inputs and stores the exact meaning of the test word.
- the semantic information of the test word may be requested to the translation server 10 and the library and compared.
- the client terminal device may include a test module 110 to determine whether the input semantic information directly matches a predefined meaning in the client terminal device.
- test module 110 may calculate the number of correct answers in which the user correctly inputs the meaning of the test word based on the determination of the match, and may calculate a test result based on the number of correct answers.
- a score for each test word by assigning an arbitrary weight to the frequency or difficulty of the test word, a value obtained by summing all the scores of the test words in which the meaning is input accurately is a test result.
- the scope of the present invention is not limited thereto, and the test result may be calculated according to various criteria set by the administrator of the providing system.
- the user may be provided a goodness of fit for the selected document (S50).
- the suitability refers to whether the learning level for the first language is sufficient for the user to read the selected document.
- the test result is more than a certain score or ratio in the test module 110
- the user A 'suitability' can be determined that the learning level for the first language is sufficient to read the selected document, and if the test result is less than or equal to the predetermined score or ratio, the user reads the selected document to the first. Inappropriate judgment can be made that the level of learning about the language is insufficient.
- the determination through the transmission and reception module 130 may transmit a fitness message that said 'suitable' or 'unsuitable' to the terminal device of the client, wherein, the test information includes whether the semantic information match and the exact semantic information Test results can be sent together.
- the words included in the document selected by the user can be directly used in the test to accurately determine whether the document selected by the user is suitable for the user's level, thereby supporting the user's efficient document selection.
- the user may be asked whether to test using another test word included in the selected document and not overlapping with the test word (S51).
- test word Since the test result may vary according to the test word, and thus the user's suitability for the same document may be changed, the test word is not used among the words included in the selected document to increase the reliability of the goodness of fit. Other test words may be used to ask the user whether or not to repeat the test.
- the difficulty level is low or high, except for the test words previously selected from among the words in the first language included in the selected document in the test module 110. If the different test words are selected or the other test words are selected according to the order of low or high frequency of use, the display module 120 may display the selected other test words on the client terminal device (S52). ). In addition, the other test words may be selected and displayed by an arbitrary number for each difficulty level. However, the scope of the present invention is not limited thereto, and the other test words may be selected according to the selection criteria arbitrarily set by the administrator of the providing system.
- the fitness of the selected document may be provided to the user based on the re-execution test result and the existing test result (S53). Receiving the semantic information about the other test word from the user to confirm whether the match, calculation of the number of correct answers for the other test word and the calculation method of the re-execution test result is confirmed to match the existing test word, the test word Is the same as the method for calculating the correct answer and the test result.
- the test module 110 may provide a suitability to the client based on the existing refit test method based on the retest test result, and calculate the average value by adding the retest test result and the existing test result. Based on this, the fitness can be provided to the user according to the fitness determination method.
- the fitness test can be conducted using another test word included in the selected document continuously by the user's selection. Through this process, the reliability of the fitness can be improved, and the user can learn words included in the selected document.
- test results may be calculated by grouping by the difficulty of the test word.
- the test words used for the test may be classified and grouped according to the difficulty. Based on this, the total number of problems of the test words displayed to the user or the total score of the scores set in the test words can be calculated for each difficulty group. In addition, for each difficulty group, the user may calculate the total number of correct answers or correct scores obtained by accurately inputting semantic information of the test word.
- the learning word information is information for recommending a learning word that the user should learn in order to easily read the selected document.
- the learning word information includes the word usage status information, the number of learning words, a learning word list, and a learning word.
- One or more information of semantic information or recommendation message may be included.
- the word analysis module 100 corresponds to a difficulty higher than the learning difficulty, including the word corresponding to the learning difficulty or the learning difficulty among words included in the selected document in cooperation with the test module 110.
- Learning words can be calculated based on all words.
- the learning difficulty is calculated in plural, it is preferable to calculate a learning word based on the lowest level of learning difficulty among the plurality of learning difficulties, but the present invention is not limited thereto.
- the learning word can be calculated based on the learning difficulty of the grade according to the grade or any criterion set by the management of the providing system.
- All words included in the learning difficulty may be calculated as a learning word, or a word more than a certain frequency of use among words included in the learning difficulty may be calculated as a learning word.
- the remaining words except for the test word in which the user inputs the semantic information accurately may be calculated as the learning word.
- the word usage information, learning word count, learning word difficulty, learning word list, learning word semantic information, learning word part-of-speech information, or a recommendation message including any one or more of the information Word information can be generated.
- the user can be provided with a list of the calculated learning words along with a message that the user can read the selected document by learning only 200 words. It can support document selection and provide the amount of learning needed to read the document.
- the wordbook providing module 140 in the The wordbook may be generated in the form of a table, a table, a learning card to facilitate learning by receiving the selected memorizing target word information, or another form arbitrarily set by the providing system administrator.
- the word to be memorized is a word selected by the user from a list of words provided to the user, and means a word that the user does not know or wants to know the exact meaning.
- the wordbook providing module 140 may be provided in the client terminal device instead of the server, and may be configured to enable the wordbook service without being connected to the server.
- the frequency of use to extract the words contained in the first document of the document written in the first language And difficulty can be calculated (S100).
- the providing system may be independently implemented in the client terminal device.
- the providing system may be implemented as a computer system capable of executing a program for providing guidelines according to document selection.
- a user who wants to select and read a document written in a foreign language accesses the server 10 through a network using a client terminal device, or accesses the document by accessing a program, an application or a website installed in the client terminal device. can do.
- the server 10, the network, and the client terminal device are the same as described above.
- various documents written in a foreign language may be registered in the server 10 or the client terminal device.
- These documents include electronically documented books, articles and articles, as well as documents in all fields, including novels, magazines, technical books, poetry, fairy tales, educational texts, periodicals, newspapers, papers, articles, etc., distributed online or offline. It may be included, and may be a file of various formats such as DOC, HWP, ePub, BeBB, AZW, PDF, PPT, XLS, TXT, RTF.
- identification information that can be directly identified or inferred from the document can be inputted, such as the title, author, issuer, issue date, price, or a set of symbols or numbers.
- the document may be stored and managed in a database in association with the identification information.
- the first language is a specific language typically used in the preparation of the document, and may be any kind of language such as English, Japanese, Chinese, German, Japanese, French, Italian, and Arabic.
- English becomes the first language.
- the words included in the registered document are classified into a plurality of grades according to a predetermined difficulty classification method in the word analysis module 100, and a frequency of use which is the number of times of use of the words belonging to each of the plurality of grades in the document. Can be calculated.
- the predetermined difficulty classification method may be a method of classifying English words in a specific dictionary according to a difficulty level of the dictionary, and the predetermined difficulty classification method may be stored in the database in advance.
- the scope of the present invention is not limited thereto, and may be a difficulty classification method arbitrarily set by an administrator of the providing system.
- the difficulty classification method may be regularly updated by the providing system administrator or automatically updated by the providing system at regular intervals.
- the word analysis module 100 may determine the number of words corresponding to each of the plurality of grades, the utilization rate of words included in each of the plurality of grades, and the plurality of grades, respectively. Word usage status information including information on the number of use of the included words may be provided.
- the difficulty and the frequency of use of words included in the document may be stored and managed in association with the identification information of the document, and may be provided to the client terminal device through the transmission / reception module 130.
- a document list including the document may be provided.
- the registered document may be displayed on a screen of a client terminal device in the form of a list or icon platform.
- the platform can be implemented in various languages, plug-ins, and graphic technologies such as html, Css, javaScript, Visualbasic Script, PHP, JSP, ASP, ASP.net, Document Object Model, CGI, SVG, Canvas API, Flash, Shockwave, Java, etc. .
- the information of the first document can be confirmed.
- FIG. 13 illustrates one form of a word group according to the present embodiment.
- the difficulty or frequency of use of the word group may be arbitrarily set by an administrator of the providing system, and may receive setting information from the user. For example, when the administrator or the user sets the frequency of use to '2', words having two or more times of use may be extracted from the words included in the first document. In this case, it can be extracted except for word groups corresponding to the level of difficulty of the elementary, middle and high levels.
- Memorizing target word information of the word group may be received from the user (S120).
- the display module 120 may display the word list included in the extracted word group on the client terminal device. Accordingly, the user may select a word to be memorized from the word list displayed on the client terminal device.
- the word to be memorized is a word selected by the user among the listed words, and may mean a word that is unknown or wants to know the exact meaning.
- a first vocabulary may be generated from the memorized target word information (S130).
- the wordbook providing module 140 may generate a first wordbook to the user based on the memorization target word information selected by the user and provide the first wordbook to the user.
- the first vocabulary may mean a vocabulary for words included in the first document, and the memorized target words selected by the user may be displayed on the client terminal device in a table or a table form.
- the client terminal may directly upload and register any word other than the memorized target word in the generated first wordbook, and select and delete a specific word included in the first wordbook, or change it to another wordbook. You can move it.
- the calculating of the frequency of use and difficulty may provide first learning word information based on a word included in the first document.
- the first learning word information is information for recommending a learning word that a user should learn in order to easily read the first document.
- the first learning word information includes the status information of the word, the number of learning words, and a learning word. One or more of the list, the expected time of study, or the recommended message may be included.
- the word analysis module 100 may extract a learning word according to a predetermined learning word extraction criterion among words included in the first document.
- the predetermined learning word extraction criterion may be a specific difficulty or frequency of use, and may be a word included in a specific wordbook, a database, or tag information.
- a word having a specific difficulty or frequency of use based on the difficulty or frequency of use calculated by the word analysis module 100 is extracted as a learning word, or difficulty of a predetermined level or less.
- a word having a frequency of use may be extracted as a learning word.
- a learning word may be extracted by excluding or adding a word included in any wordbook, database, or tag information generated from the user among words included in the first document.
- the learning word extraction criteria may be stored in advance in the database.
- the words corresponding to the elementary, middle, and high difficulty levels can be excluded from the learning word. Can be excluded.
- the cumulative use weights for accumulating the use frequencies may be calculated according to the order of the high frequency of use of the words. If the cumulative use weight reaches a predetermined value or more, the words not included in the cumulative use weights are included in the learning word. It can also be excluded.
- the learning word extraction criteria may be extracted according to various criteria to provide more efficient learning information to the user.
- the word analysis module 100 calculates the number of groups by the difficulty level based on the extracted learning word, and calculates the number by the learning difficulty time according to the difficulty level of the predetermined word or the expected learning time of the user.
- the total estimated time for learning all the extracted learning words may be calculated by summing all the values multiplied by the calculated number.
- the learning expected time for each difficulty level of the predetermined word may be arbitrarily set by an administrator of the providing system and stored in a database in advance.
- the first word for the first document including any one or more information of the word usage status information, the number of learning words, the difficulty of learning words, the list of learning words, the expected time of study, or the recommended message based on the extracted learning words.
- Learning word information may be generated and provided to the client terminal device.
- the user can be provided with information on the amount of learning to read the document easily so that the user can more efficiently select the document suitable for the learning and level.
- the user may select a memorization target word from the list for the learning word and generate a wordbook based on the selected memorization target words.
- the extracting of the word group may include information on one or more of frequency of use, difficulty, usage form information, part-of-speech information, word circular information, and learning completion time of the extracted word.
- each word included in the list may be analyzed to provide information.
- Usage type information, difficulty level of each word included in the list, usage type information in which the words included in the list are actually applied in the sentences of the first document, part-of-speech information of each word included in the list, and word circle May contain information.
- learning completion time information may be included.
- the user may sort the word list in order of the frequency of use or the difficulty, in order of high or low, and filter to display only words having any frequency of use, difficulty, part-of-speech information, or word circular information. For example, you can filter to display only words that have elapsed from the learning completion time.
- the word analysis module 100 may classify and classify the words, parts of speech or word circle according to a predetermined difficulty classification method, a part-of-speech classification method, or a word circular classification method, and classify the words, parts of speech or words.
- the frequency of use which is the number of times the words belonging to the prototype are used in the document, can be calculated.
- the predetermined difficulty classification method, the part-of-speech classification method, or the word circular classification method may be a method of classifying English words according to a difficulty level, a part-of-speech, or a word circle of a dictionary in a specific dictionary.
- the prototype classification method may be stored in advance in the database.
- the scope of the present invention is not limited thereto, and may be a difficulty, part-of-speech, or word circle classification method arbitrarily set by an administrator of the providing system.
- the learning completion time and the elapsed time will be described in detail later.
- the word circle information, part-of-speech information, the dictionary meaning information of the selected memorized words when the user selects any one of the words to be memorized from the generated first word book, the word circle information, part-of-speech information, the dictionary meaning information of the selected memorized words.
- a separate sub-window may be displayed by displaying semantic information used in the first document and sentence example information based on the sentence used in the first document, memorization hint information, difficulty level, frequency of use, or learning problem. Can be.
- the learning problem is to test whether the user has completed the learning of each of the memorization target words included in the first wordbook, so that the user directly inputs the meaning of the memorization target word or one of a plurality of options.
- the client terminal may provide a word problem in a subjective or multiple choice form for the memorized target word.
- Memorizing target word input correct answer through the learning problem may be said that the user has completed the learning.
- the scope of the present invention is not limited thereto, and the first vocabulary may further provide various learning functions to more easily memorize and learn the words to be memorized.
- the first vocabulary is divided by the user into the words to be memorized, words to know or learning completed words, memorized words, or to create a separate vocabulary for the words to be memorized, known words or learning completed words It can be managed or stored in a separate database.
- the memorization target words included in the first vocabulary are classified as words to be memorized or a word to be memorized by the user for measurement of performance according to future user learning, efficient learning, and continuous management. Can be.
- the memorized target word in which the correct answer is input through the learning problem may be classified as a learning completed word.
- tag information is added to the classified memorized words, known words, or learning completion words, respectively, to sort and filter according to the tag information, and separately for the memorized words, known words, or learning completed words. It is possible to generate a vocabulary of, and to move the word to be memorized, knowing words or learning completed words to another vocabulary.
- the word to be memorized, knowing words or learning completed words can be stored in a separate database in conjunction with the identification information of the user, the case of the learning completed words the time passed the test through the learning problem learning completion time Can be stored together.
- the frequency of use and difficulty are calculated by extracting words included in the first document, and based on the test words, the test words are selected and provided to the user to test the suitability of the user to read the first document. Can be.
- a word in which the correct answer is input among the test words it may be classified as the known word.
- the usage frequency and difficulty calculation method, test word screening method, and fitness test method are as described above.
- the scope of the present invention is not limited thereto, and the words to be memorized may be variously classified, such as words to be memorized, words to be understood, or words completed to learn, words to be confused, key words, and the like.
- the user can directly upload separate words and sort or store them.
- a user may provide a word book that is more optimized for reading the first document, and the user may separately create and manage an integrated word book or database related to the first language regardless of the first document.
- the user may filter to exclude the word included in the known word or the completed word.
- a list of words included in another document or words included in the memorized word information may be filtered to exclude words that match by comparing the words included in the wordbook or the database for the known word or the completed word.
- the learning completion word when filtering the list of words included in the other document or the word included in the memorization target word information based on the learning completion word, from the learning completion time of the learning completion word to the filtering time
- Each of the elapsed days may be calculated, and the learning completion word having the elapsed date less than or equal to a certain period may be filtered out of the filtering.
- the arbitrary period may be set by the user or the providing system administrator in units of days, weeks, months, and years. In this way, the word completion may be forgotten because the learning completion time is long, and the wordbook may be re-learned, and words optimized for the first language level of the user may be provided as the wordbook.
- a search word for the document may be input from the user (S140).
- the search module 150 of the server 10 may provide the user with a search input window for searching at least one of the documents registered in the server 10.
- the user may select the client terminal device. You can enter a search term.
- the search module 150 may compare documents stored in the database with identification information of each document, and select documents matching the search word.
- the matching documents may be displayed on a screen of a user terminal device in a list or icon manner.
- the list or icon of the document may be sorted according to the number of words to be memorized, the use ratio of the words to be memorized, the number of words to know, the use ratio of the words to know, sales volume, rating order, release date or price.
- the words to be memorized or the words to be memorized of the user stored in the database for each document are compared with the words extracted from the documents, respectively. You can sort by.
- the searched documents may be arranged in order according to the sales amount, rating, release date, or high or low order included in the identification information of the document.
- the use weight of the word to be memorized or the use weight of the word to know represents the weight occupied by the word or the word to be memorized among the words included in the document.
- the document number can be calculated by dividing the total number of words extracted from the document from the memorized word number or the number of words memorized for each document, and thus the searched documents are sequentially ordered according to the order in which the calculated ratio is high or low. You can sort.
- the present invention is not limited thereto, and the document list or icon may be arranged based on various information.
- the first document may be displayed on the client terminal device (S160).
- the user may select the first document and request reading, and when the server 10 receives the read request information of the first document, the user reads the file of the first document from the database and the client terminal device. Can be marked on.
- the first wordbook or a separate wordbook may be selected (S170).
- the user is included in the selected word book among the words contained in the first document displayed on the client terminal device. Meaning information about a word may be displayed together on the client terminal device.
- the word terminal list for the first document or the other word list generated by the user may be provided to the client terminal device to select one word list.
- the word analysis module 100 extracts a word included in the first document displayed on the client terminal device, compares the words in the wordbook selected by the user, and compares the semantic information of the matching words with one side of the client terminal device.
- the word and semantic information may be displayed and provided together in a fixed form, a popup form on one side, or an arbitrary form set by the providing system administrator.
- the administrator of the providing system is a list of exception words that are words that do not need to be interpreted or ignored separately for reading the selected document. May be set in advance, and a word included in the exception word list may not be displayed on the client terminal device.
- the exception word list may include words included in the wordbook or database for the known word or the learning completed word.
- FIG. 14 illustrates one embodiment in which the semantic information of words matching by the first document and the selected wordbook are displayed together on the client terminal device according to the present embodiment.
- the user when displaying the document selected by the user on the client terminal device, the user can display and provide the semantic information of the words included in the portion displayed on the client terminal device that the user may not know. You can avoid the additional task of searching for a vocabulary or dictionary separately to interpret the document, and make it easier to read the selected document.
- some of the modules that may be arranged in the server or the client terminal device may be partly arranged in the client terminal device or distributed in the server side.
- the method of providing guidelines according to the document selection described above may also be embodied as computer readable codes on a computer readable recording medium. That is, there may be a computer-readable recording medium in which a program for performing the guideline providing method according to the document selection according to the embodiment of the present invention is recorded.
- the computer-readable recording medium includes all kinds of recording media on which data that can be read by a computer system is stored. Examples of computer-readable recording media include, but are not limited to, ROM, RAM, CD-ROM, magnetic tape, floppy disk, optical data storage device, and the like.
- the guideline providing method according to the document selection described above may be implemented as a program (or an application) and stored in a medium to be executed in combination with a terminal device which is hardware.
- the program may include C, C ++, JAVA, machine language, etc. which the processor (CPU) of the client terminal device can read through the device interface of the client terminal device so that the client terminal device reads the program and executes the methods.
- Code may be coded in the computer language of. Such code may include functional code associated with a function that defines necessary functions for executing the methods, and an execution procedure related control code necessary for the processor of the client terminal device to execute a predetermined procedure. It may include.
- the code may include a memory reference code for additional information or media required to execute the functions by the processor of the client terminal device at which location (address address) of the internal or external memory of the client terminal device should be referred. It may further include.
- the code may be used by any other computer remotely using the communication module of the client terminal device. Or a communication related code for how to communicate with a server or the like, and what information or media should be transmitted and received during communication.
- the client terminal device may be implemented in various devices such as a mobile phone, a smart phone, a tablet computer, a notebook computer, a digital broadcasting terminal, a personal digital assistant (PDA), a portable multimedia player (PMP), a navigation device, a digital TV, and a desktop computer.
- PDA personal digital assistant
- PMP portable multimedia player
- the stored medium is not a medium for storing data for a short time such as a register, a cache, a memory, but semi-permanently, and means a medium that can be read by the device.
- examples of the storage medium include, but are not limited to, a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like.
- the program may be stored in various recording media on various server 10 to which the client terminal apparatus can access or various recording media on the client terminal apparatus of the user.
- the media may also be distributed over network coupled computer systems so that the computer readable code is stored in a distributed fashion.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Electrically Operated Instructional Devices (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (13)
- 제1 언어로 작성된 문서가 선택되면, 상기 선택된 문서에 포함되는 단어를 추출하여 사용빈도 및 난이도를 산출하는 단계;상기 사용빈도 및 난이도를 기초로 테스트단어를 표시하는 단계; 및상기 테스트단어에 대한 테스트결과를 기초로 상기 선택된 문서에 대한 적합도를 결정하는 단계;를 포함하는, 문서 선택에 따른 가이드라인 제공방법.
- 제1항에 있어서,상기 테스트단어를 표시하는 단계는,상기 문서에 포함되는 단어 중 상기 사용빈도 또는 상기 난이도가 낮은 또는 높은 순서에 따라 선별하여 표시하거나, 또는 상기 각 난이도별로 임의의 수만큼 선별하여 표시하는 것을 특징으로 하는, 문서 선택에 따른 가이드라인 제공방법.
- 제1항에 있어서,상기 문서에 포함되며 상기 테스트단어와 중복되지 않는 다른 테스트단어를 사용하여 재실시하는 테스트에 응하는 메시지를 수신할 경우, 상기 다른 테스트단어를 표시하는 단계; 및상기 재실시 테스트결과와 기존의 상기 테스트결과를 기초로 상기 문서에 대한 적합도를 결정하는 단계;를 더 포함하는, 문서 선택에 따른 가이드라인 제공방법.
- 제1항에 있어서,상기 테스트결과는,상기 테스트단어의 상기 난이도별로 그룹하여 산출하되,상기 적합도를 제공하는 단계는,상기 난이도별로 그룹하여 산출된 테스트결과를 기초로 상기 문서를 읽기 위한 학습단어정보를 제공하는 것을 특징으로 하는, 문서 선택에 따른 가이드라인 제공방법.
- 제4항에 있어서,상기 학습단어정보 중 암기대상단어정보를 수신하는 단계; 및상기 암기대상단어정보로부터 단어장을 생성하는 단계;를 포함하는, 문서 선택에 따른 가이드라인 제공방법.
- 제1항에 있어서,상기 사용빈도 또는 상기 난이도에 기초하여 추출된 단어그룹의 암기대상단어정보로부터 제1 단어장을 생성하는 단계를 더 포함하는, 문서 선택에 따른 가이드라인 제공방법.
- 제6항에 있어서,상기 생성된 제1 단어장은,선택된 단어에 대한 사전의미정보, 품사정보, 단어원형정보, 상기 문서에 사용된 의미정보 및 문장예문정보, 암기힌트정보, 난이도, 사용빈도 및 학습문제 중 하나 이상의 정보를 제공하는, 문서 선택에 따른 가이드라인 제공방법.
- 제6항에 있어서,상기 생성된 제1 단어장은,각 암기대상단어들을 암기할 단어, 아는 단어 또는 학습완료 단어로 구분을 하거나 별도로 저장하여 관리되는, 문서 선택에 따른 가이드라인 제공방법.
- 제6항에 있어서,상기 문서를 표시하는 단계;를 더 포함하되,상기 표시된 문서에 포함되는 단어 중 상기 제1 단어장에 들어있는 단어에 대하여 의미정보를 함께 표시하는 것을 특징으로 하는, 문서 선택에 따른 가이드라인 제공방법.
- 제8항에 있어서,상기 문서에 대한 검색어를 입력받는 단계; 및상기 검색어와 매칭되는 문서들을 포함하는 문서리스트를 제공하는 단계;를 더 포함하되,상기 암기할 단어개수, 상기 암기할 단어의 사용비중, 상기 아는 단어개수, 상기 아는 단어의 사용비중, 판매량, 평점순, 출시일 또는 가격에 따라 상기 문서리스트를 정렬할 수 있는, 문서 선택에 따른 가이드라인 제공방법.
- 제6항에 있어서,상기 문서를 표시하는 단계;를 더 포함하되,상기 표시된 문서에 포함되는 단어 중 미리 설정된 예외단어 리스트에 포함되는 단어에 대하여 의미정보 표시를 제외하는, 문서 선택에 따른 가이드라인 제공방법.
- 제1항 내지 제11항 중 어느 하나의 항에 기재된 문서 선택에 따른 가이드라인 제공방법을 수행하는 프로그램이 기록된 컴퓨터 판독가능한 기록 매체.
- 하드웨어인 클라이언트단말과 결합되어 제1항 내지 제11항 중 어느 하나의 항에 기재된 문서 선택에 따른 가이드라인 제공방법을 실행시키기 위하여, 매체에 저장된 단말장치용 어플리케이션.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2017526591A JP2018503853A (ja) | 2014-11-11 | 2015-11-11 | 文書選択に応じたガイドライン提供方法、これを遂行するためのプログラムが記録されたコンピュータ読出し可能な記録媒体及び媒体に格納された端末装置用アプリケーション |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR20140156365 | 2014-11-11 | ||
KR10-2014-0156365 | 2014-11-11 | ||
KR10-2015-0025568 | 2015-02-24 | ||
KR1020150025568A KR101618600B1 (ko) | 2015-02-24 | 2015-02-24 | 문서 선택에 따른 가이드라인 제공방법, 이를 수행하기 위한 프로그램이 기록된 컴퓨터 판독가능한 기록 매체 및 매체에 저장된 단말장치용 어플리케이션 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016076622A1 true WO2016076622A1 (ko) | 2016-05-19 |
Family
ID=55954635
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2015/012096 WO2016076622A1 (ko) | 2014-11-11 | 2015-11-11 | 문서 선택에 따른 가이드라인 제공방법, 이를 수행하기 위한 프로그램이 기록된 컴퓨터 판독가능한 기록 매체 및 매체에 저장된 단말장치용 어플리케이션 |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP2018503853A (ko) |
WO (1) | WO2016076622A1 (ko) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP7213509B2 (ja) * | 2019-01-18 | 2023-01-27 | 日本電信電話株式会社 | 語彙発達指標推定装置、語彙発達指標推定方法、プログラム |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20080084146A (ko) * | 2007-03-15 | 2008-09-19 | 황보의 | 단어장 형성 방법 및 시스템 |
WO2010150986A2 (ko) * | 2009-06-24 | 2010-12-29 | Nam Chan-Hee | 외국어 단어 평생 학습 장치 및 방법 |
KR20110032470A (ko) * | 2009-09-23 | 2011-03-30 | 동국대학교 산학협력단 | 단어 게임 장치 및 방법 |
WO2012026674A2 (ko) * | 2010-08-25 | 2012-03-01 | 에스케이텔레콤 주식회사 | 학습 플랜 분석 방법, 장치 및 시스템 |
WO2013085320A1 (ko) * | 2011-12-06 | 2013-06-13 | Wee Joon Sung | 스마트 기기를 이용한 상황 인식 기반 외국어 습득 및 학습 서비스 제공 방법 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH1184999A (ja) * | 1997-09-03 | 1999-03-30 | N T T Data:Kk | 情報提示システム及びその構成装置、記録媒体 |
JP2002072841A (ja) * | 2000-08-29 | 2002-03-12 | Hitachi Ltd | 単語学習システム |
JP5032600B2 (ja) * | 2010-01-07 | 2012-09-26 | 株式会社東芝 | 文書可読性評価プログラムおよび文書可読性評価装置 |
KR20120118520A (ko) * | 2011-04-19 | 2012-10-29 | 두산동아 주식회사 | 난이도별 어휘 분류가 가능한 학습 단말 장치 및 방법 |
-
2015
- 2015-11-11 WO PCT/KR2015/012096 patent/WO2016076622A1/ko active Application Filing
- 2015-11-11 JP JP2017526591A patent/JP2018503853A/ja active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20080084146A (ko) * | 2007-03-15 | 2008-09-19 | 황보의 | 단어장 형성 방법 및 시스템 |
WO2010150986A2 (ko) * | 2009-06-24 | 2010-12-29 | Nam Chan-Hee | 외국어 단어 평생 학습 장치 및 방법 |
KR20110032470A (ko) * | 2009-09-23 | 2011-03-30 | 동국대학교 산학협력단 | 단어 게임 장치 및 방법 |
WO2012026674A2 (ko) * | 2010-08-25 | 2012-03-01 | 에스케이텔레콤 주식회사 | 학습 플랜 분석 방법, 장치 및 시스템 |
WO2013085320A1 (ko) * | 2011-12-06 | 2013-06-13 | Wee Joon Sung | 스마트 기기를 이용한 상황 인식 기반 외국어 습득 및 학습 서비스 제공 방법 |
Also Published As
Publication number | Publication date |
---|---|
JP2018503853A (ja) | 2018-02-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Casillas et al. | A step-by-step guide to collecting and analyzing long-format speech environment (LFSE) recordings | |
CN105229693B (zh) | 教育中心 | |
AU2018279013B2 (en) | Method and system for extraction of relevant sections from plurality of documents | |
US20110219299A1 (en) | Method and system of providing completion suggestion to a partial linguistic element | |
Ye et al. | A crowdsourcing framework for medical data sets | |
CN112052396A (zh) | 课程匹配方法、系统、计算机设备和存储介质 | |
CN110941702A (zh) | 一种法律法规和法条的检索方法及装置、可读存储介质 | |
WO2020111827A1 (ko) | 프로필 자동생성서버 및 방법 | |
CN101512518A (zh) | 自然语言处理系统和词典登录系统 | |
Schmidtke et al. | Mass counts in World Englishes: A corpus linguistic study of noun countability in non-native varieties of English | |
Nkhoma et al. | Learning analytics techniques and visualisation with textual data for determining causes of academic failure | |
Krishna Rohit et al. | Analysis of speeches in Indian parliamentary debates | |
CN112418875A (zh) | 跨平台税务智能客服语料迁移方法及装置 | |
WO2016076622A1 (ko) | 문서 선택에 따른 가이드라인 제공방법, 이를 수행하기 위한 프로그램이 기록된 컴퓨터 판독가능한 기록 매체 및 매체에 저장된 단말장치용 어플리케이션 | |
WO2017179778A1 (ko) | 빅데이터를 이용한 검색 방법 및 장치 | |
Gañan | Plagiarism detection | |
Enăchescu | Screening the Candidates in IT Field Based on Semantic Web Technologies: Automatic Extraction of Technical Competencies from Unstructured Resumes. | |
KR101607128B1 (ko) | 오답과 관련된 연관문제 제공방법 | |
JP2017027168A (ja) | 嗜好学習方法、嗜好学習プログラム、及び嗜好学習装置 | |
CN109710751A (zh) | 法律文件的智能推荐方法、装置、设备及存储介质 | |
KR101783012B1 (ko) | 문서 선택에 따른 가이드라인 제공방법, 이를 수행하기 위한 프로그램이 기록된 컴퓨터 판독가능한 기록 매체 및 매체에 저장된 단말장치용 어플리케이션 | |
JP2021022107A (ja) | 情報提供システムおよびサーバ | |
KR101618600B1 (ko) | 문서 선택에 따른 가이드라인 제공방법, 이를 수행하기 위한 프로그램이 기록된 컴퓨터 판독가능한 기록 매체 및 매체에 저장된 단말장치용 어플리케이션 | |
CN117493210B (zh) | 微服务工具评价方法及系统 | |
CN114565928B (zh) | 文本识别方法、装置、设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15859490 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2017526591 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 18.08.2017) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 15859490 Country of ref document: EP Kind code of ref document: A1 |