CN109086285A - Chinese intelligent processing method and system and device based on morpheme - Google Patents

Chinese intelligent processing method and system and device based on morpheme Download PDF

Info

Publication number
CN109086285A
CN109086285A CN201710857227.1A CN201710857227A CN109086285A CN 109086285 A CN109086285 A CN 109086285A CN 201710857227 A CN201710857227 A CN 201710857227A CN 109086285 A CN109086285 A CN 109086285A
Authority
CN
China
Prior art keywords
poem
morpheme
data
word
chinese
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710857227.1A
Other languages
Chinese (zh)
Other versions
CN109086285B (en
Inventor
夏铨真
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Foshan Huiyuan Mdt Infotech Ltd
Original Assignee
Foshan Huiyuan Mdt Infotech Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Foshan Huiyuan Mdt Infotech Ltd filed Critical Foshan Huiyuan Mdt Infotech Ltd
Publication of CN109086285A publication Critical patent/CN109086285A/en
Application granted granted Critical
Publication of CN109086285B publication Critical patent/CN109086285B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/58Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The Chinese intelligent processing method and system and device that the present invention provides a kind of based on morpheme.Its method includes the following steps: to collect poem data using morpheme as word-building unit;Establish the field of poem database, and production Methods type poem Database field;It is added to the poem data being collected into each field of relationship type poem database, and establishes the data link tree between poem data inside and between poem data and generate the relationship type morpheme database with poem data.It has to the search function of poem, allow one to easily and fast, accurately handle Chinese poem.

Description

Chinese intelligent processing method and system and device based on morpheme
Technical field
The present invention relates to computer data processing technology fields, in particular to a kind of in a computer to Chinese, especially Classic poetry etc., such as " Tang poetry ", " such poems of the Song Dynasty ", " Three Character Primer ", " Records of the Historian ", " Book of Songs " progress intelligent processing method and system and dress It sets.
Background technique
China's ancient civilization at least 5,000 years, it is gradually outer as the reform and opening-up of China's Mainland and development are powerful Compatriots are understood, ancient civilization especially therein, enable many foreigners, especially foreign researcher is fascinated, therein Classic poetry can sufficiently be described the artistic conception of people, sighed with feeling by be allowed people with the short limited word of several rows.
Such as: " the even book of returning to one's home village " of the Tang Dynasty poet He Zhizhang " leaves home a mere child and come back an old man, local accent does not change temples hair and declines;Children's phase See and be not well acquainted with each other, laughs at and ask that visitor comes from where." this first poem is long objective strange land, the reflections poem for cherishing the memory of hometown.Poet places oneself in the midst of native place and is familiar with And among strange environment, winding row comes all the way, and mood is quite uncalm;Current year leaves home, at life's full flowering;Today is returned, temples hair It is scattered, it can't help sighing with deep feeling.
However, especially the foreigner and mind of children can not since the learning difficulty of Chinese is too high in the minds of people Learn well, the understanding of many people let alone the poem to this ancient Chinese essence also just can not comprehensively appreciate Gu Poem such as retrieves certain poem from some words in someone poem or poem of China, and to poem full text and The understanding of pronunciation, various foreign language translations, author etc. can not make due contribution to world culture.
Summary of the invention
The present invention is to overcome defect in the prior art and provide a kind of Chinese intelligent processing method based on morpheme and be System and device have the very big search function to poem, particularly classic poetry, make to solve deficiency in the prior art Chinese poem can easily and fast, accurately be handled by obtaining people, the especially foreigner, and potential help is promoted in Chinese in state Inside and outside usage amount, the carry forward Chinese culture in world civilization.
A kind of Chinese intelligent processing method based on morpheme provided for achieving the object of the present invention, comprising the following steps:
Using morpheme as word-building unit, poem data are collected;
Establish the field of poem database, and production Methods type poem Database field;
It is added to the poem data being collected into each field of relationship type poem database, and described in foundation Data link tree between poem data inside and between poem data generates the relationship type morpheme data with poem data Library.
More preferably, the Chinese intelligent processing method, further includes following steps:
Using morpheme as the word-building unit of word and phrase, retrieval poem data are carried out using the poem database.
More preferably, the Chinese intelligent processing method, further includes following steps:
One of original text full text, translation, pronunciation, author, history or more are obtained according to the poem data link tree The combination of kind.
More preferably, the Chinese intelligent processing method, further includes following steps:
If do not retrieved required poem, then directly return;Or other poems that will be retrieved, as new poem Data are added in relationship type poem database, and carry out relational data link tree, then return and exit.
More preferably, the Chinese intelligent processing method, the addition data simultaneously carry out relational links to data, including such as Lower step:
The poem data that will be collected into are added in each field of the relationship type poem database;
It establishes using morpheme as root, establishes individual character morpheme and word morpheme is skill, poem data are the data link tree of leaf;
Between each poem data of data link tree, corresponding link is established.
More preferably, the Chinese intelligent processing method, the morpheme is minimum linguistic unit, smaller than word, same Word corresponds to multiple morphemes;
Most significant difference between morpheme and word is that morpheme is expressed the meaning, neutral, is shown with a variety of different fonts, so its code Referred to as " neutral code ".
The present invention also provides a kind of Chinese intelligent processing system based on morpheme, including the above-mentioned Chinese intelligence based on morpheme The computer system software module of processing method.
More preferably, the Chinese intelligent processing system, including collection module, field establish module, relational links module, Wherein:
The collection module, for collecting poem data, especially classic poetry data using morpheme as word-building unit;
The field establishes module, for establishing the field of poem database, and each word of production Methods type poem database Section;
The relational links module, for being added to the relationship type poem data for the poem data being collected into In each field in library, and establish the data link tree between poem data inside and between poem data.
The present invention also provides a kind of storage medium, the computer software journey including the Chinese intelligent processing system based on morpheme The storage medium of sequence.
The present invention also provides a kind of hardware, including CPU, and are electrically connected to the storage medium;
The CPU calls the computer software programs of the Chinese intelligent processing system to execute from the storage medium.
The present invention is based on the Chinese intelligent processing methods and system and device of morpheme to have the advantages that
It has the function of the very big computer disposal to poem, particularly classic poetry, so that people, the especially foreigner Can easily and fast, accurately handle Chinese, by retrieval obtain part or the full text of poem after, intelligent interlinking obtains portion Point or full text translation and pronunciation, author etc., more potential help promotes Chinese external usage amount at home, in developing Chineseization contributes.
Detailed description of the invention
It, below will be to specific in order to illustrate more clearly of the specific embodiment of the invention or technical solution in the prior art Embodiment or attached drawing needed to be used in the description of the prior art be briefly described, it should be apparent that, it is described below Attached drawing is some embodiments of the present invention, for those of ordinary skill in the art, before not making the creative labor It puts, is also possible to obtain other drawings based on these drawings.
Fig. 1 is Chinese intelligent retrieval processing method flow chart of the embodiment of the present invention based on morpheme;
Fig. 2 is a kind of embodiment flow chart of step S300 in Fig. 1 of the embodiment of the present invention;
Fig. 3 is Chinese processing apparatus structure schematic diagram of the embodiment of the present invention based on morpheme.
Specific embodiment
As shown in Figure 1-3, being illustrated to make the objectives, technical solutions, and advantages of the present invention clearer.In conjunction with specific Embodiment, the present invention is described in detail.During this, descriptions of well-known structures and technologies are omitted, with to avoid To unnecessarily obscuring idea of the invention.For these descriptions, only it is exemplary.It is not to limit the scope of the invention.
A kind of Chinese poem intelligent processing method based on morpheme of the embodiment of the present invention, as shown in Figure 1, including following step It is rapid:
Step S100 collects poem data, especially classic poetry data using morpheme as word-building unit.
Morpheme is the smallest linguistic unit, smaller than word, and the same word can correspond to multiple morphemes.Citing: " biography " word pair Answer two morphemes (English send, biography;Reception and registration or biography);Corresponding two morphemes of " going through " word (English history, calendar;History or calendar);" day " word corresponds to three morphemes (English sun, day, japanese;The sun, date, Japan). The characteristics of morpheme is that its accuracy is strong, only one pronunciation and a meaning (unicity).
Text is used to keep record, and article is made of sentence, and sentence is made of word and phrase, and word and phrase are made of word.Chinese character and west Fang Yuyan is different, it is provided simultaneously with shape, sound, adopted three attributes, and a shape similar word can have multiple meanings and pronunciation.Due to Chinese character Ambiguity hamper the automatic processing of information, influence big data analysis, become difficult retrieval, further, translation is patrolled Collect complicated fallibility.For the weakness of above-mentioned Chinese character, the embodiment of the present invention is proposed to define for word and phrase with morpheme.
The difference of word, morpheme, word is: 1. word is that 2. morpheme is that 3. word is record word and language to structure lexeme for the unit of sentence-making The grapheme of element.The above two belong to linguistic notation system, there is meaning attribute;The latter belongs to mark system, mainly word Shape attribute, adopted attribute are fuzzy.Most significant difference between morpheme and word is that morpheme is expressed the meaning, neutral, can use a variety of different fonts It has been shown that, so its code can be referred to as " neutral code ";Word table shape, shape similar word can have multiple meanings.For a long time, from ancient times to Today, people are always with " word " for structure lexeme, all information systems, including search engine, are entirely letter with " word " Cease the basic unit of processing.
The creation of breakthrough invention of the embodiment of the present invention is to give up this unbreakable conventional method, with " morpheme " for word-building Unit, this come the retrieval of information, analysis, statistics, big data processing, artificial intelligence application ... will obtain greatly Improve.It is that other family of languageies (including English, French) can not be accomplished using morpheme as the information processing of nuclear structure, such as 1 institute of table Show.
Using morpheme as the core of Chinese further so that the conversion between simplified and traditional word is not necessary to by contextual analysis (context analysis) and carry out retrieval process by the instruction (morpheme table registration simplified and traditional font font) of morpheme table, without Identify that it is simplified or the complex form of Chinese characters, it is 100% that retrieval rate can reach substantially.
Table 1:
Morpheme, particularly single syllable morpheme are the units for forming word or phrase, it should be able to be word and phrase very accurately It watch sound and expresses the meaning.It is that morpheme classifies morpheme being included into eight major class from group word angle: 1. Chinese language class morpheme 2. surname class morpheme 3. people Name class morpheme 4. place name class morpheme 5. science and technology morpheme 6. archaic Chinese morpheme 7. nonsense watch sound morpheme 8. table shape morpheme.Two classes exist afterwards Do not recognize in the prior art rather than true morpheme, but in the embodiment of the present invention, for the accurate retrieval of information and big data analysis Need also to encode for them, referred to as " false morpheme " (French=assimil é is equivalent to a kind of morpheme and goes to handle).
As an embodiment, word (especially alien word) watch sound for much forming word or phrase, does not express the meaning, Such as: " horse " and " reaching " word in " motor " this word;" snow ", " iron " and " dragon " word in " Citreen " this word.These are used for Translate external product, trade mark, name and place name Chinese character be only be used to watch sound, horse, reach, avenge, iron, these words of dragon ... and its Literal sense has no bearing on.
Poem database in the embodiment of the present invention utilizes " nonsense watch sound morpheme " table to collect whole watch sound Chinese characters, is every One watch sound word coding, dramatically improves the accuracy of information retrieval and analysis.
The classic poetry data include but is not limited to " Tang poetry ", " such poems of the Song Dynasty ", " Book of Songs ", " Records of the Historian ", " origin of Chinese character ", " three Word warp ", " 42-volume Chinese dictionary compiled during the regin of Kang Xi in the Qing Dynasty " etc..
Step S200 establishes the field of poem database, and production Methods type poem database.
As an embodiment, described to establish poem database, it is to utilize the including but not limited to inscriptions on bones or tortoise shells (Oracle), SQL (Structured Query Language), the database that the relational databases such as Sybase, ACCESS are established File.
In the embodiment of the present invention, establish poem database, particularly classic poetry database, the poem database include but It is not limited to poem original text field, poem translation field etc..
In the database, the poem database with poem original text field and/or poem translation field is established, so that people , particularly the foreigner during learning poem, the original text of poem can be retrieved and translate and understand the full text of poem Or part;
More preferably, the poem database can also include original text Chinese pronunciation field, foreign language pronunciation field, in this way, After retrieving corresponding poem, the China and foreign countries' pronunciation for even reciting poem can be learnt;
More preferably, the poem database can also include row field, foreign language row field;Author field, author translate word Section, in this way, poem fan and foreign researcher is made to have the interest further learnt to Chinese Poetry.
The poem data being collected into are added in each field of relationship type poem database by step S300, and The data link tree between poem data inside and between poem data is established, the relationship with poem data is generated Type morpheme database.
The data link tree inside poem data is established by the relationship between poem data.
As shown in Fig. 2, the step S300 includes the following steps:
Step S310, the poem data that will be collected into, especially classic poetry data are added to the relationship type poem data In each field in library;
In the embodiment of the present invention, with morpheme come the poem data being collected into for word and phrase with word-building unit, especially Gu Poem data increase in poem database as the data one by one in poem database, and foundation can be retrieved according to morpheme and be closed It is type data.
Step S320 is established using morpheme as root, establishes individual character morpheme and word morpheme is skill, poem data are the data of leaf Link tree;
Monosyllable in Chinese character is no more than 1400, and morpheme is big more than this number, because a syllable will represent Perhaps multiple and different meanings.Such as xin this syllable, so that it may indicate " pungent (arduous), new (new person), the heart (heart), zinc (zinc Mine), firewood (salary), core (wick), fragrant (fragrance), glad (joyful) " etc. several morphemes.
As an embodiment, such as unit of syllable li, the root of morpheme is traversed, finds corresponding morpheme, The individual character morpheme for the "Off" left such as is found, then input " 7 " to indicate that word morpheme is 7 words, then the word morpheme of 7 words " leaving home a mere child and come back an old man " is then searched as the leaf on data link tree, to obtain the row poem.
The above-mentioned root that morpheme is traversed as unit of syllable, a kind of only traversal method of the embodiment of the present invention, and it is of the invention Embodiment can also write input by electronics word, stroke input traverses the root of morpheme.
Step S330 establishes corresponding link between each poem data of data link tree.
Between associated each poem data, associated data link is established, for example, author is " li po " Poem can establish the link, can illustrate li po writes out the poem how much being handed down in history in this way, so as to adjudicate substantially The status of poet;Associated data link etc. can also be established with the time of poem, such as Tang Dynasty.
After finding the word morpheme of the row poem, it can find full text, translation, the work of poem by relational database The content of person, history etc. and other links.
Step S400 carries out retrieval poem number using the poem database using morpheme as the word-building unit of word and phrase According to obtaining the knowledge of the various aspects such as original text full text, translation, pronunciation, author, history according to poem data link tree.
As an embodiment, the poem database is retrieved by retrieving morpheme, the retrieval morpheme point For individual character morpheme and word morpheme.
Word morpheme refers to word there are two at least tools, including one or more individual character morpheme, while the individual character morpheme It combines and constitutes significant fixation meaning unit.
The existing input method of Chinese character of individual character morpheme, such as hand-writing input method, spelling input method, phonitic entry method, five it is defeated Entering method input can be obtained by;
The word morpheme, the Chinese idiom including but not limited in Modern Chinese, the verse in various ancient poetries, various special names Word, famous name etc.;
Such as: " People's Republic of China (PRC) ", " execution ", " leaving home a mere child and come back an old man ", " penniless ", " li po " etc., It is all word morpheme.
For example, when retrieval " leaves home a mere child and come back an old man " this poem, after can first inputting the individual character morpheme "Off", then Word number " 7 " are inputted again, indicate that its word morpheme is poem with seven characters to a line, to quickly and easily retrieve the classic poetry.
Further include following steps as a kind of Chinese intelligent search method based on morpheme of more preferably embodiment:
Step S500 does not such as retrieve required poem, then directly returns;Or return step S300, it will retrieve Other poems be added in relationship type poem database as new poem data, and carry out relational data link tree, Then it returns and exits.
As a kind of more preferably embodiment, if not return step S300
Of the invention Chinese intelligent processing method and system and device based on morpheme, have easily and fast, it is accurately right The processing function of Chinese poem.Further, so that people, the especially foreigner can accurately translate Chinese poem, pass through Retrieval obtains the part of poem, and perhaps link obtains translation and pronunciation, author of part or full text etc. after full text, comprehensively, Correctly understand, read and write and even recite Chinese poem.Further, intelligently Chinese poem can be handled, is had latent Power helps to be promoted ancient poetry in Chinese external usage amount at home, contributes for carry forward Chinese culture.
Correspondingly, as shown in figure 3, the embodiment of the present invention also provides a kind of Chinese intelligent processing system based on morpheme, packet Include the system software module of the above-mentioned Chinese intelligent processing method based on morpheme.
As an embodiment, the Chinese intelligent processing system based on morpheme, including collection module 10, word Duan Jianli module 20, relational links module 30, retrieval module 40, in which:
The collection module 10, for collecting poem data, especially classic poetry data using morpheme as word-building unit;
The field establishes module 20, and for establishing the field of poem database, and production Methods type poem database is each Field;
The relational links module 30, for being added to the relationship type poem number for the poem data being collected into According in each field in library, and establish the data link tree between poem data inside and between poem data;
The retrieval module 40, for being carried out using the poem database using morpheme as the word-building unit of word and phrase Poem data are retrieved, knowing for the various aspects such as original text full text, translation, pronunciation, author, history is obtained according to poem data link tree Know.
It further include data addition as a kind of more preferably embodiment, the Chinese intelligent processing system based on morpheme Module 50 then returns to the relational links module 40 for not retrieving required poem such as, generates the head poem as new Poem data, be added in relationship type poem database, and establish between poem data inside and poem data it Between data link tree, then return exit.
As a kind of more more preferably embodiment, the relational links module 40, including addition submodule 41, tree establish son Module 42 and link submodule 43, in which:
The addition submodule 41, the poem data for will be collected into, especially classic poetry data are added to the pass It is in each field of type poem database;
The tree setting up submodule 42 establishes individual character morpheme and word morpheme is skill, poem for establishing using morpheme as root Data are the data link tree of leaf;
The link submodule 43, between each poem data of data link tree, establishing corresponding link.
Chinese intelligent processing system based on the embodiment of the present invention, as shown in figure 3, the embodiment of the present invention also provides a kind of base In the storage medium of the computer software programs of the Chinese intelligent processing system of morpheme, independence is in plate, mobile phone, desktop computer Etc. storing on various hardware, it is electrically connected to CPU (Central Processing Unit, central processing unit) and from the storage The software for calculation program of the Chinese intelligent processing system is called to execute on medium.
The step of method described in conjunction with the examples disclosed in this document or algorithm, can be executed with hardware, processor The combination of software module or the two is implemented.Storage medium can be random access memory (RAM), memory, read-only memory (ROM), electrically programmable ROM, electrically erasable ROM, register, hard disk, moveable magnetic disc, CD-ROM or technical field In any other form of memory well known to interior.
The Chinese intelligent processing system and device based on morpheme of the embodiment of the present invention, the course of work with based on morpheme Chinese intelligent processing method is essentially identical, and obtains essentially identical beneficial effect, therefore, in embodiments of the present invention, no longer It is described in detail one by one.
Those of ordinary skill in the art should further appreciate that, describe in conjunction with the embodiments described herein Each exemplary unit and algorithm steps, can be realized with electronic hardware, computer software, or a combination of the two, in order to clear Illustrate to Chu the interchangeability of hardware and software, generally describes each exemplary group according to function in the above description At and step.These functions are implemented in hardware or software actually, the specific application and design depending on technical solution Constraint condition.Those of ordinary skill in the art can realize described function using distinct methods to each specific application Can, but such implementation should not be considered as beyond the scope of the present invention.
Above-described specific embodiment, to the purpose of the present invention, technical scheme and beneficial effects into track into one Step is described in detail, it should be understood that being not used to limit this hair the foregoing is merely a specific embodiment of the invention Bright protection scope, all within the spirits and principles of the present invention, any modification, equivalent substitution, improvement and etc. done should all wrap Containing within protection scope of the present invention.

Claims (18)

1. a kind of Chinese intelligent processing method based on morpheme, which comprises the following steps:
Using morpheme as word-building unit, poem data are collected;
Establish the field of poem database, and production Methods type poem Database field;
It is added to the poem data being collected into each field of relationship type poem database, and establishes the poem Data link tree between data inside and between poem data generates the relationship type morpheme database with poem data.
2. Chinese intelligent processing method according to claim 1, which is characterized in that further include following steps:
Using morpheme as the word-building unit of word and phrase, retrieval poem data are carried out using the poem database.
3. Chinese intelligent processing method according to claim 2, which is characterized in that further include following steps:
One or more of original text full text, translation, pronunciation, author, history are obtained according to the poem data link tree Combination.
4. Chinese intelligent processing method according to claim 2 or 3, which is characterized in that further include following steps:
If do not retrieved required poem, then directly return;Or other poems that will be retrieved are returned, as new poem Data are added in relationship type poem database, and carry out relational data link tree, then return and exit.
5. Chinese intelligent processing method according to claim 1, which is characterized in that the addition data simultaneously carry out data Relational links include the following steps:
The poem data that will be collected into are added in each field of the relationship type poem database;
It establishes using morpheme as root, establishes individual character morpheme and word morpheme is skill, poem data are the data link tree of leaf;
Between each poem data of data link tree, corresponding link is established.
6. Chinese intelligent processing method according to claim 5, which is characterized in that the morpheme is minimum linguistic unit, Smaller than word, the same word corresponds to multiple morphemes;
Most significant difference between morpheme and word is that morpheme is expressed the meaning, neutral, is shown with a variety of different fonts, so its code is referred to as For " neutral code ".
7. Chinese intelligent processing method according to claim 6, which is characterized in that 1. the morpheme is divided into from a group word angle Chinese language class morpheme 2. surname class morpheme 3. name class morpheme 4. place name class morpheme 5. science and technology morpheme 6. archaic Chinese morpheme 7. nonsense Watch sound morpheme 8. table shape morpheme.
8. Chinese intelligent processing method according to claim 6, which is characterized in that the poem data are classic poetry number According to;
The classic poetry data are " Tang poetry ", " such poems of the Song Dynasty ", " Book of Songs ", " Records of the Historian ", " origin of Chinese character ", " Three Character Primer ", " Kangxu's word Allusion quotation " in one or more than one kinds of combination.
9. Chinese intelligent processing method according to claim 6, which is characterized in that the poem database includes poem original Text section and poem translate field.
10. Chinese intelligent processing method according to claim 9, which is characterized in that the poem database further includes original Literary Chinese pronunciation field, foreign language pronunciation field, row field, foreign language row field;Author field, author translate one of field or More than one combination of person.
11. Chinese intelligent processing method according to claim 10, which is characterized in that the poem database passes through retrieval Morpheme is retrieved;
The retrieval morpheme is divided into individual character morpheme and word morpheme;
The word morpheme refers to word there are two at least tools, including one or more individual character morpheme, while the individual character morpheme It combines and constitutes significant fixation meaning unit.
12. a kind of Chinese intelligent processing system based on morpheme, which is characterized in that including described in any one of claim 1 to 11 The Chinese intelligent processing method based on morpheme computer system software module.
13. Chinese intelligent processing system according to claim 12, which is characterized in that including collection module, field is established Module, relational links module, in which:
The collection module, for collecting poem data, especially classic poetry data using morpheme as word-building unit;
The field establishes module, for establishing the field of poem database, and each field of production Methods type poem database;
The relational links module, for it is each to be added to the relationship type poem database by the poem data being collected into In field, and establish the data link tree between poem data inside and between poem data.
14. Chinese intelligent processing system according to claim 13, which is characterized in that further include retrieval module, for Morpheme is the word-building unit of word and phrase, retrieval poem data is carried out using the poem database, according to poem data link Tree obtains one of original text full text, translation, pronunciation, author, history or more than one combination.
15. Chinese intelligent processing system described in 3 or 14 according to claim 1, which is characterized in that further include data addition mould Block is then directly returned for not retrieving required poem such as;Or other poems that will be retrieved, as new poem Data are added in relationship type poem database, and carry out relational data link tree, then return and exit.
16. Chinese intelligent processing system described in 3 or 14 according to claim 1, which is characterized in that the relational links module, Including adding submodule, tree setting up submodule and link submodule, in which:
The addition submodule, the poem data for will be collected into, especially classic poetry data are added to the relationship type poem In each field of word database;
The tree setting up submodule, for establishing using morpheme as root, establishing individual character morpheme and word morpheme is skill, and poem data are The data link tree of leaf;
The link submodule, between each poem data of data link tree, establishing corresponding link.
17. a kind of storage medium, which is characterized in that including the described in any item Chinese intelligence based on morpheme of claim 12 to 16 The storage medium of the computer software programs of energy processing system.
18. a kind of hardware, including CPU, which is characterized in that further include being electrically connected to storage medium described in claim 17;
The CPU calls the computer software programs of the Chinese intelligent processing system to execute from the storage medium.
CN201710857227.1A 2017-06-14 2017-09-21 Intelligent Chinese processing method, system and device based on morphemes Active CN109086285B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710446493 2017-06-14
CN2017104464935 2017-06-14

Publications (2)

Publication Number Publication Date
CN109086285A true CN109086285A (en) 2018-12-25
CN109086285B CN109086285B (en) 2021-10-15

Family

ID=64839127

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710857227.1A Active CN109086285B (en) 2017-06-14 2017-09-21 Intelligent Chinese processing method, system and device based on morphemes

Country Status (1)

Country Link
CN (1) CN109086285B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109871442A (en) * 2019-01-18 2019-06-11 程家惠 A kind of Sino-British rendering method, device, equipment and the medium of Chinese character calligraphy text
CN109948157A (en) * 2019-03-13 2019-06-28 日照职业技术学院 A kind of poem is collected and data analysing method
CN112434137A (en) * 2020-12-11 2021-03-02 乐山师范学院 Poetry retrieval method and system based on artificial intelligence

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1777888A (en) * 2003-04-24 2006-05-24 禹蕣朝 Method for sentence structure analysis based on mobile configuration concept and method for natural language search using of it
US7092870B1 (en) * 2000-09-15 2006-08-15 International Business Machines Corporation System and method for managing a textual archive using semantic units
US7225181B2 (en) * 2000-02-04 2007-05-29 Fujitsu Limited Document searching apparatus, method thereof, and record medium thereof
US20070213974A1 (en) * 2006-03-09 2007-09-13 Fujitsu Limited Syntax analysis program, syntax analysis method, syntax analysis device, and computer-readable medium storing syntax analysis program
CN101059915A (en) * 2006-10-18 2007-10-24 杨红春 A system for foreigner learning the common Chinese
CN101075252A (en) * 2007-06-21 2007-11-21 腾讯科技(深圳)有限公司 Method and system for searching network
CN101599078A (en) * 2009-07-10 2009-12-09 腾讯科技(深圳)有限公司 A kind of method of text retrieval and device
CN102184170A (en) * 2011-06-17 2011-09-14 成都成电医星数字健康软件有限公司 Morpheme-level analyzing method for clinical Chinese language
CN102375838A (en) * 2010-08-17 2012-03-14 富士通株式会社 Method and device for constructing polarity morpheme database, and method and device for determining polarity of words
CN102567423A (en) * 2010-12-31 2012-07-11 成都致远诺亚舟教育科技有限公司 Method and system for associated search of poetry
CN103605665A (en) * 2013-10-24 2014-02-26 杭州电子科技大学 Keyword based evaluation expert intelligent search and recommendation method
CN105574067A (en) * 2014-10-31 2016-05-11 株式会社东芝 Item recommendation device and item recommendation method
CN106372039A (en) * 2016-08-18 2017-02-01 王欣 Standard Chinese information ASCII system codes

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7225181B2 (en) * 2000-02-04 2007-05-29 Fujitsu Limited Document searching apparatus, method thereof, and record medium thereof
US7092870B1 (en) * 2000-09-15 2006-08-15 International Business Machines Corporation System and method for managing a textual archive using semantic units
CN1777888A (en) * 2003-04-24 2006-05-24 禹蕣朝 Method for sentence structure analysis based on mobile configuration concept and method for natural language search using of it
US20070213974A1 (en) * 2006-03-09 2007-09-13 Fujitsu Limited Syntax analysis program, syntax analysis method, syntax analysis device, and computer-readable medium storing syntax analysis program
CN101059915A (en) * 2006-10-18 2007-10-24 杨红春 A system for foreigner learning the common Chinese
CN101075252A (en) * 2007-06-21 2007-11-21 腾讯科技(深圳)有限公司 Method and system for searching network
CN101599078A (en) * 2009-07-10 2009-12-09 腾讯科技(深圳)有限公司 A kind of method of text retrieval and device
CN102375838A (en) * 2010-08-17 2012-03-14 富士通株式会社 Method and device for constructing polarity morpheme database, and method and device for determining polarity of words
CN102567423A (en) * 2010-12-31 2012-07-11 成都致远诺亚舟教育科技有限公司 Method and system for associated search of poetry
CN102184170A (en) * 2011-06-17 2011-09-14 成都成电医星数字健康软件有限公司 Morpheme-level analyzing method for clinical Chinese language
CN103605665A (en) * 2013-10-24 2014-02-26 杭州电子科技大学 Keyword based evaluation expert intelligent search and recommendation method
CN105574067A (en) * 2014-10-31 2016-05-11 株式会社东芝 Item recommendation device and item recommendation method
CN106372039A (en) * 2016-08-18 2017-02-01 王欣 Standard Chinese information ASCII system codes

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Y. ZIEMAN 等: "Semantic labeling - unveiling the main components of meaning of free-text", 《 PROCEEDINGS EIGHTH SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL》 *
于华: "对外汉语智能教学系统分析与设计研究", 《中国优秀硕士学位论文全文数据库 哲学与人文科学辑》 *
邢红兵: "基于《汉语水平词汇等级大纲》的语素数据库建设", 《数字化对外汉语教学理论与方法研究》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109871442A (en) * 2019-01-18 2019-06-11 程家惠 A kind of Sino-British rendering method, device, equipment and the medium of Chinese character calligraphy text
CN109948157A (en) * 2019-03-13 2019-06-28 日照职业技术学院 A kind of poem is collected and data analysing method
CN112434137A (en) * 2020-12-11 2021-03-02 乐山师范学院 Poetry retrieval method and system based on artificial intelligence
CN112434137B (en) * 2020-12-11 2023-04-11 乐山师范学院 Poetry retrieval method and system based on artificial intelligence

Also Published As

Publication number Publication date
CN109086285B (en) 2021-10-15

Similar Documents

Publication Publication Date Title
Bowern et al. The Routledge handbook of historical linguistics
CN106066866A (en) A kind of automatic abstracting method of english literature key phrase and system
CN108984661A (en) Entity alignment schemes and device in a kind of knowledge mapping
CN105740236A (en) Writing feature and sequence feature combined Chinese sentiment new word recognition method and system
CN109086285A (en) Chinese intelligent processing method and system and device based on morpheme
WO2017193472A1 (en) Method of establishing digital dongba ancient text interpretive library
Hellwig Using Recurrent Neural Networks for joint compound splitting and Sandhi resolution in Sanskrit
CN110096713A (en) A kind of Laotian organization names recognition methods based on SVM-BiLSTM-CRF
Hämäläinen et al. Finding Sami cognates with a character-based NMT approach
Dundes On computers and folk tales
Yona et al. A finite-state morphological grammar of Hebrew
CN109344390A (en) A method of the card language Entity recognition based on multiple features neural network
Bizzoni et al. Some steps towards the generation of diachronic WordNets
Falahati Qadimi Fumani et al. Inconsistent transliteration of Iranian university names: a hazard to Iran’s ranking in ISI Web of Science
CN107818078B (en) Semantic association and matching method for Chinese natural language dialogue
CN106021225A (en) Chinese maximal noun phrase (MNP) identification method based on Chinese simple noun phrases (SNPs)
Yona et al. A finite-state morphological grammar of Hebrew
Kilic et al. Named entity recognition on morphologically rich language: Exploring the performance of bert with varying training levels
Ali et al. Word embedding based new corpus for low-resourced language: Sindhi
Cheng et al. The revised wordframe model for the Filipino language
CN110516069A (en) A kind of quotation Metadata Extraction method based on FastText-CRF
Tulqin o‘g’li et al. FORMATION AND DEVELOPMENT OF LEXICAL UNITS OF ORIENTAL MONUMENTS IN THE UZBEK, RUSSIAN AND ENGLISH LANGUAGES
Zhang et al. Named Entity Recognition of liver cancer data based on Damped Pointer Network and Dynamic Fusion
CN112989068B (en) Knowledge graph construction method for Tang poetry knowledge and Tang poetry knowledge question-answering system
Olivo et al. CRFPOST: Part-of-Speech Tagger for Filipino Texts using Conditional Random Fields

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant