CN101382931A - Interchange internal code for electronic, information and communication system and use thereof - Google Patents

Interchange internal code for electronic, information and communication system and use thereof Download PDF

Info

Publication number
CN101382931A
CN101382931A CNA2008102184555A CN200810218455A CN101382931A CN 101382931 A CN101382931 A CN 101382931A CN A2008102184555 A CNA2008102184555 A CN A2008102184555A CN 200810218455 A CN200810218455 A CN 200810218455A CN 101382931 A CN101382931 A CN 101382931A
Authority
CN
China
Prior art keywords
stroke
character
characters
chinese
radicals
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2008102184555A
Other languages
Chinese (zh)
Inventor
劳英杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNA2008102184555A priority Critical patent/CN101382931A/en
Publication of CN101382931A publication Critical patent/CN101382931A/en
Priority to PCT/CN2009/001153 priority patent/WO2010043117A1/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/42Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code using table look-up for the coding or decoding process, e.g. using read-only memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • G06F40/129Handling non-Latin characters, e.g. kana-to-kanji conversion

Abstract

The invention discloses a switching inner code used for an electronic system, an information system and a communication system, which comprises a standard character repertoire coded and established with 3 bits as one bit set and 3 or more bit sets for characters, indexing components, strokes, letters or images based on a fixed bit length, wherein, the character, indexing component and stroke of a Chinese character are coded according to the indexing component attribute coding rule, and a Chinese keyword mapping table constituted of Chinese characters of the standard character repertoire and other characters according to semantic matching relationship. The switching inner code used for the electronic system, the information system and the communication system can greatly improve the arithmetic speed and arithmetic precision of a computer, reach the most appropriate coding expansion requirement conveniently, save coding space to the utmost extent and increase the arithmetic speed of a compiler simultaneously.

Description

A kind of interchange internal code and application thereof that is used for electronics, information and communication system
Technical field
The present invention relates to a kind of interchange internal code, the interchange internal code that is used for electronics, information and communication system of particularly a kind of character, radicals by which characters are arranged in traditional Chinese dictionaries, stroke, letter, symbol or figure Unified coding to any literal.
Background technology
The standard of modern computer systems exchange ISN (American Standards Code for InformationInterchange) is called for short ASCII, and beginning is that 7 bits are represented 1 byte, is 2 7Close expression some operating keys, the capital and small letter Latin alphabet and arabic numeral with 128; And Hou is with scale-of-two 2 8Represent 1 byte, extend to 256 coded combinations and represent, comprise the interchange internal code needs of some Western European countries; 1967 and in Geneva of Switzerland suggestion becomes international organization's standard (International Organization for Standardization), be called for short ISO.The global economy development all needs the modern computer systems exchange and shows identical information, so essential unified ISN, along with indivedual countries in Asia and regional needs, country variant and area all reach number of cells in succession by different way and encode.The Big-5 that comprises Big-5, the area, Hong Kong in Japanese Industrial Standards (Japan Industrial Standard/JIS), TRON, Taiwan adds the GBK of HKSCS (Hong Kong Supplementary Character Sets/HKSCS), Korean and China, be the earliest the simplified Chinese character coding GB2312, and Hou comprise the GB18030 etc. of the complex form of Chinese characters.Indivedual countries in above Asia and regional letter application all have a common ground, all are to use simplified or unsimplified Hanzi, and in the middle of most Hanzi font is arranged is identical, but coded representation method difference just fully can not be compatible; Its quantity gap is quite big, is not waited to tens thousand of by thousands of.Also along with changing, incompatible problem causes showing different world's literal in the internet, expedites the emergence of the appearance of Unicode for the rise of internet, the coded representation method of world's literal.Since nineteen ninety, at first world's literal codes of more than 7,000, and then the encode Chinese characters for computer that more than 20,000 China, Japan and Korea S. are used; Reach Hou and add uncommon world's literal and Chinese character again, deal with the needs of the ancient mat in arrangement various places, to having 100,000 word capacities so far.At present, continuous scala media and the high-order computer program language that occurs, and the operating system of being write as with computer program language compatible Unicode simultaneously all, up-to-date standard is ISO10646, but in fact concerning the hardware of any computer or electronic system, with the computer program language of Unicode coding, huge bit amount all can cause very big burden to any computing.The shortcoming of Unicode mainly is the coding method of continuing to use very early time, when causing enlarging character library, must crosswisely develop according to old mode, but with 2 8Crosswise development; The operand of its generation is very big, though can satisfy the needs of coding, has dragged slowly the arithmetic capabilities such as ordering of computer or electronic system.And the coding method of Unicode and logic also do not meet most country and regional literal development need; For example; hanzi system to thousands of; available more than 200 'Radical classification 's; but Unicode does not all insert more than 200 radicals by which characters are arranged in traditional Chinese dictionaries in the Unicode; the position of tens thousand of encodes Chinese characters for computer is very chaotic; can't accomplish the logical attribute corresponding relation between radicals by which characters are arranged in traditional Chinese dictionaries and hanzi system, make the Chinese scholar can't handle the interchange needs of ancient mat ISN.The coded combination tabulation that below is 81 bytes is analyzed:
Table 1
Figure A200810218455D00041
Give in the coded system of Unicode and stayed private coinage space, the user can be placed in the coinage district with the different literals symbol voluntarily; But the setting in this private coinage district but can not be carried out public's transmission in the permutation code mode.All the time, the development of Unicode is not to encode in the regular length mode; The everyday character alphabetic word joint in west is compiled lessly, and seldom used letter symbol is compiled morely, and more bit amount is not easy to realize the high-level efficiency ordering.At present, the development of computer program instruction will solve compatibling problem, is ISN with Unicode all, directly makes the space enlargement of most programming language, strengthens the burden of memory space and hardware.
Present Word message data-encoding scheme, its fundamental purpose are in order to enlarge number of code combinations and accurate recording literal font, and written record semanteme, the literal in west are semantic with the alphabetic string tissue; The China in east is semantic with the block character tissue.Coding development from ASCII to Unicode does not all have any literal or letter is encoded aspect the semantic attribute.Computer and Internet development have produced the Word message of huge amount, and information globalization increases with geometric series especially, and picks up rope with keyword, though the result be inaccurate in a large number because magnanimity information is impossible carry out the semantic attribute classification in the staff mode.
Any in the world spelling literal is all by being that character string different in size is formed, and the character string of the different length of huge amount sorted need expend great computing cost.The most effective management is to store and sort operation with fixed-length data (Fixed-Length Data), automatically the expressed information of any literal is realized the semantic attribute classification, go out the result who has semantic relevance most thereby pick up rope with prestissimo.Magnanimity information pick up rope, the most important condition is to distinguish semantic attribute earlier, carries out tap/dip deep into again in the data of classification Hou automatically; Again be that unique literal that allows possesses the classification method of semantic to literal or letter with the attribute coding.
Mobile phone application the earliest is simple communication facilities, and the function of Hou computer is increasing, and volume is but more and more littler, and current development has been that the function of computer is based upon on the mobile phone; So its electronic structure of the mobile phone of communication facilities is exactly a computer.But for fear of old coded representation method, the very big burden of interchange internal code, a spot of lteral data all is not easy to deal with, so can not develop all functions of computer with low cost on mobile phone; Mainly be ordering at a high speed, other comprise literal and database processing, search and web page browsing etc.If can provide with the hardware of same efficiency than present arithmetic capability more at a high speed, mobile phone can be immediately to more strong functions development.
Summary of the invention
The objective of the invention is to overcome the deficiencies in the prior art, the interchange internal code that is used for electronics, information and communication system of a kind of arithmetic speed height, saving storage space is provided.
In order to reach the foregoing invention purpose, the present invention has adopted following technical scheme: a kind of interchange internal code that is used for electronics, information and communication system, it is characterized in that: comprise with 3 bits as a bit collection and with bit set pair character, radicals by which characters are arranged in traditional Chinese dictionaries, stroke, letter, symbol or image more than 3 or 3 with fixing bit length coding and the standard character storehouse set up, wherein, character element of Chinese character, radicals by which characters are arranged in traditional Chinese dictionaries, stroke are encoded according to radicals by which characters are arranged in traditional Chinese dictionaries attribute coding rule; And comprise that character element of Chinese character and other literal with the standard character storehouse concern the Chinese keyword mapping table of forming according to semantic matches.
The standard character storehouse that constitutes by described graphic character sign indicating number and or Chinese keyword mapping table be stored among the CPU or ROM of computing machine.
The present invention encodes to any character with regular length bit collection, a plurality of bit integrated mode, and each bit collection is 2 3, i.e. 8 kinds of coded combinations; Because 2 3Be near machine 2 1Number of cells, therefore improved the arithmetic speed and the operational precision of computing machine greatly.And with 2 3Encode as bit set pair character, symbol and an image, when the character amount increases, can be according to the needs of character amount, increase by one or an above bit collection, to suit the computing demand of different scales infosystem, reach optimal coding expansion demand, save space encoder to greatest extent, improve the arithmetic speed of compiler simultaneously.And under the coding environment of Unicode,, also can only laterally increase a byte or more byte even the increase of character amount is a bit, cause the serious waste of space encoder, the arithmetic speed of dragging slow compiler.The present invention's fixed-length code (FLC) can develop suitable contrary parallel sort algorithm more at a high speed.
The Chinese character of having used thousands of years belongs to pictograph, be to be formed by radical and unit construction, and radicals by which characters are arranged in traditional Chinese dictionaries have the characteristics of expression essential attribute, so Chinese character has and can classify and conclude the feature of attribute according to the radicals by which characters are arranged in traditional Chinese dictionaries system.Other literal of except that Chinese character any can both be set up the mapping corresponding relation according to speech meaning and Chinese character in the world, thereby possesses the attribute of automatic classification indirectly, thereby the bit amount that is converted to still less is able to storage, sort operation and transmission.Chinese keyword mapping table of the present invention is according to identical semanteme, with Chinese phrase and English or in the world other literal set up semantic corresponding relation, thereby realize with other literal codes of minimum bit amount mapping/expression, save the space encoder of character, realize ordering at a high speed with regular length bit collection simultaneously.
Description of drawings
Fig. 1 is the present invention's schematic flow sheet of encoding.
Fig. 2 encodes with character, radicals by which characters are arranged in traditional Chinese dictionaries, stroke and the letter of 6 any literal of bit set pair, mark with phonetic symbols symbol, symbol etc. to set up the synoptic diagram in standard character storehouse;
Fig. 3 is the synoptic diagram of hanzi system being encoded by the Chinese-character radical-code rule;
Fig. 4 is the relation on attributes synoptic diagram of the Chinese character radicals and the encoding of Chinese word group;
Fig. 5 is that any text phrases and Chinese are set up the mapping relations synoptic diagram according to keyword;
Fig. 6 is the synoptic diagram that shines upon the English phrase of identical semanteme with encode Chinese characters for computer;
Fig. 7 is an application flow synoptic diagram of the present invention.
Embodiment
Below in conjunction with accompanying drawing the preferred embodiments of the present invention being described, should be appreciated that preferred embodiment described herein only is used for description and interpretation the present invention, is not limitation of the invention.
As shown in Figure 1, the present invention at first sets up the standard character storehouse, is used for any character, symbol and figure are encoded with n (n 〉=3) group scale-of-two bit collection, and each bit collection has 2 3Therefore=8 kinds of coded combinations, can provide (2 altogether 3) nPlanting space encoder encodes.
Fig. 2 shows according to coded system of the present invention, with any one character, font, literal, radicals by which characters are arranged in traditional Chinese dictionaries, mark with phonetic symbols symbol, symbol, figure and the image of using in the world at present etc., with unique font encoding symbols.Coded combination is an example with 6 groups, is 2 3X 2 3X 2 3X 2 3X 2 3 X 2 3, can be to 262,144 literal and encoding symbols, and the number of cells of each coding is 18.Example in the figure has Chinese character, Arabic figure, the Latin alphabet, Greek alphabet, Rome figure, music symbol, Korea S's mark with phonetic symbols symbol and Japanese alphabet literal etc. respectively.
Now be encoded to example, promptly 2 with 6 scale-of-two bit set pair Hanzi fonts 3X 2 3X 2 3X 2 3X 2 3 X 2 3, total number of code combinations is 260,000 2 thousand, satisfies 100,000 coding demands of present world literal, is 2.6 times that the literal code combination of the present whole world needs, and also has the coding extending space of 160,000 coded combinations; Enough deal with literal expansion needs over the next several years, the following tabulation of its account form:
Table 2
Bit collection byte quantity 1 2 3 4 5 6 7 8
23 powers are represented 2 3 2 3 x 2 3 2 3 x 2 3 x 2 3 2 3 x 2 3 x 2 3 x 2 3 2 3 x 2 3 x 2 3 x 2 3 x 2 3 2 3 x 2 3 x 2 3 x 2 3 x 2 3 x 2 3 2 3 x 2 3 x 2 3 x 2 3 x 2 3x2 3 x 2 3 3 x 2 3 x 2 3 x 2 3 x 2 3 x 2 3 x 2 3 x 2 3
Number of code combinations 8 64 512 4096 32,768 262,1 44 2,097, 152 16,777 ,216
Every coded combination bit amount 3 6 9 12 15 18 21 24
Byte number 3/8= 6/8= 9/8= 12/8= 15/8= 18/8= 21/8= 24/8=
0.375 0.75 1.125 1.5 1.875 2.25 2.625 3
From last table as seen, encode with 6 bit collection, its coded combination can reach 262,144, compares with Unicode; Still have the space encoder of 160,000 characters, be enough to deal with the expansion needs that reach at present over the next several years, and shared space total volume has only 2.25 bytes (Byte), and memory space and arithmetic capability are less demanding, is fit to the interchange internal code of development portable information and communication system.And following needs according to literal expansion and application, available 6 above scale-of-two bit collection are encoded, and the memory space and the arithmetic capability of its requirement are higher, are fit to the interchange internal code of the large-scale infosystem of development.
When single Hanzi font is encoded, encode with scale-of-two multidigit unit collection according to radicals by which characters are arranged in traditional Chinese dictionaries attribute coding rule, for example as Fig. 3, radical " Ren " possesses identical radicals by which characters are arranged in traditional Chinese dictionaries relating attribute with the Chinese character with " Ren " radicals by which characters are arranged in traditional Chinese dictionaries, and the coding that is embodied in them has the identical expression of essence; So analogize, " and serial Chinese character all in this way for radical " Chi ", " Xin ", " Rolling " and " Bo; In this example, have the character element of Chinese character of identical radicals by which characters are arranged in traditional Chinese dictionaries, the front three numeral of its coding also is identical, thereby radicals by which characters are arranged in traditional Chinese dictionaries attributive classification rule encoding pressed in the realization Chinese character, accurately distinguishes the radicals by which characters are arranged in traditional Chinese dictionaries attribute of different Chinese character font.
Fig. 4 for example, related in the hanzi system with " water " implication, its radicals by which characters are arranged in traditional Chinese dictionaries are " Rui ", if " Rui " is encoded to 111 000, when every Chinese character relevant with " water " implication or radicals by which characters are arranged in traditional Chinese dictionaries can enroll in 111 000 groups recognition property, front three is 111 coded combination, all has the attribute of water, and can automatic attributive classification with water.For example Chinese character " seawater " is encoded with 6 bit collection, be respectively 111 661 and 111 660, and the radicals by which characters are arranged in traditional Chinese dictionaries of Chinese character " seawater " all is " Rui ", is encoded to 111 000; Utilizing the radicals by which characters are arranged in traditional Chinese dictionaries attribute to carry out Methods for Coding can link together the Chinese character that relates to speech meaning " water " in coding and the hanzi system, and the front three numeral of character element code is identical, all is 111.
In the above example, be that stroke order by Chinese character splits at least one radicals by which characters are arranged in traditional Chinese dictionaries or parts with Chinese character, the stem head of this word has taken first three the bit collection in the coding, and three remaining bit collection can be made the flowing water numbering also can consider to adopt further radicals by which characters are arranged in traditional Chinese dictionaries attribute coding.
In actual applications, the bit collection that stem head or first part take also can be 1, or 2, or 4, the present invention does not make qualification to this.
Chinese character is split into the mode of at least one radicals by which characters are arranged in traditional Chinese dictionaries/parts except adopting by Chinese-character order of strokes, the radicals by which characters are arranged in traditional Chinese dictionaries attribute coding rule that is adopted can also be by stroke order and predetermined stroke set Chinese character to be split at least one stroke, and adopts more than one bit collection to come this stroke is encoded.For example: set predetermined stroke set by point ", "------representative is short casts aside and short class stroke, long cast aside " Pie " of pressing down, and---representative is long casts aside and long class stroke, short draw "-" of pressing down---represent hyphen and short perpendicular class stroke and dash " "---the long horizontal and long class stroke that erects of representative is formed for representative point class stroke, short cast aside " Pie ", correspond respectively to 1 ~ 5 five numeral, font stroke insufficient section is represented with digital " 0 ".Then the radical-code of Chinese character " sea " then is 1 11661, and promptly stroke takies a bit collection.
With the character element of Chinese character in the standard character storehouse of radicals by which characters are arranged in traditional Chinese dictionaries attribute coding rule foundation, set up Chinese keyword mapping table simultaneously, be used for other literal according to key words justice matching relationship correspondence mappings to the Chinese phrase, be about in the standard character storehouse Chinese and in the world the phrase of other literal be mapped, represent other literal with Chinese.As Fig. 5; Any in the world spoken and written languages can both be mapped to Chinese keyword mapping table, thereby realize the semantic attribute classification with automated manner indirectly.
As shown in Figure 6, by other literal being mapped to the mode of Chinese keyword mapping table, can be converted to bit amount still less, when other literal need sort operation like this, thereby coding bit amount is significantly reduced, the Chinese keyword mapping table that adopts the character element of Chinese character in standard character storehouse to form shines upon the English of identical semanteme, can replace that Unicode on-fixed length and multidigit unit amount is stored, sort operation and transmission.For example, in the mapping table of Chinese and character, corresponding relation according to semanteme, the Chinese semantic meaning of " Sea Water " is " seawater ", cause is a code storage with 36 bits that " Sea Water " is converted to Chinese keyword " seawater ", be that its coding sign indicating number position is 36, be less than 72 bits of English own far away.Therefore, in the time will retrieving, what no matter import is the keyword of any character express, can concern according to semantic matches, in Chinese keyword mapping table, be mapped to corresponding Chinese phrase, thereby be converted to bit amount still less, accelerate storage, sort operation and the transmission speed of computer system.
During application, with above-mentioned standard character storehouse and or Chinese keyword mapping table directly insert ROM (Read OnlyMemory) or CPU (Central Processing Unit), the more coded combination of any character of quick access and attribute data.
The above is the preferred embodiments of the present invention only, is not limited to the present invention.For a person skilled in the art, the present invention can have various changes and variation.Within the spirit and principles in the present invention all, any modification of being done, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (8)

1, a kind of interchange internal code that is used for electronics, information and communication system, it is characterized in that: comprise with 3 bits as a bit collection and with bit set pair character, radicals by which characters are arranged in traditional Chinese dictionaries, stroke, letter, symbol or image more than 3 or 3 with fixing bit length coding and the standard character storehouse set up, wherein, character element of Chinese character, radicals by which characters are arranged in traditional Chinese dictionaries, stroke are encoded according to radicals by which characters are arranged in traditional Chinese dictionaries attribute coding rule; And
Comprise the Chinese keyword mapping table of forming according to the semantic matches relation with character element of Chinese character and other literal in standard character storehouse.
2, interchange internal code according to claim 1 is characterized in that: the number of described bit collection is 6.
3, interchange internal code according to claim 2 is characterized in that: described radicals by which characters are arranged in traditional Chinese dictionaries attribute coding rule is meant that Chinese character splits at least one radicals by which characters are arranged in traditional Chinese dictionaries or parts, encodes with the bit collection more than according to stroke order.
4, interchange internal code according to claim 3 is characterized in that: each radicals by which characters are arranged in traditional Chinese dictionaries or parts are encoded with 3 bit collection.
5, interchange internal code according to claim 2 is characterized in that: described radicals by which characters are arranged in traditional Chinese dictionaries attribute coding rule be meant Chinese character according to the set of predetermined stroke and stroke order splits at least one stroke, with at least one bit collection coded representation.
6, interchange internal code according to claim 5 is characterized in that: described predetermined stroke set by point ", "---representative point class stroke, the short left-falling stroke "
Figure A200810218455C0002114555QIETU
"---representative is short cast aside and short press down class stroke, the long left-falling stroke "
Figure A200810218455C0002114618QIETU
"---long left-falling stroke of representative and long right-falling stroke class stroke, short draw "-"---represent perpendicular class stroke of hyphen and weak point and dash " "---and represent the long horizontal and long class stroke that erects to form, and correspond respectively to 1 ~ 5 five numeral, and font stroke insufficient section is represented with digital " 0 ".
7, interchange internal code according to claim 1 is characterized in that: described standard character storehouse or Chinese keyword mapping table are stored among the CPU or ROM of electronic system.
8, a kind of application rights require that the described interchange internal code that is used for electronics, information and communication system of 1-7 is retrieved, sorted, the method for storage or data output, it is characterized in that may further comprise the steps:
(1) input is with the keyword of source character express;
(2) system is corresponding with the mapping of Chinese phrase with the source document word according to the semantic matching relationship of the keyword of described Chinese keyword mapping table;
(3) to the keyword with Chinese expression sort, retrieve, storage or data output function.
CNA2008102184555A 2008-10-17 2008-10-17 Interchange internal code for electronic, information and communication system and use thereof Pending CN101382931A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CNA2008102184555A CN101382931A (en) 2008-10-17 2008-10-17 Interchange internal code for electronic, information and communication system and use thereof
PCT/CN2009/001153 WO2010043117A1 (en) 2008-10-17 2009-10-19 Digital encoding method and application thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2008102184555A CN101382931A (en) 2008-10-17 2008-10-17 Interchange internal code for electronic, information and communication system and use thereof

Publications (1)

Publication Number Publication Date
CN101382931A true CN101382931A (en) 2009-03-11

Family

ID=40462776

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2008102184555A Pending CN101382931A (en) 2008-10-17 2008-10-17 Interchange internal code for electronic, information and communication system and use thereof

Country Status (2)

Country Link
CN (1) CN101382931A (en)
WO (1) WO2010043117A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010043117A1 (en) * 2008-10-17 2010-04-22 Lo Yingkit Digital encoding method and application thereof
CN102955779A (en) * 2011-08-18 2013-03-06 腾讯科技(深圳)有限公司 Method and device for searching software
CN113362263A (en) * 2021-05-27 2021-09-07 百度在线网络技术(北京)有限公司 Method, apparatus, medium, and program product for changing the image of a virtual idol

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112329389B (en) * 2019-07-30 2024-02-27 北京大学 Chinese character stroke automatic extraction method based on semantic segmentation and tabu search
CN111669394B (en) * 2020-06-04 2022-03-04 西安空间无线电技术研究所 Method for hiding and transmitting image and voice information of satellite communication

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1155690A (en) * 1995-11-27 1997-07-30 王道通 Three-stroke inputting method
CN1244041C (en) * 2003-01-20 2006-03-01 郭松森 Method of big and small character elements for inputting Chinese characters
ITVA20060065A1 (en) * 2006-11-03 2008-05-04 St Microelectronics Srl MEMORY WITH THREE-LEVEL CELLS AND ITS MANAGEMENT METHOD.
CN101408873A (en) * 2007-10-09 2009-04-15 劳英杰 Full scope semantic information integrative cognition system and application thereof
CN101382931A (en) * 2008-10-17 2009-03-11 劳英杰 Interchange internal code for electronic, information and communication system and use thereof

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010043117A1 (en) * 2008-10-17 2010-04-22 Lo Yingkit Digital encoding method and application thereof
CN102955779A (en) * 2011-08-18 2013-03-06 腾讯科技(深圳)有限公司 Method and device for searching software
CN102955779B (en) * 2011-08-18 2017-11-07 深圳市世纪光速信息技术有限公司 The method and apparatus of software search
CN113362263A (en) * 2021-05-27 2021-09-07 百度在线网络技术(北京)有限公司 Method, apparatus, medium, and program product for changing the image of a virtual idol
CN113362263B (en) * 2021-05-27 2023-09-15 百度在线网络技术(北京)有限公司 Method, apparatus, medium and program product for transforming an image of a virtual idol

Also Published As

Publication number Publication date
WO2010043117A1 (en) 2010-04-22

Similar Documents

Publication Publication Date Title
CN101131690B (en) Method and system for mutual conversion between simplified Chinese characters and traditional Chinese characters
CN111783399B (en) Legal referee document information extraction method
CN101950285A (en) Utilize native language pronunciation string converting system and the method thereof of statistical method to Chinese character
CN106528536A (en) Multilingual word segmentation method based on dictionaries and grammar analysis
CN102662926B (en) The storage and inquire method of character library
CN101382931A (en) Interchange internal code for electronic, information and communication system and use thereof
U Rahman Towards Sindhi corpus construction
Naseem et al. A novel approach for ranking spelling error corrections for Urdu
Ye et al. Part-of-speech tagging based on dictionary and statistical machine learning
CN102929865A (en) PDA (Personal Digital Assistant) translation system for inter-translating Chinese and languages of ASEAN (the Association of Southeast Asian Nations) countries
Li et al. Markbert: Marking word boundaries improves chinese bert
CN105573981A (en) Method and device for extracting Chinese names of people and places
Marsi et al. Memory-based morphological analysis generation and part-of-speech tagging of Arabic
Lu Computers and Chinese writing systems
Kumar Saha et al. Named entity recognition in Hindi using maximum entropy and transliteration
CN101882158A (en) Automatic translation sequence adjusting method based on contexts
CN102053955B (en) Method and system for inputting symbols
Yu et al. Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information.
CN100533359C (en) Oracle spelling and component disintegration and input method
Nongmeikapam et al. A transliteration of CRF based Manipuri POS tagging
CN101135938B (en) Chinese characters phonetic two-tone input method
CN101576924A (en) Mongolian retrieval method
CN109871550A (en) A method of the raising digital translation quality based on post-processing technology
CN104699662A (en) Method and device for recognizing whole symbol string
Almahdawi et al. Automatically recognizing emotions in text using prediction by partial matching (PPM) text compression method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1126291

Country of ref document: HK

C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Open date: 20090311

REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1126291

Country of ref document: HK