CN102880690A - Method for simplifying character library of electronic book, and processing device for electronic book - Google Patents

Method for simplifying character library of electronic book, and processing device for electronic book Download PDF

Info

Publication number
CN102880690A
CN102880690A CN2012103480942A CN201210348094A CN102880690A CN 102880690 A CN102880690 A CN 102880690A CN 2012103480942 A CN2012103480942 A CN 2012103480942A CN 201210348094 A CN201210348094 A CN 201210348094A CN 102880690 A CN102880690 A CN 102880690A
Authority
CN
China
Prior art keywords
file
format description
font file
book
new
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012103480942A
Other languages
Chinese (zh)
Inventor
闻申生
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING HANYI KEYIN INFORMATION TECHNOLOGY Co Ltd
Original Assignee
BEIJING HANYI KEYIN INFORMATION TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING HANYI KEYIN INFORMATION TECHNOLOGY Co Ltd filed Critical BEIJING HANYI KEYIN INFORMATION TECHNOLOGY Co Ltd
Priority to CN2012103480942A priority Critical patent/CN102880690A/en
Publication of CN102880690A publication Critical patent/CN102880690A/en
Pending legal-status Critical Current

Links

Images

Abstract

The embodiment of the invention discloses a method for simplifying a character library of an electronic book, and a processing device for the electronic book, wherein the method comprises the steps as follows: acquiring Chinese character internal codes in a format description file by analyzing the format description file of the electronic book, and ordering the Chinese character internal codes according to a specified coding sequence to form a code bit sequence list of the format description file; generating a new character library file in a way of only indexing Chinese character internal codes appearing in the format description file according to the code bit sequence list and an original character library file of the electronic book; and replacing the original character library file with the new character library file, so as to finish simplification of the character library of the electronic book. According to the method for simplifying the character library of the electronic book and the processing device for the electronic book disclosed by the embodiment of the invention, a Chinese character internal code sequence of the electronic book is acquired, a font library special for the electronic book is generated based on the Chinese character internal code sequence, so that useless fonts of the electronic book are filtered out; by adopting the method and the device, the problem of data redundancy of the Chinese character library of the electronic book is solved, and files of the character library of the electronic book are reduced greatly.

Description

A kind of method and e-book treating apparatus of simplifying the e-book character library
Technical field
The present invention relates to the digital publishing field, especially relate to a kind of method and e-book treating apparatus of simplifying the e-book character library.
Background technology
When the aesthetic property of the reading property that relates to digital publishing and electronic publishing, Chinese character pattern attractive in appearance and abundant be the most important feature of e-book.The size of e-book is to a great extent by e-book format description file such as EPUB(electronic publication), Mobi etc., and the size of font file determines.The reading of e-book generally is to read after downloading to local device by network, is transferred to the reading of equipment after also can resolving by high in the clouds.Current e-book can be used system's character library, the General System character library carries the character library of one to two kind of GB18030-2000, if not with the e-book of having the font character library by oneself, meeting automatic indexing system character library, so e-book can cause thousand times same sensations of e-book, and aesthetic property and uniqueness can't manifest.Therefore, e-book generally can carry character library.
But the Hanzi font library word-base amount is very large, and 27533 Chinese character patterns of the Chinese character pattern that existing standard is collected from 6763 fonts of GB2312-80 to GB18030-2000 do not wait.Because Chinese character quantity is large, the file size of the Hanzi font library (take the TrueType font file as example) of a cover font does not wait (concrete font file size is relevant with number of words and font) at 3M to 20M.Like this, in an e-book, character library with a cover 18030-2000, character library is occupied the space and is surpassed more than 90%, occupy in a large number the character library application resource, particularly taken the transfer resource of network, and a large amount of font attaches and becomes the data burden that takies resource, so current in the urgent need to a kind of method that can simplify e-book character library size.
Summary of the invention
In view of this, the purpose of the embodiment of the invention provides a kind of method and e-book treating apparatus of simplifying the e-book character library, takies problem than large space to solve present e-book character library.
On the one hand, the embodiment of the invention discloses a kind of method of simplifying the e-book character library, described method comprises:
By the Hanzi internal code in the described format description file of format description file acquisition of resolving e-book, and press the ordering of prescribed coding order, form the code bit sequence list of described format description file;
According to the former font file of described code bit sequence list and described e-book, the mode by the Hanzi internal code that occurs in the described format description file of an index generates new font file;
Replace described former font file with described new font file, to finish simplifying of e-book character library.
Preferably, described format description file is specially the EPUB file.
Preferably, described by the Hanzi internal code in the described format description file of format description file acquisition of resolving e-book, and press the ordering of prescribed coding order, and form the code bit sequence list of described format description file, specifically comprise:
By chained list registering section in the described EPUB file of described EPUB file acquisition of resolving described e-book;
Travel through described chained list registering section, the paragraph content of each pagefile is turned to UNICODE conversion and ordering as UTF-8, form described code bit sequence list.
Preferably, described new font file and described former font file are specially TTF font file or OTF font file.
Preferably, described former font file according to described code bit sequence list and described e-book, the mode by the Hanzi internal code that occurs in the described format description file of an index generates new font file, specifically comprises:
Obtain all data table items in the former font file, described all data table items comprise coding mapping table and other list items;
Described code bit sequence list and described coding mapping table are compared, and the mode by the Hanzi internal code that occurs in the described format description file of an index generates new coding mapping table;
Generate other new list items according to described new coding mapping table;
According to described new coding mapping table and described other new list items, synthetic new font file.
Preferably, the described new font file of described usefulness is replaced described former font file, to finish simplifying of e-book character library, specifically comprises:
The character library of revising in the described format description file selects to navigate to described new font file;
Amended described format description file and described new font file are replaced original described format description file and described former font file, to finish simplifying of e-book character library.
On the other hand, the embodiment of the invention also discloses a kind of e-book treating apparatus, is used for simplifying the font file of e-book, and described device comprises:
Code bit sequence list generation unit is used for the Hanzi internal code by the described format description file of format description file acquisition of resolving described e-book, and presses the ordering of prescribed coding order, forms the code bit sequence list of described format description file;
New font file generation unit is used for the former font file according to described code bit sequence list and described e-book, and the mode by the Hanzi internal code that occurs in the described format description file of an index generates new font file;
The character library replacement unit is used for replacing described former font file with described new font file, to finish simplifying of e-book character library.
Preferably, described format description file is specially the EPUB file, and described code bit sequence list generation unit specifically comprises:
The chained list registering section is obtained subelement, is used for by resolving the described EPUB file of the described EPUB file acquisition chained list registering section of described e-book;
The code bit sequence list obtains subelement, is used for traveling through described chained list registering section, and the paragraph content of each pagefile is turned to UNICODE conversion and ordering as UTF-8, forms described code bit sequence list.
Preferably, described new font file and described former font file are specially TTF font file or OTF font file, and described new font file generation unit specifically comprises:
Data table items obtains subelement, is used for obtaining all data table items of former font file, and described all data table items comprise coding mapping table and other list items;
The coding mapping table generates subelement, is used for described code bit sequence list and described coding mapping table are compared, and the mode by the Hanzi internal code that occurs in the described format description file of an index generates new coding mapping table;
Other list items generate subelement, are used for generating other new list items according to described new coding mapping table;
Font file synthon unit is used for according to described new coding mapping table and described other new list items, synthetic new font file.
Preferably, described character library replacement unit specifically comprises:
Format description file modification subelement selects to navigate to described new font file for the character library of revising described format description file;
File is replaced subelement, is used for amended described format description file and described new font file are replaced original described format description file and described former font file, to finish simplifying of e-book character library.
The embodiment of the invention is by obtaining the Hanzi internal code sequence of an e-book, and generate the method for the font character library of this e-book special use with this, with this e-book use less than font filter, solved the data redundancy problem of e-book Hanzi font library, so that the e-book font file significantly reduces below 20% of size of the original e-book of e-book average out to after simplifying (with a cover 18030-2000 character library).Like this, just significantly reduced the resource occupation of e-book access and transmission, Fu Feng the application of Chinese character pattern, for Chinese character pattern provides technological means in the digital publishing widespread use, strengthened the attractive in appearance of digital publishing and with the individualized feature of Chinese character pattern strengthening electronic book, promote the reading effect of e-book, advanced the common development of double bubble technique and digital publishing.
Description of drawings
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art, the below will do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art, apparently, accompanying drawing in the following describes only is some embodiments of the present invention, for those of ordinary skills, under the prerequisite of not paying creative work, can also obtain according to these accompanying drawings other accompanying drawing.
Fig. 1 is the main flow chart of the embodiment of the invention one method;
Fig. 2 is the embodiment of the invention one step S101 internal process figure;
Fig. 3 is the embodiment of the invention one step S102 internal process figure;
Fig. 4 is the embodiment of the invention one step S103 internal process figure;
Fig. 5 is the schematic diagram of embodiment of the invention two devices.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the invention, the technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making the every other embodiment that obtains under the creative work prerequisite.
Embodiment one
At first, before the narration embodiment of the invention, simply introduce first this paper related some concepts and term:
Windows system: the window platform software systems that refer to Microsoft.
IOS: refer to the plateform system of Apple, comprise Mac, iPad, iPhone etc.
Android: refer to what Google company released, current a lot of business mens are supported the plateform system of mobile device, such as Samsung, association etc.
EPUB: be the very important concept of this paper.EPUB is the format description file of an e-book, comprises the information such as encode Chinese characters for computer, book version form of this e-book.Specifically, the EPUB file is open file standard, the EPUB file is mainly described the space of a whole page composing control information of book version, the coding standard of use Chinese character, Hanzi internal code, font, size, position etc., explanation for EPUB, obtain exactly coding standard and ISN, and correspond to according to coding standard in the order of GB18030, sort.
Mobi: be the e-book format description file that uses after Amazon kindle releases, principle is same as the EPUB form.
The spine chained list: the part relevant with encode Chinese characters for computer in the EPUB file, this table is the chained record structure, order comprises EPUB and comprises all chapters and sections index informations, and coding standard and Hanzi internal code information.
The UTF-8 coding method: identify literal take 8 as unit, the coding method of variable byte decides maximum 4 bytes according to the UNICODE coding of this symbol.The character of different range is used the coding of different length.For the character between the 0x00-0x7F, the UTF-8 coding is identical with the ASCII coding.
The standard word library format character library of TrueType character library: Windows and iOS system, system has finished for the reduction of character library and explanation, is called for short TTF file (character library).
The OpenType character library: the standard word library format of compatible Adobe postscript Type1 and TrueType form, be called for short OTF file (character library).
Postscript Type1: be the character library standard of Adobe definition, and the page-description language Postscript of Adobe is fully compatible.
GB2312-80: the encode Chinese characters for computer standard, contain 6763 Chinese characters.GB12345-90: the traditional font of corresponding GB2312-80 is expanded, and has 6866 Chinese characters.
GBK: the encode Chinese characters for computer standard, expand based on GB2312-80, have 20902 Chinese characters.
GB18030:GB18030-2000 encode Chinese characters for computer standard, 27533 Chinese characters are collected in the expansion on the GBK basis altogether, and current general-purpose system is all supported this standard code.GB18030-2005 has reached font about 75000 based on the expansion sets font of this standard.
Substantially about 3000 Chinese characters, the Chinese character that a general e-book is used also is at this quantitatively to the daily actual Chinese characters in common use that use.So the inventor finds in realizing process of the present invention, for e-book, the number of words of using only is about 40% of GB2312-80 generally in fact, about 10% of GB13080-2000, and for the e-book that has formed, the number of words of its use is determined more, has very large redundancy in the visible current e-book character library, and the present invention namely produces based on this principle.
Referring to Fig. 1, be the process flow diagram of the embodiment of the invention one method.The present embodiment discloses a kind of method of simplifying the e-book character library, and described method comprises:
S101: by the Hanzi internal code in the described format description file of format description file acquisition of resolving e-book, and press the ordering of prescribed coding order, form the code bit sequence list of described format description file.
In brief, this step namely resolves for the format description file of e-book, the Hanzi internal code that is occured to obtain the format description file, and by as the coded sequence of the appointments such as GB18030 sort, thereby form the code bit sequence list of this format description file.
Lower mask body is described in detail this step take this format description file layout of EPUB as example.And the disposal route of the forms such as Mobi and EPUB are basic identical, repeat no more.
Referring to shown in Figure 2, preferred, step S101 specifically can comprise:
S1011: by chained list registering section in the described EPUB file of described EPUB file acquisition of resolving described e-book;
S1012: travel through described chained list registering section, the paragraph content of each pagefile is turned to UNICODE conversion and ordering as UTF-8, form described code bit sequence list.
Specifically, at first need reading e-books EPUB file and resolve, comprise unpacking of compressed format, then read primary structure information, comprise complete composition information, and encode Chinese characters for computer information etc.
Then parse all Hanzi internal codes.Obtaining of encode Chinese characters for computer information is by parsing spine chained record structure, and obtains all relevant coded messages, and collects in the buffer memory.
The UNICODE coding result of layout ordering again.For UTF-8 coding carry out UNICODE conversion and ordering, obtain the in order coded message of discharging.
In the EPUB file, during the Chinese character information part of statistics, only need statistics chained list registering section in the EPUB file to get final product, i.e. spine chained list registering section.Spine is part relevant with encode Chinese characters for computer in the EPUB file, and this table is the chained record structure, and order comprises EPUB and comprises all chapters and sections index informations, and coding standard and Hanzi internal code information.The way that EPUB is resolved is, for resolving of UTF-8 coding, the paragraph content of each pagefile turned to the UNICODE conversion as UTF-8.At first according to spine structure traversal chained list, as long as less than the chained list end, just take out data, deposit buffer zone in binary mode, then these chapters and sections are carried out transcoding.
Transcoding for Hanzi internal code is the purpose that reads the EPUB data, and technical coding rule byte-by-byte according to UTF-8 is judged the first place value of letter, if less than 127, must cross DBC case for English or numeral, skips and disregards.If the first and 0xE0 be ﹠amp; Operation result is 0xC0, and inferior byte and 0xC0 do; It then is 2 byte UTF-8 Chinese characters that computing is not equal to 0x80, stores in order 2 bytes and gets final product.If the first and 0xF0 be ﹠amp; Operation result is 0xC0, and inferior byte reaches again byte and 0xC0 does; It then is 3 byte UTF-8 Chinese characters that computing is not equal to 0x80, stores in order 3 bytes and gets final product.Then utilizing win32 APIMultiByteToWideChar(system resource call function) front 2 parameters C P UTF8 can transfer the encoding list result of UNICODE to.
S102: according to the former font file of described code bit sequence list and described e-book, the mode by the Hanzi internal code that occurs in the described format description file of an index generates new font file.
In brief, the file of the Hanzi internal code GB18030 that this step namely obtains according to previous step, regenerate the code bit mapping table, the font index that exists out, do not have indexed font just need not leave in the character library, be packaged into the TTF(or the OpenType that simplify) character library, the character library that generates like this and EPUB file are fully corresponding.Be to generate the key of simplifying character library for the character code Map Sort of new word table, what be packaged into simplifies character library (TTF/OTF character library) and will finish unanimously with original character library, has just removed the data of irrelevant font.
Lower mask body is described in detail this step take the TTF font file as example.And OTF font file and TTF font file are basic identical.
Referring to shown in Figure 3, preferred, step S102 specifically can comprise:
S1021: obtain all data table items in the former font file, described all data table items comprise coding mapping table and other list items;
S1022: described code bit sequence list and described coding mapping table are compared, and the mode by the Hanzi internal code that occurs in the described format description file of an index generates new coding mapping table;
S1023: generate other new list items according to described new coding mapping table;
S1024: according to described new coding mapping table and described other new list items, synthetic new font file.
In brief, namely decompose the file structure of former TTF character library, then revise all data table items, be merged at last new TTF character library.The TTF(OTF that comprises the reading e-books appointment) table of all in font file data, then change into the new cmap table of simplifying according to UNICODE result and the cmap data that read, regenerate again the glyf table data of font description part, keep the font that needs, then according to cmap and glyf table, regenerate document alignment loca table, to modification and the generation of other correlation tables, finish at last the synthetic of each table, obtain e-book and specify TTF(OTF) simplify font file.Wherein relate to the encoding list according to the UNICODE after the EPUB parsing, compare with the coding mapping table (cmap table) of former character library and analyze, and the automatic coding mapping table of simplifying that generates, and according to new other list items of coding mapping table correction, such as offset table (loca table), font table (glyf table), and the modification of other correlation tables.
TrueType Font(TTF) file is comprised of some tables, analyzes the TTF table, and reconstruct TTF file is to generate the key of simplifying character library as required.The font description table is the general data item of statement TTF character contour, the TTF profile is to be made of quadratic B ezier splines, and profile is made of starting point, curve point and rectilinear point, and the front and back of point sequentially consist of the direction of profile, be outline counterclockwise, clockwise direction is interior profile.A Chinese character description is comprised of several profiles, and the data directory of this Chinese character is defined among the loca table.And the mapping of Hanzi internal code and loca table is finished by cmap table, and the outline data of all Chinese characters is described and is placed among the glyftable.After we read the e-book Hanzi internal code, the outline data of the Chinese character that occurs is retained among the glyf table, the Chinese character contour data that do not have to occur are no longer deposited, glyf table has just diminished like this, loca table and cmap table have also diminished simultaneously, all need to restructure according to new word table.The character library that regenerates like this is exactly the character library of simplifying for the certain electric philosophical works.
OTF simplifies the just the same of character library generating principle and TTF, and OTF character library and TTF character library are fully compatible, and Main Differences is that OTF is the character library form of compatible Adobe Postscript Type1.OTF also is comprised of profile, and the difference of TTF profile is that when explaining the OTF font file, processing mode is the same with TTF with three bezier curve and a straight line description.
S103: replace described former font file with described new font file, to finish simplifying of e-book character library.Referring to shown in Figure 4, preferred step S103 specifically can comprise:
S1031: the character library of revising in the described format description file selects to navigate to described new font file.
S1032: amended described format description file and described new font file are replaced original described format description file and described former font file, to finish simplifying of e-book character library.
In brief, the present embodiment method be input as EPUB file, TTF character library, be output as the EPUB file, simplify the TTF character library.And more than simplify the TTF/OTF character library, revise EPUB to replace the technical method of primary electron book, can be used as the high in the clouds service plan.
Embodiment two
Referring to Fig. 4, be the schematic diagram of embodiment of the invention two devices.The present embodiment discloses a kind of e-book treating apparatus, is used for simplifying the font file of e-book, and described device comprises:
Code bit sequence list generation unit 501 is used for the Hanzi internal code by the described format description file of format description file acquisition of resolving described e-book, and presses the ordering of prescribed coding order, forms the code bit sequence list of described format description file;
New font file generation unit 502 is used for the former font file according to described code bit sequence list and described e-book, and the mode by the Hanzi internal code that occurs in the described format description file of an index generates new font file;
Character library replacement unit 503 is used for replacing described former font file with described new font file, to finish simplifying of e-book character library.
Preferably, described format description file is specially the EPUB file, and described code bit sequence list generation unit specifically comprises:
The chained list registering section is obtained subelement 5011, is used for by resolving the described EPUB file of the described EPUB file acquisition chained list registering section of described e-book;
The code bit sequence list obtains subelement 5012, is used for traveling through described chained list registering section, and the paragraph content of each pagefile is turned to UNICODE conversion and ordering as UTF-8, forms described code bit sequence list.
Preferably, described new font file and described former font file are specially TTF font file or OTF font file, and described new font file generation unit specifically comprises:
Data table items obtains subelement 5021, is used for obtaining all data table items of former font file, and described all data table items comprise coding mapping table and other list items;
The coding mapping table generates subelement 5022, is used for described code bit sequence list and described coding mapping table are compared, and the mode by the Hanzi internal code that occurs in the described format description file of an index generates new coding mapping table;
Other list items generate subelement 5023, are used for generating other new list items according to described new coding mapping table;
Font file synthon unit 5024 is used for according to described new coding mapping table and described other new list items, synthetic new font file.
Preferably, described character library replacement unit specifically comprises:
Format description file modification subelement 5031 selects to navigate to described new font file for the character library of revising described format description file;
File is replaced subelement 5032, is used for amended described format description file and described new font file are replaced original described format description file and described former font file, to finish simplifying of e-book character library.
For device embodiment, because it corresponds essentially to embodiment of the method, so relevant part gets final product referring to the part explanation of embodiment of the method.Device embodiment described above only is schematic, wherein said unit as the separating component explanation can or can not be physically to separate also, the parts that show as the unit can be or can not be physical locations also, namely can be positioned at a place, perhaps also can be distributed on a plurality of network element.Can select according to the actual needs wherein some or all of module to realize the purpose of the present embodiment scheme.Those of ordinary skills namely can understand and implement in the situation that do not pay creative work.
The present invention can describe in the general context of the computer executable instructions of being carried out by computing machine, for example program module.Usually, program module comprises the routine carrying out particular task or realize particular abstract data type, program, object, assembly, data structure etc.Also can in distributed computing environment, put into practice the present invention, in these distributed computing environment, be executed the task by the teleprocessing equipment that is connected by communication network.In distributed computing environment, program module can be arranged in the local and remote computer-readable storage medium that comprises memory device.
One of ordinary skill in the art will appreciate that all or part of step that realizes in the said method embodiment is to come the relevant hardware of instruction to finish by program, described program can be stored in the computer read/write memory medium, here alleged storage medium, as: ROM, RAM, magnetic disc, CD etc.
Also need to prove, in this article, relational terms such as the first and second grades only is used for an entity or operation are separated with another entity or operational zone, and not necessarily requires or hint and have the relation of any this reality or sequentially between these entities or the operation.And, term " comprises ", " comprising " or its any other variant are intended to contain comprising of nonexcludability, thereby not only comprise those key elements so that comprise process, method, article or the equipment of a series of key elements, but also comprise other key elements of clearly not listing, or also be included as the intrinsic key element of this process, method, article or equipment.In the situation that not more restrictions, the key element that is limited by statement " comprising ... ", and be not precluded within process, method, article or the equipment that comprises described key element and also have other identical element.
The above is preferred embodiment of the present invention only, is not for limiting protection scope of the present invention.Used specific case herein and principle of the present invention and embodiment have been carried out lock stated, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, all will change in specific embodiments and applications.In sum, this description should not be construed as limitation of the present invention.All any modifications of doing within the spirit and principles in the present invention, be equal to replacement, improvement etc., all be included in protection scope of the present invention.

Claims (10)

1. a method of simplifying the e-book character library is characterized in that, described method comprises:
By the Hanzi internal code in the described format description file of format description file acquisition of resolving e-book, and press the ordering of prescribed coding order, form the code bit sequence list of described format description file;
According to the former font file of described code bit sequence list and described e-book, the mode by the Hanzi internal code that occurs in the described format description file of an index generates new font file;
Replace described former font file with described new font file, to finish simplifying of e-book character library.
2. method according to claim 1 is characterized in that, described format description file is specially the EPUB file.
3. method according to claim 2, it is characterized in that, described by the Hanzi internal code in the described format description file of format description file acquisition of resolving e-book, and press the ordering of prescribed coding order, form the code bit sequence list of described format description file, specifically comprise:
By chained list registering section in the described EPUB file of described EPUB file acquisition of resolving described e-book;
Travel through described chained list registering section, the paragraph content of each pagefile is turned to UNICODE conversion and ordering as UTF-8, form described code bit sequence list.
4. method according to claim 2 is characterized in that, described new font file and described former font file are specially TTF font file or OTF font file.
5. method according to claim 4 is characterized in that, described former font file according to described code bit sequence list and described e-book, and the mode by the Hanzi internal code that occurs in the described format description file of an index generates new font file, specifically comprises:
Obtain all data table items in the former font file, described all data table items comprise coding mapping table and other list items;
Described code bit sequence list and described coding mapping table are compared, and the mode by the Hanzi internal code that occurs in the described format description file of an index generates new coding mapping table;
Generate other new list items according to described new coding mapping table;
According to described new coding mapping table and described other new list items, synthetic new font file.
6. method according to claim 1 is characterized in that, the described new font file of described usefulness is replaced described former font file, to finish simplifying of e-book character library, specifically comprises:
The character library of revising in the described format description file selects to navigate to described new font file;
Amended described format description file and described new font file are replaced original described format description file and described former font file, to finish simplifying of e-book character library.
7. an e-book treating apparatus is characterized in that, is used for simplifying the font file of e-book, and described device comprises:
Code bit sequence list generation unit is used for the Hanzi internal code by the described format description file of format description file acquisition of resolving described e-book, and presses the ordering of prescribed coding order, forms the code bit sequence list of described format description file;
New font file generation unit is used for the former font file according to described code bit sequence list and described e-book, and the mode by the Hanzi internal code that occurs in the described format description file of an index generates new font file;
The character library replacement unit is used for replacing described former font file with described new font file, to finish simplifying of e-book character library.
8. device according to claim 7 is characterized in that, described format description file is specially the EPUB file, and described code bit sequence list generation unit specifically comprises:
The chained list registering section is obtained subelement, is used for by resolving the described EPUB file of the described EPUB file acquisition chained list registering section of described e-book;
The code bit sequence list obtains subelement, is used for traveling through described chained list registering section, and the paragraph content of each pagefile is turned to UNICODE conversion and ordering as UTF-8, forms described code bit sequence list.
9. device according to claim 7 is characterized in that, described new font file and described former font file are specially TTF font file or OTF font file, and described new font file generation unit specifically comprises:
Data table items obtains subelement, is used for obtaining all data table items of former font file, and described all data table items comprise coding mapping table and other list items;
The coding mapping table generates subelement, is used for described code bit sequence list and described coding mapping table are compared, and the mode by the Hanzi internal code that occurs in the described format description file of an index generates new coding mapping table;
Other list items generate subelement, are used for generating other new list items according to described new coding mapping table;
Font file synthon unit is used for according to described new coding mapping table and described other new list items, synthetic new font file.
10. device according to claim 7 is characterized in that, described character library replacement unit specifically comprises:
Format description file modification subelement selects to navigate to described new font file for the character library of revising described format description file;
File is replaced subelement, is used for amended described format description file and described new font file are replaced original described format description file and described former font file, to finish simplifying of e-book character library.
CN2012103480942A 2012-09-18 2012-09-18 Method for simplifying character library of electronic book, and processing device for electronic book Pending CN102880690A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012103480942A CN102880690A (en) 2012-09-18 2012-09-18 Method for simplifying character library of electronic book, and processing device for electronic book

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012103480942A CN102880690A (en) 2012-09-18 2012-09-18 Method for simplifying character library of electronic book, and processing device for electronic book

Publications (1)

Publication Number Publication Date
CN102880690A true CN102880690A (en) 2013-01-16

Family

ID=47482016

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012103480942A Pending CN102880690A (en) 2012-09-18 2012-09-18 Method for simplifying character library of electronic book, and processing device for electronic book

Country Status (1)

Country Link
CN (1) CN102880690A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462026A (en) * 2014-12-10 2015-03-25 中国科学院软件研究所 System and method for providing cloud word stock service
CN104965830A (en) * 2014-06-06 2015-10-07 腾讯科技(深圳)有限公司 Method and apparatus for updating a character
CN108197103A (en) * 2017-12-27 2018-06-22 掌阅科技股份有限公司 Electronics breviary inteilectual is into method, electronic equipment and computer storage media
CN108763179A (en) * 2018-05-15 2018-11-06 掌阅科技股份有限公司 The modification method and computing device of mark position in e-book
CN112395828A (en) * 2020-12-18 2021-02-23 深圳高通半导体有限公司 Recording method of dot matrix character library information and generating method of dot matrix character
CN113326087A (en) * 2021-05-19 2021-08-31 翱捷科技股份有限公司 Chinese character library processing method and device on LVGL

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1779620A (en) * 2004-11-26 2006-05-31 佛山市顺德区瑞图万方科技有限公司 Building method for Chinese inputting system, Chinese phonetic inputting system and method
US20080028304A1 (en) * 2006-07-25 2008-01-31 Monotype Imaging, Inc. Method and apparatus for font subsetting
CN101963954A (en) * 2009-07-24 2011-02-02 康佳集团股份有限公司 Method and device for displaying words
CN102043764A (en) * 2009-10-16 2011-05-04 埃克斯坦西斯股份有限公司 Reduced glyph font files

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1779620A (en) * 2004-11-26 2006-05-31 佛山市顺德区瑞图万方科技有限公司 Building method for Chinese inputting system, Chinese phonetic inputting system and method
US20080028304A1 (en) * 2006-07-25 2008-01-31 Monotype Imaging, Inc. Method and apparatus for font subsetting
CN101963954A (en) * 2009-07-24 2011-02-02 康佳集团股份有限公司 Method and device for displaying words
CN102043764A (en) * 2009-10-16 2011-05-04 埃克斯坦西斯股份有限公司 Reduced glyph font files

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
吕强,史磊,杨季文: "《TrueType字体文件格式初探》", 《计算机研究与发展》, vol. 32, no. 11, 15 November 1995 (1995-11-15) *
王黎黎: "《电子书阅读软件的设计与实现》", 《中国优秀硕士学位论文全文数据库》, 30 November 2011 (2011-11-30), pages 138 - 59 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104965830A (en) * 2014-06-06 2015-10-07 腾讯科技(深圳)有限公司 Method and apparatus for updating a character
CN104965830B (en) * 2014-06-06 2018-07-17 腾讯科技(深圳)有限公司 A kind of character update method and device
US10699059B2 (en) 2014-06-06 2020-06-30 Tencent Technology (Shenzhen) Company Limited Character updating method and apparatus
CN104462026A (en) * 2014-12-10 2015-03-25 中国科学院软件研究所 System and method for providing cloud word stock service
CN108197103A (en) * 2017-12-27 2018-06-22 掌阅科技股份有限公司 Electronics breviary inteilectual is into method, electronic equipment and computer storage media
CN108197103B (en) * 2017-12-27 2019-05-17 掌阅科技股份有限公司 Electronics breviary inteilectual is at method, electronic equipment and computer storage medium
CN108763179A (en) * 2018-05-15 2018-11-06 掌阅科技股份有限公司 The modification method and computing device of mark position in e-book
CN108763179B (en) * 2018-05-15 2019-04-09 掌阅科技股份有限公司 The modification method of mark position and calculating equipment in e-book
CN112395828A (en) * 2020-12-18 2021-02-23 深圳高通半导体有限公司 Recording method of dot matrix character library information and generating method of dot matrix character
CN113326087A (en) * 2021-05-19 2021-08-31 翱捷科技股份有限公司 Chinese character library processing method and device on LVGL
CN113326087B (en) * 2021-05-19 2023-01-24 翱捷科技股份有限公司 Chinese character library processing method and device on LVGL

Similar Documents

Publication Publication Date Title
CN102880690A (en) Method for simplifying character library of electronic book, and processing device for electronic book
CN108154211B (en) Two-dimensional code generation method, two-dimensional code generation device, two-dimensional code service processing equipment and two-dimensional code
CN109284145A (en) The generation of multilingual configuration file and methods of exhibiting and device, equipment and medium
CN113051285B (en) SQL sentence conversion method, system, equipment and storage medium
CN104035754A (en) XML (Extensible Markup Language)-based custom code generation method and generator
CN103582883A (en) Improved encoding and decoding of variable-length data with group formats
CN111126019B (en) Report generation method and device based on mode customization and electronic equipment
CN104579360B (en) A kind of method and apparatus of data processing
CN101526963A (en) Method for identifying web page coding, device and terminal equipment
CN105786921B (en) A kind of the data module method for transformation and device of non-structured document
CN106557307B (en) Service data processing method and system
CN110543303A (en) Visual business platform
CN101963954A (en) Method and device for displaying words
CN102567514B (en) Data processing method and system
CN105187258A (en) Method for analyzing and processing configurable Trap alarm
CN104615431A (en) Method and device for achieving Web application
CN104199812A (en) Data system and method supporting multiple languages
CN103971036A (en) Page field access control system and method
CN107566090B (en) Fixed-length/variable-length text message processing method and device
CN105528345A (en) Terminal, server and character complementing method
CN106407288B (en) Method and system for synchronously updating information
CN105488471A (en) Character pattern recognition method and device
CN101727505B (en) Efficient data processing method and device
CN103365894A (en) Font format conversion method and device
CN105740239A (en) Translation method and system of character on webpage

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20130116