CN104978341A - File processing method and equipment, and network system - Google Patents

File processing method and equipment, and network system Download PDF

Info

Publication number
CN104978341A
CN104978341A CN201410138848.0A CN201410138848A CN104978341A CN 104978341 A CN104978341 A CN 104978341A CN 201410138848 A CN201410138848 A CN 201410138848A CN 104978341 A CN104978341 A CN 104978341A
Authority
CN
China
Prior art keywords
file
content
css
document handling
designator
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410138848.0A
Other languages
Chinese (zh)
Inventor
黄欢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201410138848.0A priority Critical patent/CN104978341A/en
Priority to PCT/CN2015/076112 priority patent/WO2015154680A1/en
Publication of CN104978341A publication Critical patent/CN104978341A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor

Abstract

The invention discloses a file processing method equipment, which comprises a content reader, an identifier finder, an identifier processor and an output device, wherein the content reader is suitable for reading the file content of a file to be processed; the identifier finder is suitable for looking up a scheduled identifier from the read file content; the identifier reader is suitable for looking up the scheduled identifier according to the identifier and processing the read file content; and the output device is suitable for outputting the processed file content. The invention also discloses a corresponding file processing method and the network system which comprises the file processing equipment.

Description

A kind of document handling method, equipment and network system
Technical field
The present invention relates to internet arena, the file particularly related to being published to server carries out pretreated document handling method, document handling apparatus and network system.
Background technology
Along with the development of internet, increasing user carrys out obtaining information by internet, and the information of internet is generally stored on the server that is linked in network.User can access these servers and carry out obtaining information.Along with the development of network technology, the information that user obtains is also more and more abundanter.Present Webpage and the interactivity of user and can readabilityly also come might as well.Such as, in webpage, the webpage associated documents such as a large amount of CSS files can be adopted to define the display mode of the page and the interactive mode etc. with user.
Along with webpage is day by day complicated, the content of CSS file also gets more and more.For the website of high access, user is from after server obtains CSS file, this file can be resolved in the networking client application of such as browser and so on, if this file is not optimised, client can be caused on the one hand to resolve the efficiency of this class file lower, also may cause repeatedly from server request file because CSS file refer to other file on the server on the other hand, cause document analysis speed significantly to reduce.
Therefore, need a kind of can by the webpage associated documents of such as CSS file and so on upload onto the server to be downloaded by user before carry out pre-service optimization so that user can download this class file from server efficiently and carry out the new departure of resolving.
Summary of the invention
For this reason, the invention provides a kind of new scheme to try hard to solve or at least alleviate Problems existing above.
According to an aspect of the present invention, a kind of document handling apparatus is provided.This document handling apparatus comprises content reader, is suitable for reading the file content that will process file; Identifier lookup device, is suitable in read file content, search predetermined mark; Identification processor, is suitable for the predetermined mark found according to identifier lookup device, processes read file content; And output unit, be suitable for exporting treated file content.
Alternatively, in document handling apparatus according to the present invention, predetermined mark comprises various process designator, and identification processor comprises encode processor, be suitable for when predefined identifier is coded treatment designator, according to the instruction be associated with coded treatment designator, the non-English character in file content is replaced with the numeric string after unicode coding.
Alternatively, in document handling apparatus according to the present invention, identification processor also comprises file and imports processor, be suitable for when making a reservation for be designated file importing process designator, file reading imports the file content of process indicated by designator, and this content is added in the file content of content reader reading.
Alternatively, in document handling apparatus according to the present invention, identification processor comprises default processor, is suitable for, when making a reservation for be designated default process designator, deleting the content be associated with default process designator in read file content.
Alternatively, also comprise content compression device according to document handling apparatus of the present invention, be suitable for before output unit output file content, compression process is carried out to file content.
Alternatively, in document handling apparatus according to the present invention, file to be processed is cascade pattern (CSS) file, and described process designator is the CSS instruction started with "@".
According to another aspect of the present invention, provide a kind of document handling method, the method is suitable for performing in document handling apparatus according to the present invention, and comprises step: read the file content that will process file; Predetermined mark is searched in read file content; According to the predetermined mark found, read file content is processed; And export treated file content.
Also have an aspect according to of the present invention, provide a kind of network system, this system comprises server, multiple client of carrying out communicating via network and server; And comprise and provide equipment according to the file of document handling apparatus of the present invention, its file provides devices, coupled to arrive server, file transfer after document handling apparatus process is stored wherein to server, to go forward side by side row relax to download this file by any one in multiple client.
According to document handling modus of the present invention, can the webpage associated documents of such as CSS file and so on are uploaded server for user from client downloads and resolve before, carry out various pre-service optimization, such as can process multi-language text in advance, thus avoid client and need to resolve off-gauge language content; Can in advance by other text merge of quoting in webpage related text, thus avoid the repeatedly server request of client; The useless text in webpage related text can be deleted in advance, thus decrease the size of webpage related text, save flow etc.
Accompanying drawing explanation
In order to realize above-mentioned and relevant object; combine description below and accompanying drawing herein to describe some illustrative aspect; these aspects indicate the various modes can putting into practice principle disclosed herein, and all aspects and equivalent aspect thereof are intended to fall in the scope of theme required for protection.Read detailed description below in conjunction with the drawings, above-mentioned and other object of the present disclosure, Characteristics and advantages will become more obvious.Throughout the disclosure, identical Reference numeral is often referred to for identical parts or element.
Fig. 1 shows the schematic diagram of network system according to an embodiment of the invention;
Fig. 2 shows the schematic diagram of document handling apparatus according to an embodiment of the invention; And
Fig. 3 shows the process flow diagram of document handling method according to an embodiment of the invention.
Embodiment
Below with reference to accompanying drawings exemplary embodiment of the present disclosure is described in more detail.Although show exemplary embodiment of the present disclosure in accompanying drawing, however should be appreciated that can realize the disclosure in a variety of manners and not should limit by the embodiment set forth here.On the contrary, provide these embodiments to be in order to more thoroughly the disclosure can be understood, and complete for the scope of the present disclosure can be conveyed to those skilled in the art.
Fig. 1 shows the schematic diagram of network system 100 according to an embodiment of the invention.As shown in Figure 1, network system 100 comprises multiple client 110,120 and 130, provides the server 140 of the network information (such as webpage) for these clients.These equipment intercom mutually via internet.
All clients are indicated hereinafter by for client 110.In the present invention, can to be that this area is any can present the network information (the such as webpage and file (the such as CSS file that is associated that obtain from server 140 to client 110-130, JS file and multimedia file etc.)) equipment, include but not limited to desktop computer, notebook computer, personal digital assistant, intelligent mobile terminal, panel computer etc.Usually run modern operating system in client 110-130, utilize the hardware resource that operating system is come in administrative client.Client 110 is run usually and has web browser, for presenting the webpage obtained from server 140 and the data (image quoted such as webpage, Voice & Video etc.) be associated.
Server 140 is answered the request of client 110 and the asked network information is returned client 110.The various files that the network information is generally Webpage and quotes in the page, such as image file, the multimedia file such as audio frequency and/or video file, and the webpage associated documents of such as CSS file and JS file and so on.In practice, when client 110 is from after server 140 requests web page files, client 110 is when resolving and present the information in this web page files, the file that can quote from server 140 downloading web pages file subsequently, if these files also refer to further other file, then client 110 can continue to ask to download these files from server 140.
In addition, in network system 100, also comprising file provides equipment 150.File provide equipment 150 to upload onto the server 140 various network associated documents anticipate.Such as provide in equipment 150 at file, can edit webpage and associated documents thereof, on device 150 preview webpage present effect etc., file transfer after being provided by file equipment 150 to process subsequently also stores wherein, to download this file by client from server 140 to server 140.
File provides in equipment 150 and also comprises document handling apparatus 200.Document handling apparatus 200 can provide the good webpage associated documents of equipment 150 inediting (such as, CSS file etc.) to carry out pre-service to file.Specifically, document handling apparatus 200 can carry out pre-service to these files before uploading in server 140 by webpage associated documents, thus the content of these files is optimized, and the webpage associated documents through optimizing are transferred in server 140, such client 110 is when to server 140 requested webpage, webpage and the associated documents through optimizing can be obtained from server 140, and present the content of these webpages in client.
Fig. 2 shows the schematic diagram of document handling apparatus 200 according to an embodiment of the invention.As shown in Figure 2, document handling apparatus 200 comprises content reader 210, identifier lookup device 220, identification processor 230 and output unit 240.
Generally speaking, document handling apparatus 200 can provide the user of equipment 150 to start by the resident file wherein of document handling apparatus 200 and process this user ready various webpage associated documents.Such as file provides the user of equipment 150 to edit webpage, and prepares these webpages and the content that is associated to release to be obtained from server 140 by client 110 and to be presented on the browser of client 110.Subsequently, file provides the user of equipment 150 will be optimized pre-service to various webpage associated documents by startup file treatment facility 200.User usually can specify and will carry out pretreated file directory.
Webpage associated documents can be that such as CSS file, meeting be called by webpage and be helped in client to present web page contents and/or the mutual associated documents between raising and the user of client.These files meet HTML standard, and are widely used in HTML.In these webpage associated documents, also can quote other file outside these files.
Such as, when webpage associated documents are CSS file, hereinafter show a fragment of CSS file:
@charset"utf-8";
/ * assembly css*/
@import url('top_nav.header.footer.css');
@import url('dialog.css');
@import url (' upper_my_info.css'); My signature speech region * of/*/
@import url('myspace_menu.css');
@import url('upper_u_info.css');
@import url('space_menu.css');
@import url('app_menu.css');
@import url (' clm.css'); / * achieve badge/ play recently/hot issue/interested people/I good friend/visiting * recently/
@import url (' wb_content.css'); / * content of microblog */
@import url (' manage_subnav.css'); / * navigation */
@media screen{BODY{font-size:12pt;}}
// printer font size is set
@media print{
@import"print.css"
BODY{font-size:8pt;}
}
.game{margin:10px25px025px;border:solid1px#c9edc3;border-radius:5px;}
.game.show_gamebox{width:960px;margin:0auto;clear:both;padding:10px015px;overflow:hidden;border-bottom:solid0#D8D8D8;}
.game.show_game{border:2px solid black;float:left;height:520px;width:700px;margin-left:4px;}
.game.show_list{float:left;height:520px;margin-left:3px;width:230px;background-color:#e7ffe6;border:1px solid#dbf2da;border-radius:5px;}
.game.show_list ul{width:220px;height:470px;overflow:hidden;margin:30px5px0;}
.game.blues{color:#125A9C;}
.game.red{color:red;}
In the above example, can find out, at CSS file, exist with the various process designators of@beginning, the character set that such as@charset specifies the style sheet in CSS file to use, @import specifies the position of other style sheet be introduced in this CSS style sheet, and@media specifies medium type etc.
Document handling apparatus 200 one by one processes the file under user's assigned catalogue, and such as document handling apparatus 200 can find all webpage associated documents under this catalogue, and processes one by one.Such as, document handling apparatus 200 can search the file that All Files suffix under this catalogue is called css, and processes one by one.
Content reader 210 can read the file content of webpage associated documents.Such as, the mass storage of the file content of webpage associated documents from such as hard disk and so on can read buffer memory, so that subsequent treatment by content reader 210.In one embodiment, content reader 210 even can such as utilize HTML engine to this file content analysis, thus obtains the institutional framework of content in this file.Such as with regard to CSS, the label (tag) of each CSS and corresponding value (value) can be obtained, such as, can obtain each with the designator of beginning and the content after it.Certainly, according to another embodiment of the invention, directly content can be read out row cache of going forward side by side.
Identifier lookup device 220 is analyzed read file content subsequently, with the various predetermined mark found out in this file content and identify corresponding content with these.According to one embodiment of present invention, webpage associated documents are CSS file, and the predetermined CSS instruction being designated "@" and starting, each CSS instruction searched by identifier lookup device 220 in read file content, and searches command content corresponding to each CSS instruction.According to still another embodiment of the invention, when content reader 210 obtains the institutional framework of file, then identifier lookup device 220 can carry out traveling through to search corresponding CSS instruction and corresponding content in this institutional framework.It should be noted that the present invention is not limited to the specific implementation of identifier lookup device 220, all modes can searching predetermined mark and corresponding content from read file content are all within protection scope of the present invention.
Such as, in the above example, identifier lookup device 220 can find multiple CSS instruction, such as@charset ,@import and@media etc., and finds corresponding command content.Such as, the command content corresponding to@charset instruction is " utf8 ", and the command content corresponding to@import is the CSS file of each application, and the content corresponding to@media is arranging of printer.
After identifier lookup device 220 finds each CSS instruction, identification processor 230 is according to each CSS instruction, based on the command content corresponding with this CSS instruction, CSS file is processed, thus make this CSS content follow-up can more efficiently be obtained by client 110 and client resolve and process.Identification processor 230 according to the difference of CSS instruction, and can adopt different processing modes.
According to one embodiment of present invention, identification processor 230 comprises encode processor 232.When identification processor CSS instruction to be processed is coded treatment instruction character (i.e.@charset), identification processor 230 adopts encode processor 232 to process this CSS instruction.The command content of@charset instruction determined by encode processor 232, then according to all non-English character in the coded system file reading content of@charset command content definition, and these non-English characters are converted to unicode(Unicode) content after coded system coding.In addition alternatively, the command content of@charset instruction can be revised as " unicode " by encode processor 232.Because the default meeting of browser to process the content of CSS file in the mode of unicode, therefore encode processor 232 even can delete this instruction, thus reduces the size of CSS file.
Such as, when the non-English character in CSS content is " Chinese character ", encode processor 232 can be converted into " 0x6c49 " and " 0x5b57 ".Like this, when client 110 obtains this CSS file and resolves its content, again carry out code conversion with regard to not needing, and directly read the content of unicode, thus save the time of client process CSS.
Also have an embodiment according to of the present invention, identification processor 230 comprises file and imports processor 234.When identification processor CSS instruction to be processed is file importing processing instruction symbol (i.e.@import), identification processor 230 adopts file to import processor 234 to process this CSS instruction.File imports the command content that@import instruction determined by processor 234, the CSS document location of then specifying according to this command content provides from file the file content reading this CSS file equipment 150, and is inserted in the content that content reader 210 reads.Because each CSS can define process and the presentation mode of different element usually; generally there is not the problem of tandem in the content of each CSS; therefore; file importing processor 234 can read the file content indicated by@import instruction, and is inserted into ending place of content reader 210 reading of content.Subsequently, file imports processor 234 and can delete corresponding@import instruction.
Such as, in the above example, there is a file and import instruction:
@import url('top_nav.header.footer.css');
Identifier lookup device 220 can identify this file importing instruction and indicate the file that will import to be top_nav.header.footer.css, file imports the content that processor 230 can read this file top_nav.header.footer.css subsequently, and read content is added to the end place of the content that content reader 210 reads.
Derived the process of processor 234 by file, further other CSS Piece file mergences CSS file can quoted are in a CSS file.After client 110 obtains this CSS file, do not need resolve@import instruction and go server 140 to obtain further other CSS files again, thus the file download request number of times decreased between client 110 and server 140, accelerate the speed that client 110 resolves CSS file.
According to one embodiment of present invention, identification processor 230 also comprises default processor 236.When identification processor CSS instruction to be processed is other with the instruction character of@beginning, identification processor 230 adopts default processor 236 to process these instructions.Due to when resolving CSS, other does not act on for presenting of webpage with the instruction of@beginning, in order to improve the processing speed to CSS file at client 110 place, by default processor 236, these@instruction characters and corresponding command content can be deleted, thus make client after acquisition CSS file, the@instruction character that these are irrelevant need not be resolved, and improve treatment effeciency.
Such as, in the above example, there is@media instruction and related content thereof:
@media screen{BODY{font-size:12pt;}}
// printer font size is set
@media print{
@import"print.css"
BODY{font-size:8pt;}
}
Because@media instruction can't affect to some extent on presenting web page contents in client, therefore, identification processor 230 can call the command content that default processor 236 is deleted this instruction in the content that content reader 210 reads and is associated.Like this, when client 110 resolves treated CSS file, in this file, there is not the default process instruction of such as media and so on, thus need not carry out extra process to these instructions yet, thus decrease the time that client 110 resolves CSS file.
After identification processor 230 completes process, output unit 240 can read the content of treated webpage associated documents, and outputs to the outside of document handling apparatus 200.Such as, treated webpage associated documents can directly be transferred in server 140 by output unit 240, also this locality of equipment 150 can be provided to store at file, and after waiting for that all webpage associated documents process complete, unification provides equipment 150 be transferred to server 140 and store by file.
Alternatively, document handling apparatus 200 also comprises content compression device 250.Content compression device 250 before output unit 240 exports webpage associated documents, can carry out compression process to treated webpage associated documents.Browser due to present client 110 place all supports that the file receiving compression is gone forward side by side row relax, so, webpage associated documents can be carried out compressing and be stored in server 140, the file that client 110 can directly obtain through overcompression processes, and can save network bandwidth consumption between clients and servers so further.
In addition, content compression device 250 can also carry out some optimization processes, such as, can delete the idle characters such as carriage return unnecessary in webpage associated documents, space, make the size of webpage associated documents less.When the enormous amount of client 110, a few modifications of webpage associated documents size just can save a large amount of network bandwidths.
Utilize according to document handling apparatus 200 of the present invention, the webpage associated documents of such as CSS file and so on can uploaded before server 140 to be downloaded from client 110 for user and resolved, carry out various pre-service optimization, such as can process multi-language text in advance, thus avoid client 110 and need to resolve off-gauge language content; Can in advance by other text merge of quoting in webpage related text, thus avoid the repeatedly server request of client; The useless text in webpage related text can be deleted in advance, thus decrease the size of webpage related text, save flow etc.Document handling apparatus 200 by carrying out optimization process in advance to the webpage associated documents of such as CSS file instruction, thus makes client 110 more efficiently to resolve and processes various webpage associated documents.
Fig. 3 shows the process flow diagram of document handling method 300 according to an embodiment of the invention.The method file be suitable for shown in Fig. 1 provides equipment 150, performs, to be published in server 140 by webpage associated documents in the computing equipment 200 especially shown in Fig. 2.
Generally speaking, method 300 can provide the user of equipment 150 to call by file and process this user ready various webpage associated documents.Such as file provides the user of equipment 150 to write webpage associated documents, and prepares these file distribution to go out to be obtained from server 140 by client 110 and to be presented on the browser of client 110.Subsequently, file provides the user of equipment 150 will carry out pre-service optimization to various webpage associated documents by call method 300.User usually can specify and will carry out pretreated file directory.
Webpage associated documents can be that such as CSS file and so on, meeting be called by webpage and be helped in client to present web page contents and/or the mutual page associated documents between raising and the user of client.These files meet HTML standard, and are widely used in HTML.In these webpage associated documents, also can quote other webpage associated documents.
Give the concrete example of webpage associated documents as CSS in the description of composition graphs 2 above, repeated no more here.The method is suitable for one by one processing the file under user's assigned catalogue.Such as, the method can search the file that All Files suffix under this catalogue is called css, and processes one by one.
The method starts from step S310.In step S310, read the file content of webpage associated documents.Such as, in step S310, the mass storage of the file content of webpage associated documents from such as hard disk and so on can be read buffer memory, so that subsequent treatment.In one embodiment, HTML engine even such as can be utilized to analyze this file content, thus obtain the institutional framework of content in this file.Such as with regard to CSS, the label (tag) of each CSS and corresponding value (value) can be obtained.Certainly, according to another embodiment of the invention, directly content can be read out row cache of going forward side by side.
Subsequently, in step s 320, the file content read in step S310 is analyzed, with the various predetermined mark found out in this file content and identify corresponding content with these.According to one embodiment of present invention, webpage associated documents are CSS file, and the predetermined CSS instruction being designated "@" and starting, in step s 320, in read file content, search each CSS instruction, and search command content corresponding to each CSS instruction.According to still another embodiment of the invention, when obtaining the institutional framework of file in step S310, then in step s 320, can carry out traveling through to search corresponding CSS instruction and corresponding content in this institutional framework.It should be noted that the present invention is not limited to the specific implementation of step S320, all modes can searching predetermined mark and corresponding content from read file content are all within protection scope of the present invention.
Such as, in the above example, in step s 320, multiple CSS instruction can be found, such as@charset ,@import and@media etc., and find corresponding command content.Such as, the command content corresponding to@charset instruction is " utf8 ", and the command content corresponding to@import is the CSS file of each application, and the content corresponding to@media is arranging of printer.
After finding each predetermined mark in step s 320, in step S330, according to each CSS instruction, based on the command content corresponding with this CSS instruction, CSS file is processed, thus make this CSS content follow-up can more efficiently be obtained by client 110 and client resolve and process.Step S330 according to the difference of CSS instruction, and can adopt different processing modes.
According to one embodiment of present invention, when CSS instruction to be processed is coded treatment instruction character (i.e.@charset), in step S330, determine the command content of@charset instruction, then according to all non-English character in the coded system file reading content of@charset command content definition, and these non-English characters are converted to unicode(Unicode) content after coded system coding.In addition alternatively, in step S330, the command content of@charset instruction can be revised as " unicode ".Because the default meeting of browser to process the content of CSS file in the mode of unicode, therefore step S330 even can delete this instruction, thus reduces the size of CSS file.
Such as, when the non-English character in CSS content is " Chinese character ", the process of step S330 can be converted into " 0x6c49 " and " 0x5b57 ".Like this, when client 110 obtains this CSS file and resolves its content, again carry out code conversion with regard to not needing, and directly read the content of unicode, thus save the time of client process CSS.
An embodiment is also had according to of the present invention, when CSS instruction to be processed in step S330 is file importing processing instruction symbol (i.e.@import), determine the command content of@import instruction, the CSS document location of then specifying according to this command content provides from file the file content reading this CSS file equipment 150, and is inserted in the content that step S310 reads.Because each CSS can define process and the presentation mode of different element usually; generally there is not the problem of tandem in the content of each CSS, therefore, in step S330; the file content indicated by@import instruction can be read, and be inserted into ending place of step S310 institute reading of content.Subsequently, in step S330, corresponding@import instruction can be deleted.
Such as, in the above example, there is a file and import instruction:
@import url('top_nav.header.footer.css');
Step S330 can identify this file importing instruction and indicate the file that will import to be top_nav.header.footer.css, the content of this file top_nav.header.footer.css can be read subsequently, and add read content the end place of the content that step S310 reads to.
By above-mentioned process, further other CSS Piece file mergences CSS file can quoted are in a CSS file.After client 110 obtains this CSS file, do not need resolve@import instruction and go server 140 to obtain further other CSS files again, thus the file download request number of times decreased between client 110 and server 140, accelerate the speed that client 110 resolves CSS file.
An embodiment is also had according to of the present invention, when step S330 CSS instruction to be processed is other with the instruction character of@beginning, due to when resolving CSS, other does not act on for presenting of webpage with the instruction of@beginning, in order to improve the processing speed to CSS file at client 110 place, can in step S330, these@instruction characters and corresponding command content are deleted, thus make client after acquisition CSS file, the@instruction character that these are irrelevant need not be resolved, and improve treatment effeciency.
Such as, in the above example, there is@media instruction and related content thereof:
@media screen{BODY{font-size:12pt;}}
// printer font size is set
@media print{
@import"print.css"
BODY{font-size:8pt;}
}
Because@media instruction can't affect to some extent on presenting web page contents in client, therefore, in step S330, this instruction in the content that delete step S310 reads and the command content be associated.Like this, when client 110 resolves treated CSS file, in this file, there is not the default process instruction of such as media and so on, thus need not carry out extra process to these instructions yet, thus decrease the time that client 110 resolves CSS file.
Complete process in step S330 after, the content of treated webpage associated documents can be read in step S340, and output to outside.Such as, can directly treated webpage associated documents be transferred in server 140, also this locality of equipment 150 can be provided to store at file, and after waiting for that all webpage associated documents process complete, unification provide equipment 150 be transferred to server 140 and store by file.
Alternatively, document handling method 300 can also comprise step S350.Step S350 before step S340 exports webpage associated documents, can carry out compression process to treated webpage associated documents.Browser due to present client 110 place all supports that the file receiving compression is gone forward side by side row relax, so, webpage associated documents can be carried out compressing and be stored in server 140, the file that client 110 can directly obtain through overcompression processes, and can save network bandwidth consumption between clients and servers so further.
In addition, can also some optimization processes be carried out in step S350, such as, can delete the idle characters such as carriage return unnecessary in webpage associated documents, space, make the size of webpage associated documents less.When the enormous amount of client 110, a few modifications of webpage associated documents size just can save a large amount of network bandwidths.
Utilize according to document handling method 300 of the present invention, the webpage associated documents of such as CSS file and so on can uploaded before server 140 to be downloaded from client 110 for user and resolved, carry out various pre-service optimization, such as can process multi-language text in advance, thus avoid client 110 and need to resolve off-gauge language content; Can in advance by other text merge of quoting in webpage related text, thus avoid the repeatedly server request of client; The useless text in webpage related text can be deleted in advance, thus decrease the size of webpage related text, save flow etc.Document handling apparatus 200 by carrying out optimization process in advance to the webpage associated documents of such as CSS file instruction, thus makes client 110 more efficiently to resolve and processes various webpage associated documents.
In instructions provided herein, describe a large amount of detail.But can understand, embodiments of the invention can be put into practice when not having these details.In some instances, be not shown specifically known method, structure and technology, so that not fuzzy understanding of this description.
Similarly, be to be understood that, in order to simplify the disclosure and to help to understand in each inventive aspect one or more, in the description above to exemplary embodiment of the present invention, each feature of the present invention is grouped together in single embodiment, figure or the description to it sometimes.But, the method for the disclosure should be construed to the following intention of reflection: namely the present invention for required protection requires than the feature more multiple features clearly recorded in each claim.Or rather, as claims below reflect, all features of disclosed single embodiment before inventive aspect is to be less than.Therefore, the claims following embodiment are incorporated to this embodiment thus clearly, and wherein each claim itself is as independent embodiment of the present invention.
Those skilled in the art are to be understood that the module of the equipment in example disclosed herein or unit or assembly can be arranged in equipment as depicted in this embodiment, or alternatively can be positioned in one or more equipment different from the equipment in this example.Module in aforementioned exemplary can be combined as a module or can be divided into multiple submodule in addition.
Those skilled in the art are appreciated that and adaptively can change the module in the equipment in embodiment and they are arranged in one or more equipment different from this embodiment.Module in embodiment or unit or assembly can be combined into a module or unit or assembly, and multiple submodule or subelement or sub-component can be put them in addition.Except at least some in such feature and/or process or unit be mutually repel except, any combination can be adopted to combine all processes of all features disclosed in this instructions (comprising adjoint claim, summary and accompanying drawing) and so disclosed any method or equipment or unit.Unless expressly stated otherwise, each feature disclosed in this instructions (comprising adjoint claim, summary and accompanying drawing) can by providing identical, alternative features that is equivalent or similar object replaces.
In addition, those skilled in the art can understand, although embodiments more described herein to comprise in other embodiment some included feature instead of further feature, the combination of the feature of different embodiment means and to be within scope of the present invention and to form different embodiments.Such as, in the following claims, the one of any of embodiment required for protection can use with arbitrary array mode.
In addition, some in described embodiment are described as at this can by the processor of computer system or the method implemented by other device performing described function or the combination of method element.Therefore, there is the device of processor formation for implementing the method or method element of the necessary instruction for implementing described method or method element.In addition, the element described herein of device embodiment is the example as lower device: this device is for implementing the function performed by the element of the object in order to implement this invention.
As used in this, unless specifically stated so, use ordinal number " first ", " second ", " the 3rd " etc. to describe plain objects and only represent the different instances relating to similar object, and be not intended to imply the object be described like this must have the time upper, spatially, sequence aspect or in any other manner to definite sequence.
Although the embodiment according to limited quantity describes the present invention, benefit from description above, those skilled in the art understand, in the scope of the present invention described thus, it is contemplated that other embodiment.In addition, it should be noted that the language used in this instructions is mainly in order to object that is readable and instruction is selected, instead of select to explain or limiting theme of the present invention.Therefore, when not departing from the scope and spirit of appended claims, many modifications and changes are all apparent for those skilled in the art.For scope of the present invention, be illustrative to disclosing of doing of the present invention, and nonrestrictive, and scope of the present invention is defined by the appended claims.

Claims (10)

1. a document handling apparatus, comprising:
Content reader, is suitable for reading the file content that will process file;
Identifier lookup device, is suitable in read file content, search predetermined mark;
Identification processor, is suitable for the predetermined mark found according to described identifier lookup device, processes read file content; And
Output unit, is suitable for exporting treated file content.
2. document handling apparatus as claimed in claim 1, wherein said predetermined mark comprises various process designator, and described identification processor comprises encode processor, be suitable for when described predefined identifier is coded treatment designator, according to the instruction be associated with described coded treatment designator, the non-English character in described file content is replaced with the content after unicode coding.
3. document handling apparatus as claimed in claim 2, wherein said identification processor also comprises file and imports processor, be suitable for when described predetermined be designated file import process designator time, read and import with described file the file content processed indicated by designator, this content is added in the file content of described content reader reading.
4. the document handling apparatus as described in claim 2 or 3, wherein said identification processor comprises default processor, be suitable for when described predetermined be designated default process designator time, in read file content, delete the content be associated with described default process designator.
5. as the document handling apparatus in claim 1-4 as described in any one, also comprise content compression device, be suitable for before described output unit output file content, compression process is carried out to file content.
6. a document handling method, comprises step:
Reading will process the file content of file;
Predetermined mark is searched in read file content;
According to the predetermined mark found, read file content is processed; And
Export treated file content.
7. document handling method as claimed in claim 6, wherein said predetermined mark comprises various process designator, and describedly comprises the step that read file content processes:
When described predefined identifier is coded treatment designator, according to the instruction be associated with described coded treatment designator, the non-English character in described file content is replaced with the numeric string after unicode coding.
8. document handling method as claimed in claim 7, wherein saidly also comprises the step that read file content processes:
When described predetermined be designated file import process designator time, read and import with described file the file content processed indicated by designator, this content is added in the file content of described content reader reading.
9. the document handling method as described in claim 7 or 8, wherein saidly also comprises the step that read file content processes:
When described predetermined be designated default process designator time, in read file content, delete the content be associated with described default process designator.
10. a network system, comprises
Server;
Multiple client, communicates with server via network; And
File provides equipment, comprises as the document handling apparatus in claim 1-5 as described in any one,
Wherein said file provides devices, coupled to arrive server, the file transfer after described document handling apparatus process is stored wherein to server, to go forward side by side row relax to download this file by any one in multiple client.
CN201410138848.0A 2014-04-08 2014-04-08 File processing method and equipment, and network system Pending CN104978341A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201410138848.0A CN104978341A (en) 2014-04-08 2014-04-08 File processing method and equipment, and network system
PCT/CN2015/076112 WO2015154680A1 (en) 2014-04-08 2015-04-08 File processing method, device, and network system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410138848.0A CN104978341A (en) 2014-04-08 2014-04-08 File processing method and equipment, and network system

Publications (1)

Publication Number Publication Date
CN104978341A true CN104978341A (en) 2015-10-14

Family

ID=54274856

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410138848.0A Pending CN104978341A (en) 2014-04-08 2014-04-08 File processing method and equipment, and network system

Country Status (2)

Country Link
CN (1) CN104978341A (en)
WO (1) WO2015154680A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107818108A (en) * 2016-09-13 2018-03-20 阿里巴巴集团控股有限公司 A kind of webpage rendering intent, apparatus and system
CN108153786A (en) * 2016-12-02 2018-06-12 航天星图科技(北京)有限公司 A kind of document handling method
CN110688118A (en) * 2019-10-28 2020-01-14 郑州阿帕斯科技有限公司 Webpage optimization method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101127038A (en) * 2006-08-18 2008-02-20 鸿富锦精密工业(深圳)有限公司 System and method for downloading website static web page
CN102521266A (en) * 2011-11-18 2012-06-27 深圳中兴网信科技有限公司 Method for realizing dynamic compression and combination of JavaScript file
CN102622376A (en) * 2011-01-28 2012-08-01 北京千橡网景科技发展有限公司 Method and equipment used for merging cascading style sheet files
CN103220352A (en) * 2013-04-15 2013-07-24 福建伊时代信息科技股份有限公司 Terminal, server, file storage system and file storage method

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090150168A1 (en) * 2007-12-07 2009-06-11 Sap Ag Litigation document management
CN101237658B (en) * 2007-12-18 2012-06-27 中国移动通信集团设计院有限公司 Data storage method and device in mobile communication system
CN101706761A (en) * 2009-09-08 2010-05-12 厦门敏讯信息技术股份有限公司 Method and device for processing data of serial port parameters of wireless terminal
CN102006174B (en) * 2010-11-08 2015-01-28 中兴通讯股份有限公司 Data processing method and device based on online behavior of mobile phone user
CN102393849A (en) * 2011-07-18 2012-03-28 电子科技大学 Web log data preprocessing method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101127038A (en) * 2006-08-18 2008-02-20 鸿富锦精密工业(深圳)有限公司 System and method for downloading website static web page
CN102622376A (en) * 2011-01-28 2012-08-01 北京千橡网景科技发展有限公司 Method and equipment used for merging cascading style sheet files
CN102521266A (en) * 2011-11-18 2012-06-27 深圳中兴网信科技有限公司 Method for realizing dynamic compression and combination of JavaScript file
CN103220352A (en) * 2013-04-15 2013-07-24 福建伊时代信息科技股份有限公司 Terminal, server, file storage system and file storage method

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107818108A (en) * 2016-09-13 2018-03-20 阿里巴巴集团控股有限公司 A kind of webpage rendering intent, apparatus and system
CN108153786A (en) * 2016-12-02 2018-06-12 航天星图科技(北京)有限公司 A kind of document handling method
CN110688118A (en) * 2019-10-28 2020-01-14 郑州阿帕斯科技有限公司 Webpage optimization method and device

Also Published As

Publication number Publication date
WO2015154680A1 (en) 2015-10-15

Similar Documents

Publication Publication Date Title
US20220147696A1 (en) Enhanced links in curation and collaboration applications
US10795939B2 (en) Query method and apparatus
US20140331124A1 (en) Method for maintaining common data across multiple platforms
US20130326333A1 (en) Mobile Content Management System
US11005851B2 (en) Retrieving digital content over a network
US20150135061A1 (en) Systems and methods for parallel traversal of document object model tree
CN105988996B (en) Index file generation method and device
CN110688595B (en) File online preview method, device, equipment and storage medium
KR20130142121A (en) Multi-modal approach to search query input
US20110191328A1 (en) System and method for extracting representative media content from an online document
CN102184177A (en) Method and system for previewing file
US9633073B1 (en) Distributed data store for hierarchical data
US10289747B2 (en) Dynamic file concatenation
CN102651021A (en) Icon content updating method and device
CN104462590A (en) Information searching method and device
EP3242221A1 (en) Information searching method and apparatus
CN104978341A (en) File processing method and equipment, and network system
US9667696B2 (en) Low latency web-based DICOM viewer system
CN103473264A (en) Note taking and rapid reproducing method applied to literature reading of farmer bookstore
JP6932360B2 (en) Object search method, device and server
CN106899755B (en) Information sharing method, information sharing device and terminal
CN109308336A (en) Data processing method, processing equipment and storage medium
US11281740B2 (en) Information processing device, information processing method, program and storage medium
WO2018208412A1 (en) Detection of caption elements in documents
CN104978339A (en) File processing method and equipment, and network system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20151014

RJ01 Rejection of invention patent application after publication