CN104077323A - Method and device for converting web page content to multimedia messages - Google Patents

Method and device for converting web page content to multimedia messages Download PDF

Info

Publication number
CN104077323A
CN104077323A CN201310108973.2A CN201310108973A CN104077323A CN 104077323 A CN104077323 A CN 104077323A CN 201310108973 A CN201310108973 A CN 201310108973A CN 104077323 A CN104077323 A CN 104077323A
Authority
CN
China
Prior art keywords
web page
page contents
multimedia message
picture
multimedia
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310108973.2A
Other languages
Chinese (zh)
Inventor
程宝平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN201310108973.2A priority Critical patent/CN104077323A/en
Publication of CN104077323A publication Critical patent/CN104077323A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • G06F16/986Document structures and storage, e.g. HTML extensions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/12Messaging; Mailboxes; Announcements

Abstract

The invention discloses a method and device for converting the web page content to multimedia messages. The device comprises an analyzing and extracting module, a content filling module and an insertion generating module. The analyzing and extracting module is used for performing semantic analysis on the web page content and extracting the web page content according to the result of semantic analysis. The content filling module is used for processing the multi-media content in the web page content and filling the web page content into the corresponding format of a multimedia message template. The insertion generating module is used for inserting multimedia message frames into the web page content filled into the corresponding format of the multimedia message template to generate the multimedia messages. According to the method and device for converting the web page content to the multimedia messages, by means of source code semantic analysis, extraction, image compression and layout customization are performed on the browsed web page content, after the multimedia messages are generated according to the multimedia message format protocol, the web page content is sent to mobile phones of friends by sending the multimedia messages, and the web page content sharing with friends is completed.

Description

A kind of method and apparatus of web page contents conversion multimedia message
Technical field
The present invention relates to Internet technical field in the communications field, particularly, the method relating to and device.
Background technology
Internet has become one of main source of people's obtaining information, and the information spinner of magnanimity will present by form web page.
At present, web page contents is shared mode and is mainly contained two large classes: 1) log in sharing of account based on business: by various accounts such as microblogging, immediate information softwares (Fetion, MSN), the cyberspace (website) that content (or synopsis, network linking) is published to oneself is shared with good friend, and good friend can check the content of sharing by access related web page address.2) by short message mode sharing contents: by web page contents title, content short summary or web page interlinkage, by short message mode, issue good friend, good friend can check web page contents by clickthrough.
Realizing in process of the present invention, inventor finds that in prior art, at least there are the following problems:
For first kind scheme, by various accounts such as microblogging, immediate information softwares (Fetion, MSN), the cyberspace (website) that content (or synopsis, network linking) is published to oneself is shared with good friend, and good friend can check the content of sharing by access related web page address.Need to start related software or log in specific website and just can view the content of sharing.
For Equations of The Second Kind scheme, by short message mode sharing contents: by web page contents title, content short summary or web page interlinkage, by short message mode, issue good friend, good friend can check web page contents by clickthrough.Due to note number of words (140 characters, 70 Chinese characters) restriction and can only carry text message, and can not carry picture, therefore, generally can only share title, content makes brief of the introduction and web page interlinkage, and the web page contents that cannot carry.
Multimedia message ability is the basic ability of mobile terminal, energy bearing multimedia content (text, picture, audio frequency etc.), receive the features such as free, can be by multimedia message mode sharing web page content if made, user can receive and check webpage content in full whenever and wherever possible, bring more easily and will experience to user.
If the content of original text is arrived to user's cell-phone customer terminal by the mode of multimedia message, can make up above deficiency, the quantity of information of multimedia message carrying is many and comprehensive after all, also can allow user receive anywhere or anytime and check the information of sharing simultaneously, makes sharing of information more convenient and quick.
Summary of the invention
The present invention shares inconvenient defect in order to overcome web page contents in prior art with other people, according to an aspect of the present invention, propose a kind of method of web page contents conversion multimedia message.
According to the method for the web page contents conversion multimedia message of the embodiment of the present invention, comprising:
Web page contents is carried out to semantic analysis, extract web page contents according to semantic analysis result;
After the content of multimedia in web page contents is processed, web page contents is inserted in the corresponding format of multimedia message template;
The web page contents of inserting in the corresponding format of multimedia message template is inserted to multimedia message frame, generate multimedia message.
The present invention shares inconvenient defect in order to overcome web page contents in prior art with other people, according to another aspect of the present invention, propose a kind of device of web page contents conversion multimedia message.
According to the device of the web page contents conversion multimedia message of the embodiment of the present invention, comprising:
Analyze extraction module, for web page contents is carried out to semantic analysis, extract web page contents according to semantic analysis result;
Content is inserted module, processes for the content of multimedia to web page contents, web page contents is inserted in the corresponding format of multimedia message template;
Insert generation module, for the described web page contents of the corresponding format of inserting multimedia message template is inserted to multimedia message frame, generate multimedia message.
The method and apparatus of web page contents conversion multimedia message of the present invention, by source code semantic analysis, to browsed web page contents extract, picture compression and format customization etc., after the multimedia message of MMS format protocol generation, by sending multimedia message, web page contents is dealt on good friend's mobile phone, completes with good friend's web page contents and share.
Other features and advantages of the present invention will be set forth in the following description, and, partly from instructions, become apparent, or understand by implementing the present invention.Object of the present invention and other advantages can be realized and be obtained by specifically noted structure in write instructions, claims and accompanying drawing.
Below by drawings and Examples, technical scheme of the present invention is described in further detail.
Brief description of the drawings
Accompanying drawing is used to provide a further understanding of the present invention, and forms a part for instructions, for explaining the present invention, is not construed as limiting the invention together with embodiments of the present invention.In the accompanying drawings:
Fig. 1 is multimedia message structural representation in prior art;
Fig. 2 is the apparatus structure schematic diagram of web page contents conversion multimedia message of the present invention.
Embodiment
Below in conjunction with accompanying drawing, the specific embodiment of the present invention is described in detail, but is to be understood that protection scope of the present invention is not subject to the restriction of embodiment.
Web page contents generally adopts html script language development, the issue of web page contents is the web page template based on certain generally, the certain labels of employing such as the title of this web page template to webpage, author, time of origin, illustration, main contents identify, the label semanteme that can analyze source code carries out content extraction, then generates multimedia message according to the content extracting.
The method of web page contents conversion multimedia message of the present invention comprises:
Step 102: newly-built multimedia message bag, extracts web page contents;
There is the web page template of oneself in general portal website, and the source code label of the web page contents of analyzing web site, describes according to the extraction of semantics web page contents of source code label:
Step 1022: extract heading message;
Headline: id=" artibodyTitle ", at this key word first " > " afterwards, " < " before content is heading message.
News source: id=" art_source ",
News briefing time: id=" pub_date ",
News author: id=" media_name ",
Body part: id=" artibody ";
Step 1024: extract picture;
Key word: img_wrapper, after key word src=" ... " for picture address, title=" ... " for picture header, after class=" img_descr " >, the description that is picture of " < " content before, extract picture according to picture address;
Step 1026: extract video/audio;
Key word: flash player begin, after key word href=" ... " if in " video.sina.com.cn " character string, this address is video/audio address, afterwards " video/audio: ... " for video/title, " source: ... " for video/audio content source.
Step 1028: extract text:
Content between <p> and </p> is body matter, and wherein one group of <p></pGreatT.Gre aT.GT represents a paragraph.
Step 104: the content of multimedia in web page contents is processed, being comprised:
Step 1042: picture processing;
A, amendment photo resolution, for example picture width changes 320 pixels into, height uniform zoom;
B, compressed picture size, for example, be compressed to picture size below 30k;
Step 1044: audio frequency processing;
A. audio format conversion: for example audio conversion can be changed into the form that the multimedia messages such as amr are supported;
B. compressed audio size: audio frequency size is compressed to below 30k;
If audio file is too large, be for example greater than 3MB, chained address, heading message and the descriptor of record audio file, using chained address, heading message and descriptor be as the processing of multimedia message body matter.
Step 1046: Video processing;
Chained address, heading message and the descriptor of recording of video file, using chained address, heading message and descriptor be as the processing of multimedia message body matter.
Step 106: web page contents is inserted in the corresponding format of multimedia message template;
As shown in Figure 1, existing multimedia message structure comprises multimedia message head (MMS headers) and multimedia message body (MMS body) two large divisions, its maximum feature is to support multimedia function, can the comprehensive content of propagation function and information, comprise the information of the various forms such as word, image, voice and data.
In step 106, the information such as the title, picture, audio frequency, video and the text that extract in step 102 are inserted respectively in corresponding format.
Step 108: the web page contents of inserting in the corresponding format of multimedia message template is inserted to multimedia message frame, generate multimedia message and issue to user.
If body matter inserts multimedia message Chinese word frame, if illustration inserts multimedia message picture frame, if audio frequency inserts multimedia message audio frame, if video inserts multimedia message frame of video.
Multimedia message size General Requirements, in 300KB, exceedes 300KB, may be split into some multimedia messages, and the essential informations such as title are constant.
The method of web page contents conversion multimedia message of the present invention, by source code semantic analysis, to browsed web page contents extract, picture compression and format customization etc., after the multimedia message of MMS format protocol generation, by sending multimedia message, web page contents is dealt on good friend's mobile phone, completes with good friend's web page contents and share.
As shown in Figure 2, the invention discloses a kind of device of web page contents conversion multimedia message, comprising:
Analyze extraction module 10, for web page contents is carried out to semantic analysis, extract web page contents according to semantic analysis result;
Content is inserted module 20, processes for the content of multimedia to web page contents, web page contents is inserted in the corresponding format of multimedia message template;
Insert generation module 30, for the web page contents of the corresponding format of inserting multimedia message template is inserted to multimedia message frame, generate multimedia message.
Wherein: analyze extraction module 10 and comprise:
Label is analyzed submodule 11, for the source code label of analyzing web page content;
Contents extraction submodule 12, for according to the extraction of semantics web page contents of label.
Wherein: content is inserted module 20 and comprised:
Picture processing submodule 21, for revising photo resolution and compressed picture size;
Audio frequency is processed submodule 22, for changing audio format and compressed audio size;
Video processing submodule 23, for chained address, heading message and the descriptor of recording of video, using chained address, heading message and descriptor be as the processing of multimedia message body matter.
Wherein:
Audio frequency is processed submodule 22, if be greater than setting numerical value specifically for audio file size, chained address, heading message and the descriptor of record audio file, using described chained address, heading message and descriptor as the processing of multimedia message body matter.
Wherein:
Contents extraction submodule 12, specifically for extracting heading message according to headline, news source, news briefing time and news author;
Contents extraction submodule 12, specifically for extracting picture according to key word, picture address, picture header and picture description information.
The device of web page contents conversion multimedia message of the present invention, by source code semantic analysis, to browsed web page contents extract, picture compression and format customization etc., after the multimedia message of MMS format protocol generation, by sending multimedia message, web page contents is dealt on good friend's mobile phone, completes with good friend's web page contents and share.
The present invention can have multiple multi-form embodiment; above taking Fig. 1-Fig. 2 as example is by reference to the accompanying drawings to technical scheme of the present invention explanation for example; this does not also mean that the applied instantiation of the present invention can only be confined in specific flow process or example structure; those of ordinary skill in the art should understand; the specific embodiments that above provided is some examples in multiple its preferred usage, and the embodiment of any embodiment the claims in the present invention all should be within technical solution of the present invention scope required for protection.
Finally it should be noted that: the foregoing is only the preferred embodiments of the present invention, be not limited to the present invention, although the present invention is had been described in detail with reference to previous embodiment, for a person skilled in the art, its technical scheme that still can record aforementioned each embodiment is modified, or part technical characterictic is wherein equal to replacement.Within the spirit and principles in the present invention all, any amendment of doing, be equal to replacement, improvement etc., within all should being included in protection scope of the present invention.

Claims (10)

1. a method for web page contents conversion multimedia message, is characterized in that, comprising:
Web page contents is carried out to semantic analysis, extract web page contents according to described semantic analysis result;
After the content of multimedia in described web page contents is processed, described web page contents is inserted in the corresponding format of multimedia message template;
The described web page contents of inserting in the corresponding format of multimedia message template is inserted to multimedia message frame, generate multimedia message.
2. method according to claim 1, is characterized in that, described web page contents is carried out to semantic analysis, and the step of extracting web page contents according to described semantic analysis result comprises:
Analyze the source code label of described web page contents, according to the extraction of semantics web page contents of described label;
Described web page contents comprises: heading message, pictorial information, audio/video information and text message.
3. method according to claim 1 and 2, is characterized in that, the treatment step of described content of multimedia comprises: picture processing, audio frequency are processed and Video processing;
Described picture processing comprises: amendment photo resolution and compressed picture size;
Described audio frequency processing comprises: conversion audio format and compressed audio size;
Described Video processing comprises: chained address, heading message and the descriptor of recording of video, and using described chained address, heading message and descriptor as the processing of multimedia message body matter.
4. method according to claim 3, it is characterized in that, if described audio file size is greater than setting numerical value, chained address, heading message and the descriptor of record audio file, using described chained address, heading message and descriptor as the processing of multimedia message body matter.
5. method according to claim 2, is characterized in that, the step that described heading message is extracted comprises: extract heading message according to headline, news source, news briefing time and news author;
The step that described picture extracts comprises: extract picture according to key word, picture address, picture header and picture description information.
6. a device for web page contents conversion multimedia message, is characterized in that, comprising:
Analyze extraction module, for web page contents is carried out to semantic analysis, extract web page contents according to described semantic analysis result;
Content is inserted module, processes for the content of multimedia to described web page contents, described web page contents is inserted in the corresponding format of multimedia message template;
Insert generation module, for the described web page contents of the corresponding format of inserting multimedia message template is inserted to multimedia message frame, generate multimedia message.
7. device according to claim 6, is characterized in that, described analysis extraction module comprises:
Label is analyzed submodule, for analyzing the source code label of described web page contents;
Contents extraction submodule, for according to the extraction of semantics web page contents of described label.
8. according to the device described in claim 6 or 7, it is characterized in that, described content is inserted module and is comprised:
Picture processing submodule, for revising photo resolution and compressed picture size;
Audio frequency is processed submodule, for changing audio format and compressed audio size;
Video processing submodule, for chained address, heading message and the descriptor of recording of video, using described chained address, heading message and descriptor as the processing of multimedia message body matter.
9. device according to claim 8, is characterized in that,
Described audio frequency is processed submodule, if be greater than setting numerical value specifically for described audio file size, chained address, heading message and the descriptor of record audio file, using described chained address, heading message and descriptor as the processing of multimedia message body matter.
10. device according to claim 7, is characterized in that,
Described contents extraction submodule, specifically for extracting heading message according to headline, news source, news briefing time and news author;
Described contents extraction submodule, specifically for extracting picture according to key word, picture address, picture header and picture description information.
CN201310108973.2A 2013-03-29 2013-03-29 Method and device for converting web page content to multimedia messages Pending CN104077323A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310108973.2A CN104077323A (en) 2013-03-29 2013-03-29 Method and device for converting web page content to multimedia messages

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310108973.2A CN104077323A (en) 2013-03-29 2013-03-29 Method and device for converting web page content to multimedia messages

Publications (1)

Publication Number Publication Date
CN104077323A true CN104077323A (en) 2014-10-01

Family

ID=51598582

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310108973.2A Pending CN104077323A (en) 2013-03-29 2013-03-29 Method and device for converting web page content to multimedia messages

Country Status (1)

Country Link
CN (1) CN104077323A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104850588A (en) * 2015-04-24 2015-08-19 深圳市梦网科技股份有限公司 Method and system for generating and publishing media content
CN106533926A (en) * 2016-12-27 2017-03-22 武汉斗鱼网络科技有限公司 Webpage information dissemination method and device
CN106815316A (en) * 2016-12-23 2017-06-09 北京奇虎科技有限公司 Method, device and mobile terminal that content of pages is shared
CN107562799A (en) * 2017-08-04 2018-01-09 海南智媒云图科技股份有限公司 A kind of content reprints the method and device shared
CN109408757A (en) * 2018-09-21 2019-03-01 广州神马移动信息科技有限公司 Question and answer content share method, device, terminal device and computer storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090037845A1 (en) * 2007-08-03 2009-02-05 Tzu-Han Kao Method and System for Editing Web Data
CN101552829A (en) * 2008-03-31 2009-10-07 比亚迪股份有限公司 Method, system and information terminal for editing multimedia message
CN101945346A (en) * 2009-07-06 2011-01-12 北京亿阳信通软件研究院有限公司 Method and device for automatically creating multimedia message
CN102682105A (en) * 2012-05-04 2012-09-19 高凌 System and method for identifying and acquiring related web page information by using mobile terminal

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090037845A1 (en) * 2007-08-03 2009-02-05 Tzu-Han Kao Method and System for Editing Web Data
CN101552829A (en) * 2008-03-31 2009-10-07 比亚迪股份有限公司 Method, system and information terminal for editing multimedia message
CN101945346A (en) * 2009-07-06 2011-01-12 北京亿阳信通软件研究院有限公司 Method and device for automatically creating multimedia message
CN102682105A (en) * 2012-05-04 2012-09-19 高凌 System and method for identifying and acquiring related web page information by using mobile terminal

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104850588A (en) * 2015-04-24 2015-08-19 深圳市梦网科技股份有限公司 Method and system for generating and publishing media content
CN106815316A (en) * 2016-12-23 2017-06-09 北京奇虎科技有限公司 Method, device and mobile terminal that content of pages is shared
CN106533926A (en) * 2016-12-27 2017-03-22 武汉斗鱼网络科技有限公司 Webpage information dissemination method and device
CN107562799A (en) * 2017-08-04 2018-01-09 海南智媒云图科技股份有限公司 A kind of content reprints the method and device shared
CN109408757A (en) * 2018-09-21 2019-03-01 广州神马移动信息科技有限公司 Question and answer content share method, device, terminal device and computer storage medium

Similar Documents

Publication Publication Date Title
US20220171915A1 (en) Automated augmentation of text, web and physical environments using multimedia content
KR100490734B1 (en) Annotation-based automatic document generation apparatus and method
CN102254550B (en) Method and system for reading characters on webpage
EP2687997A1 (en) Method for rearranging web page
TWI519979B (en) Information recommendation method and device thereof and information resource recommendation system
CN106897251B (en) Rich text display method and device
TWI592807B (en) Method and device for web style address merge
CN104346322A (en) Document format processing device and document format processing method
CN104077323A (en) Method and device for converting web page content to multimedia messages
CN104516892A (en) Distribution method, system and terminal of user generated content associated with rich media information
JP2009064442A (en) Mobile web service system and method
CN102779167A (en) Method and system for displaying webpage in mobile terminal
CN105808587B (en) method, gateway equipment and system for embedding information in webpage
CN106547511A (en) A kind of voice broadcasts method, browser client and the server of reading web page information
CN105094775A (en) Webpage generation method and apparatus
CN112016290A (en) Automatic document typesetting method, device, equipment and storage medium
CN112487763A (en) SVG-based OFD file online display method, server side and system
CN111625308B (en) Information display method and device and electronic equipment
CN104426863B (en) A kind of page request method, page request device, transfer server and terminal
CN102693237B (en) Webpage content adaptation and encapsulation system and method
CN103885988B (en) Export method and device, the content output system of content
CN103139227B (en) A kind of application data transmission system and method being applied to mobile terminal
CN102209086A (en) Method, device and system for accessing Internet
CN104216868A (en) Adaptation method and device for document display format
JP2007219763A (en) Diary server and diary system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20141001

RJ01 Rejection of invention patent application after publication