CN101247603B - Multi-layer anchor point extraction method and device - Google Patents

Multi-layer anchor point extraction method and device

Publication number
CN101247603B
CN101247603B CN2008100840935A CN200810084093A CN101247603B CN 101247603 B CN101247603 B CN 101247603B CN 2008100840935 A CN2008100840935 A CN 2008100840935A CN 200810084093 A CN200810084093 A CN 200810084093A CN 101247603 B CN101247603 B CN 101247603B
Authority
CN
China
Prior art keywords
information
chained
chained list
lists
extracting
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2008100840935A
Other languages
Chinese (zh)
Other versions
CN101247603A (en)
Inventor
陈斌
蒋敏
薛丹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp
Priority to CN2008100840935A
Publication of CN101247603A
Application granted
Publication of CN101247603B
Expired - Fee Related
Anticipated expiration

Abstract

The invention discloses a multi-layer anchor point extraction method and device. The method comprises the following steps: step 1, scanning a short message text and extracting information according to a predetermined rule; step 2, creating one or more linked lists according to the extracted information, and storing the information in the one or more linked lists; and step 3, judging whether items of the information overlap, and integrating the one or more linked lists into at least one linked list according to the judgment result and a predetermined integration rule. The invention provides an effective way to extract telephone numbers, Email addresses, and URL addresses from short message content simply, quickly, and comprehensively.

Description

Multi-layer anchor point extraction method and device
Technical field
The present invention relates to the field of mobile phones, and in particular to a multi-layer anchor point extraction technique for short messages (SMS).
Background art
At present, most mobile phone users send short messages. The Short Message Service (SMS) is timely, convenient, and flexible; it can convey feelings and ideas, deliver notifications, and keep a record of important information, so it is very popular. Users accustomed to short messages know that they can easily save the sender's phone number, or a telephone number contained in the message, to the phone's address book; save an Email address contained in the message or send Email to it; and quickly connect to a URL address contained in the message or save it as a bookmark.
Extraction of telephone numbers, Email addresses, and URL addresses based on anchor points can usually be divided into three parts: content extraction, focus highlighting, and use of the focused item.
Content extraction is the core of anchor point extraction. It scans all the characters of the short message and, following the syntax rules for each content type (telephone number, Email address, and URL address), extracts the valid strings and stores the extraction results in some form. The syntax rules draw on the RFC standards: RFC 822, Standard for the Format of ARPA Internet Text Messages, which specifies the standard format of Email, and RFC 2396, Uniform Resource Identifiers (URI): Generic Syntax, which specifies uniform resource identifiers.
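The scanning step described above can be sketched in Python as follows. This is a minimal illustration rather than the method of any particular handset: the `Extraction` tuple, the `PATTERNS` table, and the `extract` function are hypothetical names, and the regular expressions are deliberately simplified stand-ins for the RFC 822 and RFC 2396 grammars.

```python
import re
from typing import NamedTuple


class Extraction(NamedTuple):
    kind: str      # "phone", "email" or "url"
    start: int     # offset of the first character in the message
    length: int    # number of characters
    content: str   # the matched string


# Simplified, hypothetical patterns; the RFC grammars are far more permissive.
PATTERNS = {
    "phone": re.compile(r"\+?\d{7,15}"),
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+"),
    "url": re.compile(r'(?:https?://|www\.)[^\s<>"]+'),
}


def extract(text: str, kind: str) -> list[Extraction]:
    """Scan the whole message for one content type, in order of appearance."""
    return [
        Extraction(kind, m.start(), m.end() - m.start(), m.group())
        for m in PATTERNS[kind].finditer(text)
    ]
```

Each content type is scanned independently, so on a message such as `see http://example.com/12345678` the phone pattern also matches the digits inside the URL. Independent scanning is what makes overlapping extractions possible in the first place.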
Focus highlighting works from the stored extraction results. In the phone's message reading interface, it determines the start and end positions of each piece of content (telephone number, Email address, or URL address), then focuses on and highlights that content so that the user can choose whether and how to operate on it. It must also support switching among multiple selectable items: if a short message contains several telephone numbers, Email addresses, and URL addresses, the user can operate on each of them, generally moving forward and backward through them in order with the direction keys.
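As a rough illustration of how a stored start position and length locate the focused content, the following Python sketch marks the focused span with square brackets. The `highlight` function and the bracket notation are assumptions made for this example; a real handset would change the display attributes of the text instead.

```python
def highlight(text: str, start: int, length: int) -> str:
    """Return the message with the focused span wrapped in brackets."""
    end = start + length  # end position is exclusive
    return text[:start] + "[" + text[start:end] + "]" + text[end:]
```

For instance, `highlight("Call 13812345678 now", 5, 11)` marks the eleven-digit number that begins at offset 5.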
Use of the focused item means that the phone responds, through the platform's menu functions, to events triggered by the user's key presses, and so carries out concrete operations on the focused content. The user calls up the available operations with a function key and selects one with the direction keys: for a telephone number, calling, saving, or sending a short message or multimedia message; for an Email address, saving or sending mail; for a URL address, connecting or saving a bookmark.
Most current mobile phones implement anchor point extraction, but the results are unsatisfactory. The main reasons are that the extraction rules are interpreted too narrowly, there is no fault-tolerance mechanism, and the extraction is not intelligent enough. Short message content is edited by users, and each user has different editing habits. In particular, when a short message contains telephone numbers, Email addresses, and URL addresses at the same time, extracted items may overlap or intersect, either across the three types or between two items of the same type. For example, a URL may contain a digit string that resembles a telephone number, and an Email address may contain a suffix that resembles a URL address.
Therefore, a multi-layer anchor point extraction solution is needed that solves the above problems in the related art.
Summary of the invention
The present invention aims to add a fault-tolerance mechanism to anchor point extraction, to make intelligent judgments about content whose extractions intersect, and to extract as much content as possible for the user to choose from.
According to one aspect of the present invention, a multi-layer anchor point extraction method is provided, comprising the following steps: step 1, scanning a short message text and extracting information according to a predetermined extraction rule; step 2, creating one or more linked lists according to the type of the extracted information, and storing the information in the one or more linked lists; and step 3, judging whether items of the information overlap, and integrating the one or more linked lists into at least one linked list according to the judgment result and a predetermined integration rule, wherein if the items are judged not to overlap, the one or more linked lists are integrated into at least one linked list according to the predetermined integration rule, and otherwise the one or more linked lists are integrated into more than one linked list according to the predetermined integration rule.
Step 3 further comprises: storing items of information that overlap each other in different linked lists among the more than one linked list; and determining, according to a predetermined algorithm, whether an extracted item is an invalid extraction and, if so, deleting it.
The method further comprises: determining the display order of the at least one linked list; and displaying the extracted information stored in the at least one linked list in the determined order.
The predetermined extraction rule includes a fault-tolerance mechanism. The information includes address information, and the types of address information include telephone numbers, Email addresses, and URL addresses.
According to another aspect of the present invention, a multi-layer anchor point extraction device is provided, comprising: an extraction module, configured to scan a short message text and extract information according to a predetermined extraction rule; a linked list creation and storage module, configured to create one or more linked lists according to the type of the extracted information and to store the information in the one or more linked lists; and a judgment and integration module, configured to judge whether items of the information overlap and to integrate the one or more linked lists into at least one linked list according to the judgment result and a predetermined integration rule, wherein if the items are judged not to overlap, the one or more linked lists are integrated into at least one linked list according to the predetermined integration rule, and otherwise the one or more linked lists are integrated into more than one linked list according to the predetermined integration rule.
The linked list creation and storage module comprises: a linked list creation unit, configured to create one or more linked lists according to the type of the extracted information; and a storage unit, configured to store the extracted information in the one or more linked lists.
The judgment and integration module comprises: a judgment unit, configured to judge whether items of the information overlap; and an integration unit, configured to integrate the one or more linked lists into at least one linked list according to the judgment result and the predetermined integration rule.
The present invention provides an effective way to extract telephone numbers, Email addresses, and URL addresses from short message content simply, quickly, comprehensively, and flexibly.
Other features and advantages of the present invention will be set forth in the description that follows, and in part will become apparent from the description or be understood by practicing the invention. The objects and other advantages of the invention can be realized and obtained through the structures particularly pointed out in the written description, the claims, and the accompanying drawings.
Description of the drawings
The accompanying drawings described here provide a further understanding of the present invention and form part of this application. The illustrative embodiments of the invention and their descriptions are used to explain the invention and do not unduly limit it. In the drawings:
Fig. 1 shows a flow chart of a multi-layer anchor point extraction method according to an embodiment of the present invention;
Fig. 2 shows a block diagram of a multi-layer anchor point extraction device according to an embodiment of the present invention; and
Fig. 3 shows a flow chart of a multi-layer anchor point extraction method according to another embodiment of the present invention.
Embodiments
Embodiments of the present invention are described in detail below with reference to the accompanying drawings.
Fig. 1 shows a flow chart of the multi-layer anchor point extraction method according to an embodiment of the present invention. Referring to Fig. 1, the method comprises the following steps: step S102, scanning a short message text and extracting information according to a predetermined rule; step S104, creating one or more linked lists according to the type of the extracted information, and storing the information in the one or more linked lists; and step S106, judging whether items of the information overlap, and integrating the one or more linked lists into at least one linked list according to the judgment result and a predetermined integration rule.
Step S106 comprises: if the items of information are judged not to overlap, integrating the one or more linked lists into at least one linked list according to the predetermined integration rule; otherwise, integrating the one or more linked lists into more than one linked list according to the predetermined integration rule.
Step S106 further comprises: storing items of information that overlap each other in different linked lists among the more than one linked list; and determining, according to a predetermined algorithm, whether an extracted item is an invalid extraction and, if so, deleting it.
The method further comprises: determining the display order of the at least one linked list; and displaying the extracted information stored in the at least one linked list in the determined order.
The predetermined extraction rule includes a fault-tolerance mechanism. The information includes address information, and the types of address information include telephone numbers, Email addresses, and URL addresses.
Fig. 2 shows a block diagram of the multi-layer anchor point extraction device according to an embodiment of the present invention. Referring to Fig. 2, the multi-layer anchor point extraction device 200 comprises: an extraction module 202, configured to scan a short message text and extract information according to a predetermined rule; a linked list creation and storage module 204, configured to create one or more linked lists according to the type of the extracted information and to store the information in the one or more linked lists; and a judgment and integration module 206, configured to judge whether items of the information overlap and to integrate the one or more linked lists into at least one linked list according to the judgment result and a predetermined integration rule.
The linked list creation and storage module 204 comprises: a linked list creation unit, configured to create one or more linked lists according to the type of the extracted information; and a storage unit, configured to store the extracted information in the one or more linked lists.
The judgment and integration module 206 comprises: a judgment unit, configured to judge whether items of the information overlap; and an integration unit, configured to integrate the one or more linked lists into at least one linked list according to the judgment result and the predetermined integration rule.
Another embodiment of the present invention is described in detail below with reference to Fig. 3.
The multi-layer anchor point extraction method of this embodiment comprises the following steps:
Step S302: define the extraction rules, valid characters, and character fields for telephone numbers, Email addresses, and URL addresses (with reference to the RFC standards described above);
Step S304: scan the whole short message text separately according to the syntax of telephone numbers, Email addresses, and URL addresses, extract the corresponding content, and generate the corresponding linked lists;
Step S306: integrate the three generated linked lists, and delete extractions that can be identified as invalid according to the integration rules;
Step S308: determine and assemble the layered display structure according to the intersection structure of the linked lists; and
Step S310: manage and control the use-highlight option in the menu.
This embodiment mainly comprises two parts: first, extracting telephone numbers, Email addresses, and URL addresses and generating the corresponding linked lists; second, integrating the linked lists so that they are combined sensibly and displayed to the user.
In this embodiment, extracted content is stored in singly linked lists, and the three types of content are extracted separately. Extraction mainly proceeds by scanning character by character and testing against the corresponding syntax. When a character string is confirmed as content to be extracted, a new node is created in the linked list; the node holds information such as the type, start position, length, and content. The head node of the list holds information about the whole list, such as the number of nodes, the first node, and the last node. The nodes in each list are ordered by where their start positions occur in the short message, which makes it easier to integrate the lists and to move the focus in order with the direction keys.
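The node and head-node layout described above might look like the following Python sketch. The class and field names (`Node`, `AnchorList`, `append`) are assumptions chosen for illustration; the source specifies only which pieces of information each node and the head node hold.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    kind: str                       # "phone", "email" or "url"
    start: int                      # start position in the short message
    length: int                     # number of characters
    content: str                    # the extracted string
    next: Optional["Node"] = None   # link to the following node


@dataclass
class AnchorList:
    """Head node: holds the node count plus the first and last nodes."""
    count: int = 0
    first: Optional[Node] = None
    last: Optional[Node] = None

    def append(self, node: Node) -> None:
        # The scanner reads left to right, so appending keeps start order.
        if self.last is None:
            self.first = node
        else:
            self.last.next = node
        self.last = node
        self.count += 1

    def __iter__(self):
        node = self.first
        while node is not None:
            yield node
            node = node.next
```

Keeping a reference to the last node in the head makes each append constant-time, which suits a single left-to-right scan of the message.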
Linked list integration is designed mainly for cases where content intersects across different linked lists or within the same list. That is, after extraction, a telephone number may overlap a URL address; likewise, a telephone number may overlap an Email address, and an Email address may overlap a URL address. For example, suppose an 8-digit string appears inside a URL address. Telephone number extraction extracts this 8-digit string as a telephone number, while URL extraction extracts the whole string containing it as a URL address, so the generated telephone number list overlaps the URL list. Most current mobile phones choose between the two by a fixed priority; for example, if URL extraction takes priority over telephone numbers by default, the 8-digit string is treated only as part of the URL and is not usable as a telephone number. Such extraction clearly cannot meet the user's needs, because the user may want to save the 8-digit string as a telephone number. The integration used in this system instead integrates the three extracted linked lists according to designed integration rules and, depending on the structure of the lists, finally generates one or two linked lists. These contain all extracted content that may be useful to the user, and within each list no nodes overlap or intersect in position.
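One possible integration rule can be sketched in Python as below. The source does not spell out its integration rules, so everything here is an assumption made for illustration: spans are plain `(kind, start, length)` tuples rather than linked-list nodes, spans are placed greedily in start order, and a span that fits in neither output list is dropped.

```python
from typing import List, Tuple

Span = Tuple[str, int, int]  # (kind, start, length)


def overlaps(a: Span, b: Span) -> bool:
    """True if the half-open ranges [start, start + length) intersect."""
    return a[1] < b[1] + b[2] and b[1] < a[1] + a[2]


def integrate(*lists: List[Span]) -> List[List[Span]]:
    """Merge per-type lists into one or two lists with no internal overlap."""
    merged = sorted((s for lst in lists for s in lst),
                    key=lambda s: (s[1], -s[2]))  # by start, longer first
    primary: List[Span] = []
    secondary: List[Span] = []
    for span in merged:
        for target in (primary, secondary):
            # Place the span in the first list where it overlaps nothing.
            if not any(overlaps(span, other) for other in target):
                target.append(span)
                break
        # Assumed rule: a span that fits in neither list is dropped.
    return [primary, secondary] if secondary else [primary]
```

With a URL at offset 4 of length 27 and a phone-like digit string at offset 23 of length 8 inside it, the URL lands in the first list and the digit string in the second, so both remain available to the user. When nothing overlaps, only one list is returned.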
The use-highlight option in the menu requires menu items to be added according to the number of linked lists finally generated. With one linked list, the content is highlighted in the order in which it appears in the list; the direction keys move the selection forward or backward, and the user can perform the use-highlight operation on any item of interest. With two linked lists, the content is first highlighted in the order of the first list. If the user browses the whole short message text with the direction keys without finding the content he or she wants to focus on, the user can select the second linked list through the menu and browse it in the same way as the first: items are highlighted in order, the direction keys move forward or backward, and the use-highlight operation can be performed on any item of interest.
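The browsing behaviour described above can be modelled with a small cursor object, sketched here in Python. The `FocusCursor` class and its method names are hypothetical, each list is assumed to be non-empty, and stopping at the ends of a list (rather than wrapping around) is an assumption, since the source does not say what happens there.

```python
class FocusCursor:
    """Tracks which extracted item is highlighted across one or two lists."""

    def __init__(self, lists):
        self.lists = lists   # one or two non-empty lists, each in message order
        self.active = 0      # index of the list being browsed
        self.index = 0       # index of the focused item in that list

    def current(self):
        return self.lists[self.active][self.index]

    def move(self, step):
        """Direction key: step is +1 (forward) or -1 (backward)."""
        last = len(self.lists[self.active]) - 1
        # Assumed behaviour: stay on the end item instead of wrapping.
        self.index = max(0, min(last, self.index + step))
        return self.current()

    def switch_list(self):
        """Menu item: browse the other list, if any, from its first item."""
        if len(self.lists) > 1:
            self.active = 1 - self.active
            self.index = 0
        return self.current()
```

The menu item that calls `switch_list` only needs to be offered when integration produced two lists, which matches the statement that menu items depend on the number of lists generated.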
In summary, this embodiment has two main parts: extracting content to generate linked lists, and integrating the linked lists. Content extraction scans the short message content and uses the RFC standards to extract telephone numbers, Email addresses, and URL addresses; linked list integration applies accurate fault-tolerant processing to the extracted content and ensures that overlapping content is displayed in full. With these techniques, the present invention extracts telephone numbers, Email addresses, and URL addresses from short messages quickly, comprehensively, and flexibly, and displays all extracted content sensibly in the message reading interface, which greatly simplifies the user's operations on the extracted content.
The present invention provides an effective way to extract telephone numbers, Email addresses, and URL addresses from short message content simply, quickly, comprehensively, and flexibly.
The above are merely preferred embodiments of the present invention and are not intended to limit it. For those skilled in the art, the present invention may have various modifications and variations. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (10)

1. A multi-layer anchor point extraction method, characterized by comprising the following steps:
Step 1: scanning a short message text and extracting information according to a predetermined extraction rule;
Step 2: creating one or more linked lists according to the type of the extracted information, and storing the information in the one or more linked lists; and
Step 3: judging whether items of the information overlap, and integrating the one or more linked lists into at least one linked list according to the judgment result and a predetermined integration rule, wherein if the items of the information are judged not to overlap, the one or more linked lists are integrated into at least one linked list according to the predetermined integration rule, and otherwise the one or more linked lists are integrated into more than one linked list according to the predetermined integration rule.
2. The method according to claim 1, characterized in that step 3 further comprises:
storing items of the information that overlap each other in different linked lists among the more than one linked list.
3. The method according to claim 2, characterized in that step 3 further comprises:
determining, according to a predetermined algorithm, whether an extracted item of the information is an invalid extraction and, if so, deleting it.
4. The method according to claim 3, characterized by further comprising:
determining the display order of the at least one linked list; and
displaying the extracted information stored in the at least one linked list in the determined order.
5. The method according to any one of claims 1 to 4, characterized in that the predetermined extraction rule includes a fault-tolerance mechanism.
6. The method according to any one of claims 1 to 4, characterized in that the information includes address information.
7. The method according to claim 6, characterized in that the types of the address information include: telephone numbers, Email addresses, and URL addresses.
8. A multi-layer anchor point extraction device, characterized by comprising:
an extraction module, configured to scan a short message text and extract information according to a predetermined extraction rule;
a linked list creation and storage module, configured to create one or more linked lists according to the type of the extracted information, and to store the information in the one or more linked lists; and
a judgment and integration module, configured to judge whether items of the information overlap, and to integrate the one or more linked lists into at least one linked list according to the judgment result and a predetermined integration rule, wherein if the items of the information are judged not to overlap, the one or more linked lists are integrated into at least one linked list according to the predetermined integration rule, and otherwise the one or more linked lists are integrated into more than one linked list according to the predetermined integration rule.
9. The device according to claim 8, characterized in that the linked list creation and storage module comprises:
a linked list creation unit, configured to create one or more linked lists according to the type of the extracted information; and
a storage unit, configured to store the extracted information in the one or more linked lists.
10. The device according to claim 9, characterized in that the judgment and integration module comprises:
a judgment unit, configured to judge whether items of the information overlap; and
an integration unit, configured to integrate the one or more linked lists into at least one linked list according to the judgment result and the predetermined integration rule.
CN2008100840935A 2008-03-26 2008-03-26 Multi-layer anchor point extraction method and device Expired - Fee Related CN101247603B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2008100840935A CN101247603B (en) 2008-03-26 2008-03-26 Multi-layer anchor point extraction method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2008100840935A CN101247603B (en) 2008-03-26 2008-03-26 Multi-layer anchor point extraction method and device

Publications (2)

Publication Number Publication Date
CN101247603A CN101247603A (en) 2008-08-20
CN101247603B true CN101247603B (en) 2012-04-04

Family

ID=39947750

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2008100840935A Expired - Fee Related CN101247603B (en) 2008-03-26 2008-03-26 Multi-layer anchor point extraction method and device

Country Status (1)

Country Link
CN (1) CN101247603B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106597049A (en) * 2016-12-15 2017-04-26 电子科技大学 Multi-waveform envelope extraction method based on chain table array

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101741756B (en) * 2008-11-19 2012-09-26 中兴通讯股份有限公司 Method and system for converting special character strings in instant communication text message

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1377482A (en) * 1999-10-05 2002-10-30 株式会社建伍 Mobile communication terminal
CN1622563A (en) * 2003-11-28 2005-06-01 乐金电子(中国)研究开发中心有限公司 Method for extracting specific information from short massage
WO2005074213A1 (en) * 2004-01-20 2005-08-11 Cloudmark, Inc. Method and system for url-based screening of electronic communications
WO2006135205A1 (en) * 2005-06-15 2006-12-21 Sk Telecom Co., Ltd. Method and mobile communication terminal for providing function of hyperlink telephone number including short message service

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106597049A (en) * 2016-12-15 2017-04-26 电子科技大学 Multi-waveform envelope extraction method based on chain table array
CN106597049B (en) * 2016-12-15 2019-01-25 电子科技大学 Several waveform envelope extracting methods based on linked list array

Also Published As

Publication number Publication date
CN101247603A (en) 2008-08-20

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20120404

Termination date: 20170326
