WO2018006256A1 - Local mail data collection method and system - Google Patents

Local mail data collection method and system

Publication number
WO2018006256A1
Authority
WO
WIPO (PCT)
Prior art keywords
mail data
collected
local
keyword
local mail
Prior art date
Application number
PCT/CN2016/088501
Other languages
French (fr)
Chinese (zh)
Inventor
马岩
Original Assignee
马岩
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2016-07-05
Filing date
2016-07-05
Publication date
2018-01-11
Application filed by 马岩
Priority to PCT/CN2016/088501
Publication of WO2018006256A1

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

A local mail data collection method and system. The method comprises the following steps: acquiring a local mail data category requiring collection (S101); fetching, from local mail data, collection mail data having an identical category to said category (S102); and acquiring a keyword of the collection mail data, and storing, according to the keyword, the collection mail data separately (S103). The method of the present invention enables collection of local mail data.

Description

Local mail data collection method and system
Technical field
The present invention relates to the field of communications, and in particular to a method and system for collecting local mail data.
Background
The Internet is a global network of computers that communicate with one another in a common language; that is, an international computer network composed of wide area networks, local area networks, and standalone machines connected according to certain communication protocols. The Internet is a carrier of public information, and this mass medium is faster than any earlier communication medium. The mail data on the Internet is massive. How to collect from this massive mail data is a research direction, and the prior art cannot collect local mail data.
Technical problem
The present application provides a method for collecting local mail data. It addresses the shortcoming that prior-art technical solutions cannot collect local mail data.
Technical solution
In one aspect, a method for collecting local mail data is provided, the method comprising the following steps:
acquiring the category of the local mail data that needs to be collected;
fetching, from the local mail data, the collected mail data whose category is the same as that category;
acquiring a keyword of the collected mail data, and storing the collected mail data separately according to the keyword.
Optionally, the method further includes:
naming the storage folder by category and keyword.
Optionally, the method further includes:
counting the number of times the same keyword occurs.
In a second aspect, a system for collecting local mail data is provided, the system comprising:
an obtaining unit, configured to acquire the category of the local mail data that needs to be collected;
a crawling unit, configured to fetch, from the local mail data, the collected mail data whose category is the same as that category;
a classification unit, configured to acquire a keyword of the collected mail data and to store the collected mail data separately according to the keyword.
Optionally, the system further includes:
a naming unit, configured to name the storage folder by category and keyword.
Optionally, the system further includes:
a statistics unit, configured to count the number of times the same keyword occurs.
Beneficial effect
The technical solution provided by the present invention acquires the category of the local mail data that needs to be collected, fetches from the local mail data the collected mail data whose category is the same as that category, acquires a keyword of the collected mail data, and stores the collected mail data separately according to the keyword; it therefore has the advantage of collecting local mail data effectively.
Brief description of the drawings
To explain the technical solutions of the embodiments of the present invention more clearly, the drawings used in describing the embodiments are briefly introduced below. The drawings described below show some embodiments of the present invention, and a person of ordinary skill in the art can obtain other drawings from them without creative effort.
FIG. 1 is a flowchart of a method for collecting local mail data according to a first preferred embodiment of the present invention;
FIG. 2 is a structural diagram of a system for collecting local mail data according to a second preferred embodiment of the present invention.
Embodiments of the invention
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the scope of protection of the present invention.
Referring to FIG. 1, FIG. 1 shows a method for collecting local mail data according to a first preferred embodiment of the present invention. As shown in FIG. 1, the method comprises the following steps:
Step S101: acquire the category of the local mail data that needs to be collected;
Step S102: fetch, from the local mail data, the collected mail data whose category is the same as that category;
Step S103: acquire a keyword of the collected mail data, and store the collected mail data separately according to the keyword.
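Steps S101 to S103 can be illustrated with a minimal in-memory sketch. The specification does not say how a message's category is determined or how a keyword is extracted, so everything below is an assumption made for illustration: the `Mail` record, its `category` field, the list of candidate keywords, and the case-insensitive substring match are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Mail:
    """Hypothetical in-memory view of one locally stored message."""
    category: str  # assumed label, e.g. "work" or "billing"
    subject: str
    body: str


def fetch_by_category(local_mail, category):
    """S102: keep only the messages whose category equals the requested one."""
    return [m for m in local_mail if m.category == category]


def extract_keyword(mail, candidates):
    """S103, first half: return the first candidate keyword found in the mail.

    The case-insensitive substring test is an assumption; the specification
    does not define keyword extraction. Returns None when nothing matches.
    """
    text = f"{mail.subject} {mail.body}".lower()
    for kw in candidates:
        if kw.lower() in text:
            return kw
    return None


def collect(local_mail, category, candidates):
    """Run S101 to S103 and return {keyword: [mails]}.

    The `category` argument plays the role of S101 (the category to collect).
    """
    stored = defaultdict(list)
    for mail in fetch_by_category(local_mail, category):  # S102
        keyword = extract_keyword(mail, candidates)       # S103: acquire keyword
        if keyword is not None:
            stored[keyword].append(mail)                  # S103: store separately
    return dict(stored)
```

Keeping each keyword's messages in its own list stands in for the "separate storage" of S103; a real system would presumably write to separate folders or tables instead.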
The technical solution provided by the present invention acquires the category of the local mail data that needs to be collected, fetches from the local mail data the collected mail data whose category is the same as that category, acquires a keyword of the collected mail data, and stores the collected mail data separately according to the keyword; it therefore has the advantage of collecting local mail data effectively.
Optionally, after step S103 the method may further include:
naming the storage folder by category and keyword.
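One way to name a storage folder by category and keyword is sketched below. The `<category>_<keyword>` pattern, the character filter, and the helper names `folder_name` and `make_folder` are assumptions; the specification only states that the folder is named with both values.

```python
import re
from pathlib import Path


def folder_name(category: str, keyword: str) -> str:
    """Build a folder name from a category and a keyword.

    The "<category>_<keyword>" pattern is an assumption for illustration.
    """
    def clean(part: str) -> str:
        # Replace runs of whitespace and characters that are unsafe in file
        # names on common systems; fall back to "unnamed" for empty input.
        return re.sub(r'[\\/:*?"<>|\s]+', "-", part.strip()) or "unnamed"

    return f"{clean(category)}_{clean(keyword)}"


def make_folder(root: Path, category: str, keyword: str) -> Path:
    """Create (if needed) and return the folder for one category/keyword pair."""
    path = root / folder_name(category, keyword)
    path.mkdir(parents=True, exist_ok=True)
    return path
```

For example, `folder_name("work", "Q1 report")` yields `work_Q1-report`, so each category/keyword pair maps to one predictable directory.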
Optionally, after step S103 the method may further include:
counting the number of times the same keyword occurs.
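Counting how often the same keyword occurs can be sketched with the standard library's `collections.Counter`. Treating keywords as case-insensitive is an assumption, and the helper name `count_keywords` is hypothetical.

```python
from collections import Counter


def count_keywords(keywords):
    """Count how many times each keyword occurs across the collected mail.

    `keywords` is an iterable with one entry per occurrence. Lower-casing
    before counting is an assumption, not something the specification states.
    """
    return Counter(kw.lower() for kw in keywords)
```

A `Counter` returns 0 for keywords it has never seen, so callers can query any keyword without first checking whether it was collected.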
Referring to FIG. 2, FIG. 2 shows a system for collecting local mail data according to a second preferred embodiment of the present invention. The system includes:
an obtaining unit 201, configured to acquire the category of the local mail data that needs to be collected;
a crawling unit 202, configured to fetch, from the local mail data, the collected mail data whose category is the same as that category;
a classification unit 203, configured to acquire a keyword of the collected mail data and to store the collected mail data separately according to the keyword.
The technical solution provided by the present invention acquires the category of the local mail data that needs to be collected, fetches from the local mail data the collected mail data whose category is the same as that category, acquires a keyword of the collected mail data, and stores the collected mail data separately according to the keyword; it therefore has the advantage of collecting local mail data effectively.
Optionally, the system may further include:
a naming unit 204, configured to name the storage folder by category and keyword.
Optionally, the system may further include:
a statistics unit 205, configured to count the number of times the same keyword occurs.
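The unit structure of FIG. 2 could be mirrored in software roughly as follows. This is only a sketch: the class and method names, the dictionary-based mail records with `category` and `text` keys, and the choice to count messages per keyword are all assumptions, since the specification describes the units only by their function.

```python
from collections import Counter, defaultdict


class ObtainingUnit:  # unit 201
    def __init__(self, category):
        self._category = category

    def get_category(self):
        return self._category


class CrawlingUnit:  # unit 202
    def fetch(self, local_mail, category):
        # Keep messages whose (assumed) "category" field matches.
        return [m for m in local_mail if m["category"] == category]


class ClassificationUnit:  # unit 203
    def __init__(self, candidates):
        self._candidates = candidates  # hypothetical candidate keywords

    def classify(self, mails):
        stored = defaultdict(list)
        for m in mails:
            text = m["text"].lower()
            for kw in self._candidates:
                if kw.lower() in text:
                    stored[kw].append(m)  # one bucket per keyword
                    break
        return dict(stored)


class NamingUnit:  # unit 204, optional
    def name(self, category, keyword):
        return f"{category}_{keyword}"


class StatisticsUnit:  # unit 205, optional
    def count(self, stored):
        # Assumption: one occurrence per stored message.
        return Counter({kw: len(mails) for kw, mails in stored.items()})


class CollectionSystem:
    def __init__(self, obtaining, crawling, classification,
                 naming=None, statistics=None):
        self.obtaining = obtaining
        self.crawling = crawling
        self.classification = classification
        self.naming = naming
        self.statistics = statistics

    def run(self, local_mail):
        category = self.obtaining.get_category()
        mails = self.crawling.fetch(local_mail, category)
        stored = self.classification.classify(mails)
        folders = stored
        if self.naming is not None:
            folders = {self.naming.name(category, kw): ms
                       for kw, ms in stored.items()}
        counts = self.statistics.count(stored) if self.statistics else None
        return folders, counts
```

Passing the optional units as constructor arguments that default to `None` reflects the text's statement that units 204 and 205 may or may not be present.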
It should be noted that, for brevity, each of the foregoing method embodiments is described as a series of combined actions. Those skilled in the art should understand, however, that the present invention is not limited by the described order of actions, because according to the present invention some steps may be performed in another order or simultaneously. Those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and that the actions and modules involved are not necessarily required by the present invention.
In the above embodiments, the description of each embodiment has its own emphasis; for parts not described in detail in one embodiment, refer to the related descriptions of the other embodiments.
A person of ordinary skill in the art will understand that all or some of the steps of the methods in the above embodiments may be carried out by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium, which may include a flash drive, read-only memory (ROM), random access memory (RAM), a magnetic disk, an optical disc, or the like.
The content downloading method and the related devices and systems provided by the embodiments of the present invention have been described in detail above. Specific examples are used herein to explain the principles and implementations of the present invention, and the description of the above embodiments is intended only to help in understanding the method of the present invention and its core idea. At the same time, a person of ordinary skill in the art may, following the idea of the present invention, make changes to the specific implementations and the scope of application. In summary, the content of this specification should not be construed as limiting the present invention.

Claims (6)

  1. A method for collecting local mail data, characterized in that the method comprises the following steps:
    acquiring the category of the local mail data that needs to be collected;
    fetching, from the local mail data, the collected mail data whose category is the same as that category;
    acquiring a keyword of the collected mail data, and storing the collected mail data separately according to the keyword.
  2. The method according to claim 1, characterized in that the method further comprises:
    naming the storage folder by category and keyword.
  3. The method according to claim 1, characterized in that the method further comprises:
    counting the number of times the same keyword occurs.
  4. A system for collecting local mail data, characterized in that the system comprises:
    an obtaining unit, configured to acquire the category of the local mail data that needs to be collected;
    a crawling unit, configured to fetch, from the local mail data, the collected mail data whose category is the same as that category;
    a classification unit, configured to acquire a keyword of the collected mail data and to store the collected mail data separately according to the keyword.
  5. The system according to claim 4, characterized in that the system further comprises:
    a naming unit, configured to name the storage folder by category and keyword.
  6. The system according to claim 4, characterized in that the system further comprises:
    a statistics unit, configured to count the number of times the same keyword occurs.

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/088501 WO2018006256A1 (en) 2016-07-05 2016-07-05 Local mail data collection method and system

Publications (1)

Publication Number Publication Date
WO2018006256A1 true WO2018006256A1 (en) 2018-01-11

Family

ID=60901528

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/088501 WO2018006256A1 (en) 2016-07-05 2016-07-05 Local mail data collection method and system

Country Status (1)

Country Link
WO (1) WO2018006256A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6925605B2 (en) * 2000-12-28 2005-08-02 International Business Machines Corporation Collating table for email
JP2006285857A (en) * 2005-04-04 2006-10-19 Sanyo Electric Co Ltd Mail server
CN101488198A (en) * 2008-01-17 2009-07-22 联想(北京)有限公司 Mail classifying method and apparatus
CN103136266A (en) * 2011-12-01 2013-06-05 中兴通讯股份有限公司 Method and device for classification of mail
CN104734943A (en) * 2015-03-17 2015-06-24 深圳市连用科技有限公司 Processing method and system for E-mails
CN106169974A (en) * 2016-07-05 2016-11-30 马岩 The gathering method of local mail data and system

Legal Events

Date Code Title Description
121 Ep: the EPO has been informed by WIPO that EP was designated in this application. Ref document number: 16907761; Country of ref document: EP; Kind code of ref document: A1
NENP Non-entry into the national phase. Ref country code: DE
122 Ep: PCT application non-entry in European phase. Ref document number: 16907761; Country of ref document: EP; Kind code of ref document: A1