CN113961589A - Internet information collection processing method and system - Google Patents

Internet information collection processing method and system Download PDF

Info

Publication number
CN113961589A
CN113961589A CN202111575615.3A CN202111575615A CN113961589A CN 113961589 A CN113961589 A CN 113961589A CN 202111575615 A CN202111575615 A CN 202111575615A CN 113961589 A CN113961589 A CN 113961589A
Authority
CN
China
Prior art keywords
information
basic
data
verification
internet
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111575615.3A
Other languages
Chinese (zh)
Inventor
于东辉
陈婷慧
李乐
李雯雯
张威
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongjing Future Beijing Media Technology Co ltd
Original Assignee
Zhongjing Future Beijing Media Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhongjing Future Beijing Media Technology Co ltd filed Critical Zhongjing Future Beijing Media Technology Co ltd
Priority to CN202111575615.3A priority Critical patent/CN113961589A/en
Publication of CN113961589A publication Critical patent/CN113961589A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Remote Sensing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention relates to the technical field of information collection and processing, and particularly discloses a method and a system for collecting and processing internet information. The embodiment of the invention retrieves and collects the internet data through the key information to obtain the collected data; screening the collected data for basic integrity, and performing primary information extraction processing on the screened collected data to obtain basic frame information; verifying and screening the basic frame information to obtain verified frame information; and adding or updating information to the electronic map according to the verification frame information. After the retrieval and collection of the internet data can be carried out, the collected data are screened, primarily processed and verified and screened, then verification frame information is obtained, information addition or information updating is carried out on the electronic map according to the verification frame information, therefore, the information addition or updating can be automatically carried out on the electronic map, and the information in the electronic map is effectively enriched.

Description

Internet information collection processing method and system
Technical Field
The invention belongs to the technical field of information collection and processing, and particularly relates to an internet information collection and processing method and system.
Background
The information collection processing is a process of finding, acquiring and processing information to be known in various ways according to the needs of information work, the information collection processing work is a first step of better utilizing the information and is also a key step, and the quality of the information collection processing directly relates to the quality of the information. In the existing internet information collecting and processing process for adding or updating information of an electronic map, a user is usually required to actively upload the information, otherwise, the information of merchants in the electronic map cannot be perfected, so that the information in the electronic map is monotonous, and richer information cannot be automatically updated.
Disclosure of Invention
The embodiment of the invention aims to provide an internet information collecting and processing method and system, and aims to solve the problems in the background art.
In order to achieve the above purpose, the embodiments of the present invention provide the following technical solutions:
an internet information collection processing method specifically comprises the following steps:
retrieving and collecting the internet data through the key information to obtain collected data;
screening the collected data for basic integrity, and performing primary information extraction processing on the screened collected data to obtain basic frame information;
verifying and screening the basic frame information to obtain verified frame information;
and adding or updating information to the electronic map according to the verification frame information.
As a further limitation of the technical solution of the embodiment of the present invention, the retrieving and collecting internet data through the key information to obtain the collected data specifically includes the following steps:
receiving key information;
performing character retrieval on internet data through the key information to obtain character collection data;
and carrying out picture collection on the internet data through the key information to obtain picture collection data.
As a further limitation of the technical solution of the embodiment of the present invention, the screening of the basic integrity of the collected data and the preliminary information extraction processing of the screened collected data to obtain the basic frame information specifically include the following steps:
screening the collected data for basic integrity to obtain basic data;
and performing primary information extraction processing on the basic data to obtain basic frame information.
As a further limitation of the technical solution of the embodiment of the present invention, the screening of the basic integrity of the collected data to obtain the basic data specifically includes the following steps:
extracting keywords from the collected data to obtain a keyword set;
judging whether the keyword set contains preset basic information or not;
if the keyword set contains basic information, combining the keyword set to generate basic data;
and if the keyword set does not contain basic information, removing the collected data.
As a further limitation of the technical solution of the embodiment of the present invention, the performing the preliminary information extraction processing on the basic data to obtain the basic frame information specifically includes the following steps:
performing information type identification on the basic data to obtain a plurality of basic information types;
according to the basic information types, performing primary information extraction processing on basic data to obtain a plurality of extraction information;
and importing the plurality of extracted information into a preset basic classification frame to obtain basic frame information.
As a further limitation of the technical solution of the embodiment of the present invention, the verifying and screening the basic frame information to obtain the verified frame information specifically includes the following steps:
analyzing the basic frame information to acquire a verification mode and a verification problem;
performing verification screening according to the verification mode and the verification problem to obtain difference information;
and reconstructing the basic frame information according to the difference information to obtain verification frame information.
As a further limitation of the technical solution of the embodiment of the present invention, the adding or updating information to or from the electronic map according to the verification frame information specifically includes the following steps:
extracting address information in the verification frame information;
acquiring a standard information frame in the electronic map according to the address information;
and adding or updating information in the standard information frame according to the verification frame information.
An internet information collecting and processing system, the system comprising a retrieval collecting unit, a preliminary processing unit, a verification screening unit, and an addition updating unit, wherein:
the retrieval and collection unit is used for retrieving and collecting the internet data through the key information to obtain collected data;
the primary processing unit is used for screening the basic integrity of the collected data and extracting and processing the primary information of the screened collected data to obtain basic frame information;
the verification screening unit is used for verifying and screening the basic frame information to obtain verification frame information;
and the adding and updating unit is used for adding or updating information to the electronic map according to the verification frame information.
As a further limitation of the technical solution of the embodiment of the present invention, the retrieval and collection unit specifically includes:
the information receiving module is used for receiving key information;
the character retrieval module is used for carrying out character retrieval on the internet data through the key information to obtain character collection data;
and the picture collecting module is used for carrying out picture collection on the internet data through the key information to obtain picture collecting data.
As a further limitation of the technical solution of the embodiment of the present invention, the preliminary processing unit specifically includes:
the basic data acquisition module is used for screening the basic integrity of the collected data to obtain basic data;
and the frame information acquisition module is used for performing primary information extraction processing on the basic data to obtain basic frame information.
Compared with the prior art, the invention has the beneficial effects that:
the embodiment of the invention retrieves and collects the internet data through the key information to obtain the collected data; screening the collected data for basic integrity, and performing primary information extraction processing on the screened collected data to obtain basic frame information; verifying and screening the basic frame information to obtain verified frame information; and adding or updating information to the electronic map according to the verification frame information. After the retrieval and collection of the internet data can be carried out, the collected data are screened, primarily processed and verified and screened, then verification frame information is obtained, information addition or information updating is carried out on the electronic map according to the verification frame information, therefore, the information addition or updating can be automatically carried out on the electronic map, and the information in the electronic map is effectively enriched.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention.
Fig. 1 shows a flow chart of a method provided by an embodiment of the invention.
Fig. 2 shows a flow chart of internet data retrieval collection in the method provided by the embodiment of the invention.
Fig. 3 shows a flowchart of basic integrity screening and preliminary information extraction processing in the method provided by the embodiment of the present invention.
Fig. 4 shows a flowchart of obtaining basic data in the method provided by the embodiment of the present invention.
Fig. 5 shows a flowchart for obtaining basic framework information in the method provided by the embodiment of the present invention.
Fig. 6 shows a flowchart for obtaining verification framework information in the method provided by the embodiment of the present invention.
Fig. 7 shows a flowchart of information addition or information update to the electronic map in the method provided by the embodiment of the invention.
Fig. 8 shows an application architecture diagram of a system provided by an embodiment of the invention.
Fig. 9 is a block diagram illustrating a structure of a search collection unit in the system according to the embodiment of the present invention.
Fig. 10 is a block diagram illustrating a configuration of a preliminary processing unit in the system according to the embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
It can be understood that in the existing internet information collecting and processing process for adding or updating information of an electronic map, a user is usually required to actively upload information, otherwise, the information of merchants in the electronic map cannot be perfected, so that the information in the electronic map is monotonous, and richer information cannot be automatically updated.
In order to solve the problems, the embodiment of the invention retrieves and collects the internet data through the key information to obtain the collected data; screening the collected data for basic integrity, and performing primary information extraction processing on the screened collected data to obtain basic frame information; verifying and screening the basic frame information to obtain verified frame information; and adding or updating information to the electronic map according to the verification frame information. After the retrieval and collection of the internet data can be carried out, the collected data are screened, primarily processed and verified and screened, then verification frame information is obtained, information addition or information updating is carried out on the electronic map according to the verification frame information, therefore, the information addition or updating can be automatically carried out on the electronic map, and the information in the electronic map is effectively enriched.
Fig. 1 shows a flow chart of a method provided by an embodiment of the invention.
Specifically, the internet information collection processing method specifically comprises the following steps:
and step S101, retrieving and collecting the Internet data through the key information to obtain collected data.
In the embodiment of the invention, data retrieval is carried out in internet data through automatically generated or input key information, and the retrieved data is collected to obtain collected data. Specifically, there may be multiple kinds of key information, and different key information determines different search collection ranges, for example: the key information may be place name, market name, shopping, snacks, etc.
Specifically, fig. 2 shows a flowchart of internet data retrieval and collection in the method provided by the embodiment of the present invention.
In a preferred embodiment of the present invention, the retrieving and collecting internet data through the key information to obtain collected data specifically includes the following steps:
in step S1011, key information is received.
In the embodiment of the invention, the input key information is received, or the key information automatically generated according to the updating information of the electronic map is received. For example: the key information can be the inputted place name, and also can be the market name generated by the construction and the recruitment of a certain market.
And step S1012, performing character retrieval on the Internet data through the key information to obtain character collection data.
In the embodiment of the invention, the character retrieval of the internet data is carried out according to the key information, and the character collection data is obtained. In particular, the word collection data may be a merchant name, a location, a consumer rating, and the like.
And S1013, carrying out picture collection on the Internet data through the key information to obtain picture collection data.
In the embodiment of the invention, the picture collection of the internet data is carried out according to the key information, and the picture collection data is obtained. Specifically, the picture collection data may be a name of a merchant, positioning data of a picture, a gate photo, an internal photo, and the like.
Further, the internet information collection processing method further comprises the following steps:
and S102, screening the basic integrity of the collected data, and performing primary information extraction processing on the screened collected data to obtain basic frame information.
In the embodiment of the invention, whether the collected data contains basic data required by adding electronic map information or updating information is analyzed, and the collected data with the basic data required by adding electronic map information or updating information is subjected to preliminary information extraction processing of information classification by screening the basic integrity to obtain basic frame information.
Specifically, fig. 3 shows a flowchart of basic integrity screening and preliminary information extraction processing in the method provided by the embodiment of the present invention.
In a preferred embodiment provided by the present invention, the screening of the basic integrity of the collected data and the preliminary information extraction processing of the collected data after the screening to obtain the basic frame information specifically include the following steps:
and S1021, screening the basic integrity of the collected data to obtain basic data.
In the embodiment of the invention, the collected data is subjected to basic integrity screening, whether the collected data contains basic information required by adding electronic map information or updating information is judged, and inconsistent data is removed to obtain the basic data. For example: the basic information comprises the name, the address and the business hours of the merchant, and collected data which does not comprise the name, the address and the business hours of the merchant is removed.
Specifically, fig. 4 shows a flowchart for obtaining basic data in the method provided by the embodiment of the present invention.
In a preferred embodiment provided by the present invention, the screening of the basic integrity of the collected data to obtain basic data specifically includes the following steps:
step S10211, extracting keywords from the collected data to obtain a keyword set.
In the embodiment of the invention, the keywords in the collected data are extracted, and the extracted keywords are sorted to obtain the keyword set. Specifically, the keywords also include words and addresses in the pictures, and the keyword set at this time includes the corresponding pictures.
Step S10212, determining whether the keyword set contains preset basic information.
Step S10213, if the keyword set contains basic information, the keyword set is combined to generate basic data.
Step S10214, if the keyword set does not contain basic information, removing the collected data.
In the embodiment of the invention, whether the keywords in the keyword set contain basic information required for adding electronic map information or updating information is judged, the qualified keyword set is combined to generate basic data, and the collected data corresponding to the unqualified keyword set is removed.
Further, the step of screening the collected data for the basic integrity and performing preliminary information extraction processing on the screened collected data to obtain basic frame information further includes the following steps:
step S1022, perform preliminary information extraction processing on the basic data to obtain basic frame information.
In the embodiment of the invention, a plurality of different types of data in the basic data are extracted, and the extracted data are transmitted to a preset basic classification frame in a classification manner to generate basic frame information.
Specifically, fig. 5 shows a flowchart for obtaining basic framework information in the method provided by the embodiment of the present invention.
In a preferred embodiment provided by the present invention, the performing a preliminary information extraction process on the basic data to obtain basic framework information specifically includes the following steps:
step S10221, performing information type identification on the basic data to obtain a plurality of basic information types.
In the embodiment of the invention, the information types in the basic data are identified and classified to obtain a plurality of basic information types.
Step S10222, performing preliminary information extraction processing on the basic data according to the plurality of basic information types to obtain a plurality of extracted information.
In the embodiment of the invention, the information in the basic data is classified and extracted through a plurality of basic information types, and a plurality of pieces of extracted information of different information types are generated.
Step S10223, importing the extracted information into a preset basic classification frame to obtain basic frame information.
Further, the internet information collection processing method further comprises the following steps:
and step S103, verifying and screening the basic frame information to obtain verified frame information.
In the embodiment of the invention, the basic frame information is verified and screened, and the information with errors in the basic frame information is removed or replaced to generate the verified frame information. For example: the information of a certain hot pot restaurant comprises the name, the contact way, the address, the gate photo, the inside photo, the consumer evaluation and the like of the hot pot restaurant, and the information can be contacted with the hot pot restaurant through the contact way, confirmed and the incorrect information can be deleted or replaced.
Specifically, fig. 6 shows a flowchart for obtaining verification framework information in the method provided by the embodiment of the present invention.
In a preferred embodiment provided by the present invention, the performing verification screening on the basic framework information to obtain verification framework information specifically includes the following steps:
and step S1031, analyzing the basic frame information, and acquiring a verification mode and a verification problem.
In the embodiment of the invention, the basic frame information is analyzed, the information which can be verified in the basic frame information is extracted, and then the verification mode and the verification problem are generated according to the type of the information.
And S1032, carrying out verification screening according to the verification mode and the verification problem to obtain difference information.
And step S1033, reconstructing the basic frame information according to the difference information to obtain verification frame information.
Further, the internet information collection processing method further comprises the following steps:
and step S104, adding or updating information to the electronic map according to the verification frame information.
In the embodiment of the invention, a plurality of pieces of information in the verification frame information are matched with a standard information frame in the electronic map, the information in the verification frame information is uploaded to the electronic map, and information addition or information updating of a certain place is realized in the electronic map.
Specifically, fig. 7 shows a flowchart of adding or updating information to or from an electronic map in the method according to the embodiment of the present invention.
In a preferred embodiment of the present invention, the adding or updating information to or from the electronic map according to the verification framework information specifically includes the following steps:
step S1041, extracting address information in the verification frame information.
In the embodiment of the invention, the address information is identified and extracted from the verification frame information to obtain the address information.
Step S1042, according to the address information, a standard information frame in the electronic map is obtained.
In the embodiment of the invention, the standard information frame corresponding to the marking position of the address information in the electronic map is obtained according to the address information.
And step S1043, adding information or updating information in the standard information frame according to the verification frame information.
In the embodiment of the invention, the information of the verification frame is analyzed, extracted and uploaded according to the standard information frame, so that the information of the standard information frame which is not uploaded is added, and the information of the standard information frame which needs to be updated is updated and covered.
Further, fig. 8 is a diagram illustrating an application architecture of the system according to the embodiment of the present invention.
In another preferred embodiment, the present invention provides an internet information collection processing system, including:
and the retrieval and collection unit 101 is used for retrieving and collecting the internet data through the key information to obtain collected data.
In the embodiment of the present invention, the retrieval collecting unit 101 performs data retrieval on internet data by automatically generating or inputting key information, and collects retrieved data to obtain collected data. Specifically, there may be multiple kinds of key information, and different key information determines different search collection ranges, for example: the key information may be place name, market name, shopping, snacks, etc.
Specifically, fig. 9 shows a block diagram of a retrieval and collection unit 101 in the system according to the embodiment of the present invention.
In a preferred embodiment provided by the present invention, the search collection unit 101 specifically includes:
the information receiving module 1011 is configured to receive the key information.
In the embodiment of the present invention, the information receiving module 1011 receives the input key information or receives key information automatically generated according to the update information of the electronic map.
And a text retrieval module 1012, configured to perform text retrieval on the internet data through the key information to obtain text collection data.
In the embodiment of the present invention, the text retrieval module 1012 performs text retrieval on internet data according to the key information, and obtains text collection data. In particular, the word collection data may be a merchant name, a location, a consumer rating, and the like.
And the picture collecting module 1013 is configured to collect pictures of the internet data according to the key information to obtain picture collection data.
In the embodiment of the present invention, the picture collecting module 1013 collects pictures of internet data according to the key information, and obtains picture collection data. Specifically, the picture collection data may be a name of a merchant, positioning data of a picture, a gate photo, an internal photo, and the like.
Further, the internet information collecting and processing system further includes:
and the primary processing unit 102 is configured to perform basic integrity screening on the collected data, and perform primary information extraction processing on the collected data after screening to obtain basic framework information.
In the embodiment of the present invention, the preliminary processing unit 102 analyzes whether the collected data includes basic data required for adding electronic map information or updating information, and performs preliminary information extraction processing of information classification on the collected data having the basic data required for adding electronic map information or updating information by performing basic integrity screening, so as to obtain basic frame information.
Specifically, fig. 10 shows a block diagram of the preliminary processing unit 102 in the system according to the embodiment of the present invention.
In a preferred embodiment provided by the present invention, the preliminary processing unit 102 specifically includes:
and a basic data obtaining module 1021, configured to perform basic integrity screening on the collected data to obtain basic data.
In the embodiment of the present invention, the basic data obtaining module 1021 performs basic integrity screening on the collected data, determines whether the collected data includes basic information required for adding electronic map information or updating information, and eliminates inconsistent data to obtain basic data.
The frame information obtaining module 1022 is configured to perform preliminary information extraction processing on the basic data to obtain basic frame information.
In this embodiment of the present invention, the frame information obtaining module 1022 extracts a plurality of different types of data in the basic data, and transmits the extracted data to a preset basic classification frame in a classified manner, so as to generate basic frame information.
Further, the internet information collecting and processing system further includes:
and the verification screening unit 103 is configured to perform verification screening on the basic frame information to obtain verification frame information.
In the embodiment of the present invention, the verification screening unit 103 performs verification screening on the basic frame information, and removes or replaces information having an error in the basic frame information to generate verification frame information. For example: the information of a certain hot pot restaurant comprises the name, the contact way, the address, the gate photo, the inside photo, the consumer evaluation and the like of the hot pot restaurant, and the information can be contacted with the hot pot restaurant through the contact way, confirmed and the incorrect information can be deleted or replaced.
And an adding and updating unit 104, configured to add or update information to or in the electronic map according to the verification frame information.
In the embodiment of the present invention, the adding and updating unit 104 matches a plurality of pieces of information in the verification frame information with a standard information frame in the electronic map, uploads the pieces of information in the verification frame information to the electronic map, and adds or updates information in a certain location in the electronic map.
In summary, according to the embodiment of the present invention, after the internet data is retrieved and collected, the collected data is subjected to screening, preliminary processing, and verification screening, so as to obtain the verification framework information, and the electronic map is subjected to information addition or information update according to the verification framework information, so that the information addition or update can be automatically performed on the electronic map, and the information in the electronic map is effectively enriched.
It should be understood that, although the steps in the flowcharts of the embodiments of the present invention are shown in sequence as indicated by the arrows, the steps are not necessarily performed in sequence as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least a portion of the steps in various embodiments may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times, and the order of performance of the sub-steps or stages is not necessarily sequential, but may be performed in turn or alternately with other steps or at least a portion of the sub-steps or stages of other steps.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a non-volatile computer-readable storage medium, and can include the processes of the embodiments of the methods described above when the program is executed. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the present invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims (10)

1. An internet information collection processing method is characterized by specifically comprising the following steps:
retrieving and collecting the internet data through the key information to obtain collected data;
screening the collected data for basic integrity, and performing primary information extraction processing on the screened collected data to obtain basic frame information;
verifying and screening the basic frame information to obtain verified frame information;
and adding or updating information to the electronic map according to the verification frame information.
2. The internet information collection processing method according to claim 1, wherein the retrieving and collecting internet data through the key information to obtain the collected data specifically includes the following steps:
receiving key information;
performing character retrieval on internet data through the key information to obtain character collection data;
and carrying out picture collection on the internet data through the key information to obtain picture collection data.
3. The internet information collecting and processing method according to claim 1, wherein the step of screening the collected data for basic integrity and performing preliminary information extraction processing on the screened collected data to obtain basic frame information specifically includes the steps of:
screening the collected data for basic integrity to obtain basic data;
and performing primary information extraction processing on the basic data to obtain basic frame information.
4. The internet information collecting and processing method according to claim 3, wherein the step of screening the collected data for basic integrity to obtain basic data specifically comprises the steps of:
extracting keywords from the collected data to obtain a keyword set;
judging whether the keyword set contains preset basic information or not;
if the keyword set contains basic information, combining the keyword set to generate basic data;
and if the keyword set does not contain basic information, removing the collected data.
5. The internet information collection processing method according to claim 3, wherein the step of performing preliminary information extraction processing on the basic data to obtain basic framework information specifically includes the steps of:
performing information type identification on the basic data to obtain a plurality of basic information types;
according to the basic information types, performing primary information extraction processing on basic data to obtain a plurality of extraction information;
and importing the plurality of extracted information into a preset basic classification frame to obtain basic frame information.
6. The internet information collecting and processing method according to claim 1, wherein the step of performing verification screening on the basic framework information to obtain verification framework information specifically includes the steps of:
analyzing the basic frame information to acquire a verification mode and a verification problem;
performing verification screening according to the verification mode and the verification problem to obtain difference information;
and reconstructing the basic frame information according to the difference information to obtain verification frame information.
7. The internet information collection processing method according to claim 1, wherein the information adding or information updating of the electronic map according to the verification framework information specifically includes the steps of:
extracting address information in the verification frame information;
acquiring a standard information frame in the electronic map according to the address information;
and adding or updating information in the standard information frame according to the verification frame information.
8. An internet information collecting and processing system, comprising a retrieval collecting unit, a preliminary processing unit, a verification screening unit and an addition updating unit, wherein:
the retrieval and collection unit is used for retrieving and collecting the internet data through the key information to obtain collected data;
the primary processing unit is used for screening the basic integrity of the collected data and extracting and processing the primary information of the screened collected data to obtain basic frame information;
the verification screening unit is used for verifying and screening the basic frame information to obtain verification frame information;
and the adding and updating unit is used for adding or updating information to the electronic map according to the verification frame information.
9. The system for collecting and processing internet information according to claim 8, wherein the retrieving and collecting unit specifically includes:
the information receiving module is used for receiving key information;
the character retrieval module is used for carrying out character retrieval on the internet data through the key information to obtain character collection data;
and the picture collecting module is used for carrying out picture collection on the internet data through the key information to obtain picture collecting data.
10. The system of claim 8, wherein the preliminary processing unit comprises:
the basic data acquisition module is used for screening the basic integrity of the collected data to obtain basic data;
and the frame information acquisition module is used for performing primary information extraction processing on the basic data to obtain basic frame information.
CN202111575615.3A 2021-12-22 2021-12-22 Internet information collection processing method and system Pending CN113961589A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111575615.3A CN113961589A (en) 2021-12-22 2021-12-22 Internet information collection processing method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111575615.3A CN113961589A (en) 2021-12-22 2021-12-22 Internet information collection processing method and system

Publications (1)

Publication Number Publication Date
CN113961589A true CN113961589A (en) 2022-01-21

Family

ID=79473560

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111575615.3A Pending CN113961589A (en) 2021-12-22 2021-12-22 Internet information collection processing method and system

Country Status (1)

Country Link
CN (1) CN113961589A (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110179080A1 (en) * 2010-01-20 2011-07-21 Clarion Co., Ltd. Map Update Data Delivery Method, Map Update Data Delivery Device and Terminal Device
US8799799B1 (en) * 2013-05-07 2014-08-05 Palantir Technologies Inc. Interactive geospatial map
CN104615710A (en) * 2015-02-04 2015-05-13 韩海丰 Electronic map frame data updating method
CN104899224A (en) * 2014-03-09 2015-09-09 上海能感物联网有限公司 Inquiry device for information of Chinese natural language text remote control inquiry way director
CN106845470A (en) * 2017-02-20 2017-06-13 百度在线网络技术(北京)有限公司 Map data collecting method and apparatus
CN108399224A (en) * 2018-02-12 2018-08-14 安徽千云度信息技术有限公司 A kind of method of the push of shopping at network information
CN109872392A (en) * 2019-02-19 2019-06-11 北京百度网讯科技有限公司 Man-machine interaction method and device based on high-precision map
CN112052410A (en) * 2020-09-30 2020-12-08 北京百度网讯科技有限公司 Map interest point updating method and device
CN112199570A (en) * 2020-10-29 2021-01-08 重庆撼地大数据有限公司 Real estate information visualization analysis system and method based on web crawler

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110179080A1 (en) * 2010-01-20 2011-07-21 Clarion Co., Ltd. Map Update Data Delivery Method, Map Update Data Delivery Device and Terminal Device
US8799799B1 (en) * 2013-05-07 2014-08-05 Palantir Technologies Inc. Interactive geospatial map
CN104899224A (en) * 2014-03-09 2015-09-09 上海能感物联网有限公司 Inquiry device for information of Chinese natural language text remote control inquiry way director
CN104615710A (en) * 2015-02-04 2015-05-13 韩海丰 Electronic map frame data updating method
CN106845470A (en) * 2017-02-20 2017-06-13 百度在线网络技术(北京)有限公司 Map data collecting method and apparatus
CN108399224A (en) * 2018-02-12 2018-08-14 安徽千云度信息技术有限公司 A kind of method of the push of shopping at network information
CN109872392A (en) * 2019-02-19 2019-06-11 北京百度网讯科技有限公司 Man-machine interaction method and device based on high-precision map
CN112052410A (en) * 2020-09-30 2020-12-08 北京百度网讯科技有限公司 Map interest point updating method and device
CN112199570A (en) * 2020-10-29 2021-01-08 重庆撼地大数据有限公司 Real estate information visualization analysis system and method based on web crawler

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
D.RAJINIGIRINATH S.SELVAN: ""Investigations on vehicles guided by updated road and maps"", 《PRZEGLAD ELEKTROTECHNICZNY》 *
满亢 等: ""基于Internet的个人交通助理系统设计与实现"", 《自动化仪表》 *

Similar Documents

Publication Publication Date Title
US9390176B2 (en) System and method for recursively traversing the internet and other sources to identify, gather, curate, adjudicate, and qualify business identity and related data
CN109756760B (en) Video tag generation method and device and server
WO2018040068A1 (en) Knowledge graph-based semantic analysis system and method
CN109726664B (en) Intelligent dial recommendation method, system, equipment and storage medium
CN110674360B (en) Tracing method and system for data
CN110019542B (en) Generation of enterprise relationship, generation of organization member database and identification of same name member
CN112559526A (en) Data table export method and device, computer equipment and storage medium
CN110457394A (en) Vehicle information management method, apparatus, computer equipment and storage medium
CN112948504B (en) Data acquisition method and device, computer equipment and storage medium
CN114153898A (en) Method, device and application for combing relationships among database tables
CN111382570A (en) Text entity recognition method and device, computer equipment and storage medium
CN109359279B (en) Report generation method, report generation device, computer equipment and storage medium
CN113961589A (en) Internet information collection processing method and system
CN115577694B (en) Intelligent recommendation method for standard writing
CN110781310A (en) Target concept graph construction method and device, computer equipment and storage medium
CN110888977B (en) Text classification method, apparatus, computer device and storage medium
CN115631011A (en) Product pushing method and system based on Internet
CN115756486A (en) Data interface analysis method and device
CN111460268B (en) Method and device for determining database query request and computer equipment
CN112785095A (en) Loan prediction method, loan prediction device, electronic device, and computer-readable storage medium
CN113205442A (en) E-government data feedback management method and device based on block chain
CN113468339A (en) Label extraction method, system, electronic device and medium based on knowledge graph
CN116074378B (en) Internet information pushing method and system
CN110633446B (en) Webpage column recognition model training method, using method, device and storage medium
CN115098538B (en) Database query optimization method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
AD01 Patent right deemed abandoned

Effective date of abandoning: 20230516

AD01 Patent right deemed abandoned