KR20190007628A - Weather search system using web data gathering - Google Patents

Weather search system using web data gathering Download PDF

Info

Publication number
KR20190007628A
KR20190007628A KR1020170088846A KR20170088846A KR20190007628A KR 20190007628 A KR20190007628 A KR 20190007628A KR 1020170088846 A KR1020170088846 A KR 1020170088846A KR 20170088846 A KR20170088846 A KR 20170088846A KR 20190007628 A KR20190007628 A KR 20190007628A
Authority
KR
South Korea
Prior art keywords
data
weather
web
search
keyword
Prior art date
Application number
KR1020170088846A
Other languages
Korean (ko)
Inventor
한승현
Original Assignee
한국과학기술원
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 한국과학기술원 filed Critical 한국과학기술원
Priority to KR1020170088846A priority Critical patent/KR20190007628A/en
Publication of KR20190007628A publication Critical patent/KR20190007628A/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Primary Health Care (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • General Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Remote Sensing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A weather search system using web data gathering according to one embodiment is disclosed. The weather search system includes the screen part of a terminal to which the Internet can be connected; a web data gathering part for gathering web data corresponding to search keywords; a weather keyword analyzing part for extracting only location/time/weather related words from web data received from the web data gathering part, and analyzing and classifying the weather related words; an analysis data verification and information providing part for verifying false information among the analyzed data. It is possible to have intuitive and convenient access to weather information.

Description

WEATHER SEARCH SYSTEM USING WEB DATA GATHERING USING WEB DATA COLLECTION

The following description relates to a system and method for providing weather data through Web page analysis. More specifically, the present invention relates to a system and method for providing weather data through web page analysis. More specifically, the system searches for time, location and weather keywords desired by the user, collects web pages according to the keywords, To a user and to a method thereof.

Although the present invention will be described in a manner applied to a computer web or a mobile application in the following description, the technique of the present invention is applicable to all objects capable of input / output such as a 'speech recognition speaker' and internet connection.

A recent poll by the Korea Meteorological Administration (KMA) in August 2016 showed that the public opinion on the weather forecast was higher than the public opinion for the first time, as the weather forecast of the Korea Meteorological Agency frequently deviated from the actual weather.

Despite growing disbelief in weather forecasts, the meteorological weather indicators provided by the Korea Meteorological Administration are also classified as unreliable.

The Meteorological Agency provides weather information by using various information such as the wind intensity and the humidity level of the day, but it is not true that the weather information is actually wrong, and even if the weather information is accurate, It is difficult. So, when I go out to see only the weather information, I get a problem when I feel sad all day because of weather other than expected.

In order to solve the above problem, when the user prepares for going out, it is considered to be the most reliable information to utilize the clothes or the destination information of the person who has actually gone out so that the weather of the day can be more accurately known.

Recently, SNS has become a platform to share people's daily life, and a lot of information is being uploaded in real time. In addition to the users' personal posts, SNS also includes user's response to the weather, clothing attuned to the day, And the like can be contacted indiscriminately.

The applicant has proposed the present invention in order to devise a system which allows a user to recognize the opinions of a person who has actually gone out in real time.

Non-Patent Literature: Public opinion specialist institution real meter http://www.realmeter.net/2016/08/ Meteorological Agency - Forecast - Disbelief - Opinion - Trust - First /

An object of the present invention is to provide a weather search system that allows a user to access desired weather information intuitively and conveniently by referring to a large amount of weather-related data on the web without using weather data provided by the weather station.

Another object of the present invention is to provide a weather-related content search system that enables searching for weather-related content along with weather, such as clothing.

It is another object of the present invention to provide a weather content recommendation system that automatically provides a variety of contents related to a search keyword beyond a search for a specific content related to weather, so that a user can easily access a lot of information.

According to an aspect of the present invention, A web data collection unit for collecting web data corresponding to search keywords; A search term clustering database for providing similar and various search terms to the web data collection unit in addition to the search keywords; A weather keyword analyzing unit for extracting only location / time / weather related words from the web data received from the web data collecting unit and generating analysis and analysis results for weather related words; Among the analyzed data, data for verifying false information are analyzed data verification and information providing; And a weather search result database having weather search result data according to the search keyword.

The screen portion is a part of a terminal device having a screen capable of accessing a web server and a database through which a wired / wireless Internet can be connected. And the weather search result can be confirmed through the screen.

The web data collection unit may include a server serving as a web data crawl, and may collect web page data corresponding to a given keyword set and transmit the web page data to the weather keyword analysis unit.

The web data collection unit tries to make similar keyword sets into one keyword set in order to collect richer data before crawling web data. At this time, a similar keyword cluster may be received from the keyword cluster database.

The web data collection unit may include a newly inputted keyword of the search term in the search term grouping database to perform the grouping operation again and update the new grouping data to the search word grouping database.

The weather keyword analyzing unit extracts location / time / weather related words from the transmitted web page data, generates statistics of each data through classification of the collected data, and transmits the statistics to the analysis data verification and information providing unit .

In order to verify the reliability of the transmitted analysis data, the analysis data verification and information providing unit may verify the reliable data by referring to the corresponding time and the weather station weather information of the area, and transmit only the verified data to the weather search result database.

The weather search system according to the present invention collects the actual feeling weather of the people who are performing outside activities in real time on the actual web in order to reduce the distance feeling of the weather data and the weather, To help you do so.

In addition, the weather-related content search system according to the present invention as described above enables users to collect other contents related to the weather in real time by collecting opinions of people who are actually outside, so that the user can select what activities and actions There is an effect of helping.

Other effects not mentioned may be clearly understood by those skilled in the art from the following description, unless the effects obtainable by the present invention are limited to the effects mentioned above will be.

FIG. 1 is a diagram showing a representative structure of the present invention, and is a diagram for explaining how an overall configuration is made and what data is exchanged among components. FIG.
FIG. 2 is a flow chart of the present invention showing a process according to the entire configuration. Data is exchanged according to the sequence shown in this flowchart.
3 is a diagram illustrating an example of weather and user-customized content recommendation system using web data according to an embodiment of the present invention.

Hereinafter, embodiments will be described in detail with reference to the accompanying drawings.

FIG. 1 is a block diagram showing a configuration of a navigation system according to an embodiment of the present invention. FIG. 1 is a block diagram illustrating a navigation system according to an embodiment of the present invention. Referring to FIG. 1, the navigation system includes a screen unit 00, a web data collection unit 10, a search word cluster database 20, a weather keyword analyzer 30, And a result database 50.

The screen portion 00 may be a smartphone application screen or a computer web page screen. The configuration of the screen unit 00 may be configured as shown in FIG. 1. However, the configuration of the screen unit 00 is not limited to the configuration shown in FIG.

When receiving the search request, the web data collecting unit 10 divides the search request sentence into words, and stores the time / location / weather / other expression words separately. For example, if the search term is "Kangnam Kwun-na," the stored information is stored as <11:00 / Kangnam / Kwunna>.

A group of weather expression words similar to the search request word is fetched from the search word grouping database 20. In the query word clustering database 20, weather expression words are clustered and stored. If there is no corresponding cluster, the search request words are not fetched.

A set of keyword phrases through these procedures is combined with existing search terms to create new keyword clusters, which include keywords that are richer than existing ones.

The web data collecting unit 10 creates a 'web data collection keyword collection' including a search keyword and words having similar meanings obtained from the search word clustering database 20.

If the search requested sentence is already in the weather search result database 50, the weather information is immediately received as a search result in the weather search result database 50 without any other procedure, and the other processes do not proceed.

The web data collecting unit 10 newly groups the search request words together with the words in the existing search word clustering database 20.

The clustering work includes "K-average algorithm", "center connection method" and "neural network technique" which are grouping algorithms of machine learning, but any clustering algorithm may be used.

If there is a change in the clustering content or a newly added item, it is transferred to the clustering database 20 and updated.

The web data collecting unit 10 collects the time, location, and location information on blogs, cafes and SNS (Facebook, Twitter, Instagram, etc.) of portal sites (Naver, Daum, Collect URLs of web pages that match weather expressions.

Web crawls the web page data of collected URLs and finds necessary data.

The basic algorithm of the web crawling is a simple structure for downloading a whole web page of a given URL, and it can be expanded in any way.

The web data collecting unit 10 transmits a 'search keyword collection' and stored web pages to the weather keyword analyzer 30.

The weather keyword analyzing unit 30 searches the collected web pages for time / location / weather related keywords, and stores the three items in a bundle. If there is any undiscovered time / location / weather, the data is excluded.

When all the web pages have been searched for keywords, the keywords are categorized according to the same time, location, and weather. As a result, the classified keywords are stored in the form of "yusunggu", "cold", 53%, <"ok", 16%>, <"too hot", 2%>.

The weather keyword analyzing unit 30 interlocks the sources of the statistical data corresponding to the analysis statistical data into one and transmits the same to the analysis data verification and information providing unit 40. [

The analysis data verification and information providing unit 40 fetches the temperature, precipitation, and wind data among the weather data of the weather station corresponding to the requested time.

The analysis data verification and information providing unit 40 receives the weather data and compares the received weather data with the keyword analysis statistical data, thereby removing the keyword analysis data having low reliability. If the temperature data received by the Korea Meteorological Administration is 6 ° C, remove contradictory keyword statistical data such as 'hot'.

The analysis data verification and information providing unit 40 stores the verified weather data and the keyword keyword collection in the weather search result database 50.

The weather search result database 50 transmits the searched weather result data to the screen unit 00.

The screen portion 00 of the application displays the weather statistical information of each data and the weather data source according to the configuration of the screen.

FIG. 2 is a flowchart illustrating a process according to an exemplary embodiment of the present invention.

When receiving the weather data from the portable terminal, the user first checks whether the search result exists in the database. If there is already a similar search result in the database, the weather result data will be imported directly without any further action.

This is an efficient configuration because there is a possibility that the procedure will be reduced when searching for weather.

If the same search results do not already exist in the database, the search term clusters are searched and retrieved from the search term clustering database. Collect web data with imported community keywords and infer weather.

3 is a diagram illustrating an example of guiding weather according to an embodiment of the present invention.

In this embodiment, in addition to the weather data search system, web data information is provided to provide user-customized recommendable content related to the weather.

The upper part of FIG. 3 (a) shows weather information weather information generally provided as a basic screen before a user searches for a specific keyword.

The lower part of FIG. 3 (a) stores the keyword data that the user has normally searched for, and refers to the data category frequently searched in the basic screen as the ranking information of the database, have.

FIG. 3 (b) is a screen after the user searches for a specific keyword and extracts weather data information from the collected web data based on the search keyword input by the user.

It refers to the past weather information similar to the weather data collected by the search keyword, and informs the information of the past. For example, if today's users searched for today's morning weather, they would refer to the weather forecasts for the other days with similar weather conditions.

This service uses weather data collection to provide weather data, so it displays links to view web pages of weather data that users have extracted.

The weather data source provides both the currently retrieved weather data and the expected weather data source.

FIG. 3 (c) is a service for recommending contents by referring to past weather which is similar to the weather corresponding to the current or search keyword, through a screen that can be viewed through the real-time customized recommendation item of FIG.

And recommends contents received positive feedback from a large number of people on the web page of the past time.

Link to the corresponding web page so that the user can directly see the detailed source and contents of such content.

The apparatus described above may be implemented as a hardware component, a software component, and / or a combination of hardware components and software components. For example, the apparatus and components described in the embodiments may be implemented within a computer system, such as, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, a field programmable gate array (FPGA) , A programmable logic unit (PLU), a microprocessor, or any other device capable of executing and responding to instructions. The processing device may execute an operating system (OS) and one or more software applications running on the operating system. The processing device may also access, store, manipulate, process, and generate data in response to execution of the software. For ease of understanding, the processing apparatus may be described as being used singly, but those skilled in the art will recognize that the processing apparatus may have a plurality of processing elements and / As shown in FIG. For example, the processing unit may comprise a plurality of processors or one processor and one controller. Other processing configurations are also possible, such as a parallel processor.

The software may include a computer program, code, instructions, or a combination of one or more of the foregoing, and may be configured to configure the processing device to operate as desired or to process it collectively or collectively Device can be commanded. The software and / or data may be in the form of any type of machine, component, physical device, virtual equipment, computer storage media, or device As shown in FIG. The software may be distributed over a networked computer system and stored or executed in a distributed manner. The software and data may be stored on one or more computer readable recording media.

The method according to an embodiment may be implemented in the form of a program command that can be executed through various computer means and recorded in a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, and the like, alone or in combination. The program instructions to be recorded on the medium may be those specially designed and configured for the embodiments or may be available to those skilled in the art of computer software. Examples of computer-readable media include magnetic media such as hard disks, floppy disks and magnetic tape; optical media such as CD-ROMs and DVDs; magnetic media such as floppy disks; Magneto-optical media, and hardware devices specifically configured to store and execute program instructions such as ROM, RAM, flash memory, and the like. Examples of program instructions include machine language code such as those produced by a compiler, as well as high-level language code that can be executed by a computer using an interpreter or the like.

While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. For example, it is to be understood that the techniques described may be performed in a different order than the described methods, and / or that components of the described systems, structures, devices, circuits, Lt; / RTI &gt; or equivalents, even if it is replaced or replaced.

Therefore, other implementations, other embodiments, and equivalents to the claims are also within the scope of the following claims.

Claims (8)

A weather search system using web data collection,
A display unit of a terminal capable of connecting with a wired / wireless Internet and capable of communicating with a web server; And
A web data collection unit for collecting web data corresponding to search keywords; And
From the web data received from the web data collecting unit, only the position / time / weather related words
A weather keyword analyzer for extracting and analyzing weather related words and generating classification results; And
Among the analyzed data, data for verifying false information are analyzed data,
Based weather data search system.
The method according to claim 1,
And displaying the weather and related content data provided from the analysis data verification and information providing unit on the screen.
3. The method of claim 2,
The display unit includes:
And the data is displayed by changing a user interface (UI) according to the type of the item provided from the analysis data verification and information providing unit.
The method according to claim 1,
Wherein the web data collecting unit comprises:
And collecting web page data from web sites containing keywords by receiving the keyword requested to be searched from the user.
5. The method of claim 4,
Wherein the web data collecting unit comprises:
Retrieving a set of population keywords including the retrieved requested keywords in a retrieval word clustering database;
Clustering the search word clustering database with the search request keyword and updating the clustering database with the search word clustering database;
And crawling web page data on the web sites based on all the keywords in the cluster keyword set and generating web page data.
5. The method of claim 4,
The web site is characterized in that,
Including Facebook, Twitter, Instagram, Naver blog, Naver cafe, and next cafe, including one or more of the social network service (SNS)
A weather search system.
The method according to claim 1,
The weather keyword analyzing unit,
Extracting position / time / weather keywords from each of the web pages of the web page data, analyzing which weather condition is indicated at the corresponding position / time in each web page, and generating analysis and classification data including all result data Weather search system.
The method according to claim 1,
Wherein the analysis data verification and information providing unit comprises:
Comparing the analysis and classification data with weather station weather data to filter and verify false web data information, and transmitting the analyzed and classified data to the weather search result database and the screen.

KR1020170088846A 2017-07-13 2017-07-13 Weather search system using web data gathering KR20190007628A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020170088846A KR20190007628A (en) 2017-07-13 2017-07-13 Weather search system using web data gathering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020170088846A KR20190007628A (en) 2017-07-13 2017-07-13 Weather search system using web data gathering

Publications (1)

Publication Number Publication Date
KR20190007628A true KR20190007628A (en) 2019-01-23

Family

ID=65280126

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020170088846A KR20190007628A (en) 2017-07-13 2017-07-13 Weather search system using web data gathering

Country Status (1)

Country Link
KR (1) KR20190007628A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20200112645A (en) * 2019-03-22 2020-10-05 (주)해안해양기술 High-precision wave prediction system with real-time verification

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20200112645A (en) * 2019-03-22 2020-10-05 (주)해안해양기술 High-precision wave prediction system with real-time verification

Similar Documents

Publication Publication Date Title
US11347782B2 (en) Internet text mining-based method and apparatus for judging validity of point of interest
CN107862022B (en) Culture resource recommendation system
CN102708174B (en) Method and device for displaying rich media information in browser
US8756224B2 (en) Methods, systems, and media for content ranking using real-time data
CN107220386A (en) Information-pushing method and device
CN107908789A (en) Method and apparatus for generating information
CN110825956A (en) Information flow recommendation method and device, computer equipment and storage medium
CN108563753A (en) Message pushes generation method, device and the computer readable storage medium of official documents and correspondence
CN102890702A (en) Internet forum-oriented opinion leader mining method
CN110688476A (en) Text recommendation method and device based on artificial intelligence
CN110019616A (en) A kind of POI trend of the times state acquiring method and its equipment, storage medium, server
US20200045122A1 (en) Method and apparatus for pushing information
CN107562939A (en) Vertical field news recommends method, apparatus and readable storage medium
KR102319438B1 (en) System for Providing Tourism information based on Bigdata and Driving method of the Same
CN107391675A (en) Method and apparatus for generating structure information
CN108062366B (en) Public culture information recommendation system
CN104899324A (en) Sample training system based on IDC (internet data center) harmful information monitoring system
CN106537387B (en) Retrieval/storage image associated with event
KR101873339B1 (en) System and method for providing interest contents
CN104090757A (en) Method and device for displaying rich media information in browser
CN104090923A (en) Method and device for displaying rich media information in browser
JP5848199B2 (en) Impact prediction device, impact prediction method, and program
CN112825089B (en) Article recommendation method, device, equipment and storage medium
CN106202312B (en) A kind of interest point search method and system for mobile Internet
US20170235835A1 (en) Information identification and extraction