JP2007164378A5 - Related word extraction device and related word extraction method - Google Patents

Publication number
JP2007164378A5
Authority
JP
Japan
Prior art keywords
vocabulary data
domain
web document
advertising
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2005358328A
Other languages
Japanese (ja)
Other versions
JP2007164378A (en)
JP4791169B2 (en)
Filing date
2005-12-12
Publication date
2008-07-17
Application filed
Priority to JP2005358328A
Priority claimed from JP2005358328A
Publication of JP2007164378A
Publication of JP2007164378A5
Application granted
Publication of JP4791169B2
Status: Active
Anticipated expiration

Claims (5)

A related word extraction device that associates mutually related advertising vocabulary data from among a plurality of advertising vocabulary data, so that a provider of Web documents can analyze vocabulary frequently used in a specific type of business or industry, the device comprising:
a receiving unit that receives Web documents stored in a storage device connected via a communication line;
a Web document storage unit that stores the Web documents received by the receiving unit;
an input unit that accepts input of first advertising vocabulary data related to the advertising vocabulary data to be extracted;
a Web document extraction unit that extracts, from the Web document storage unit, Web documents containing the first advertising vocabulary data input via the input unit;
an extraction unit that extracts second advertising vocabulary data commonly included in the Web documents extracted by the Web document extraction unit;
a domain generation unit that generates a domain in which the second advertising vocabulary data extracted by the extraction unit is associated with the first advertising vocabulary data; and
a domain storage unit that stores the domain generated by the domain generation unit.
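
The units recited in the device claim can be pictured with a minimal Python sketch. This is only an illustration under stated assumptions, not the patented implementation: the class and method names are hypothetical, documents are plain strings, tokens are split on whitespace (real Japanese text would need a proper tokenizer), and "commonly included" is read as "present in every extracted document".

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class RelatedWordExtractor:
    """Hypothetical stand-in for the claimed units; all names are illustrative."""

    documents: list[str] = field(default_factory=list)          # Web document storage unit
    domains: dict[str, set[str]] = field(default_factory=dict)  # domain storage unit

    def receive(self, text: str) -> None:
        # Receiving unit: a "Web document" is just a string in this sketch.
        self.documents.append(text)

    def extract_documents(self, first_term: str) -> list[set[str]]:
        # Web document extraction unit: keep documents containing the first term.
        # Assumption: whitespace tokenization.
        token_sets = [set(doc.split()) for doc in self.documents]
        return [tokens for tokens in token_sets if first_term in tokens]

    def generate_domain(self, first_term: str) -> set[str]:
        # Extraction unit: terms shared by every extracted document
        # (one possible reading of "commonly included").
        matched = self.extract_documents(first_term)
        common = set.intersection(*matched) - {first_term} if matched else set()
        # Domain generation and storage: associate the second terms with the first term.
        self.domains[first_term] = common
        return common
```

For example, after receiving "hotel booking travel discount", "hotel travel flight", and "car loan rate", calling `generate_domain("hotel")` in this sketch associates "travel" with "hotel", because "travel" is the only other term that appears in both documents containing "hotel".
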
The related word extraction device according to claim 1, wherein the domain generation unit associates a domain generated from other first advertising vocabulary data, different from the first advertising vocabulary data, and from second advertising vocabulary data extracted from the other first advertising vocabulary data, with a domain already stored in the domain storage unit.
The related word extraction device according to claim 1 or claim 2, wherein the extraction unit, when extracting the second advertising vocabulary data commonly included in the Web documents extracted by the Web document extraction unit, preferentially extracts second advertising vocabulary data having a high frequency of occurrence.
A related word extraction method for associating mutually related advertising vocabulary data from among a plurality of advertising vocabulary data, so that a provider of Web documents can analyze vocabulary frequently used in a specific type of business or industry, the method comprising:
a step of receiving Web documents stored in a storage device connected via a communication line;
a step of storing the Web documents received in the receiving step;
an input step of accepting input of first advertising vocabulary data related to the advertising vocabulary data to be extracted;
an extraction step of extracting Web documents containing the first advertising vocabulary data input in the input step;
a second advertising vocabulary data extraction step of extracting second advertising vocabulary data commonly included in the Web documents extracted by the Web document extraction unit;
a domain generation step of generating a domain in which the second advertising vocabulary data extracted in the second advertising vocabulary data extraction step is associated with the first advertising vocabulary data; and
a domain storage step of storing the domain generated in the domain generation step.
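
The preferential extraction of high-frequency second advertising vocabulary data, recited for the extraction unit, could be sketched as follows. The claims do not say how frequency is measured, so this sketch assumes document frequency (the number of extracted documents containing a term); the function name `rank_second_terms`, the `top_n` cutoff, and the whitespace tokenization are all hypothetical.

```python
from __future__ import annotations

from collections import Counter


def rank_second_terms(matched_docs: list[str], first_term: str, top_n: int = 3) -> list[str]:
    """Return candidate second terms, most frequent first (illustrative only)."""
    doc_freq: Counter[str] = Counter()
    for doc in matched_docs:
        # Count each term once per document; skip the first term itself.
        doc_freq.update(set(doc.split()) - {first_term})
    # Sort by descending frequency, breaking ties alphabetically for determinism.
    ranked = sorted(doc_freq.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:top_n]]
```

Ranking by frequency rather than requiring a term to appear in every document is a softer reading of "commonly included"; which reading the patent intends is not settled by the claim text alone.
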
The related word extraction method according to claim 4, wherein the domain generation step associates a domain generated from other first advertising vocabulary data, different from the first advertising vocabulary data, and from second advertising vocabulary data extracted from the other first advertising vocabulary data, with a domain already stored in the domain storage step.
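
Associating a newly generated domain with an already stored one might look like the sketch below. The claims do not state the criterion for associating two domains, so this example assumes that domains are linked when they share at least one term; the `DomainStore` class and its `links` attribute are hypothetical names introduced only for illustration.

```python
from __future__ import annotations


class DomainStore:
    """Illustrative domain storage that also records links between domains."""

    def __init__(self) -> None:
        self.domains: dict[str, set[str]] = {}   # first term -> second terms
        self.links: set[frozenset[str]] = set()  # unordered pairs of linked first terms

    def add(self, first_term: str, second_terms: set[str]) -> None:
        new_terms = {first_term} | second_terms
        for stored_first, stored_second in self.domains.items():
            if stored_first == first_term:
                continue  # same domain being regenerated; nothing to link
            # Assumption: two domains are related when they share any term.
            if new_terms & ({stored_first} | stored_second):
                self.links.add(frozenset({first_term, stored_first}))
        self.domains[first_term] = set(second_terms)
```

With this sketch, storing a "hotel" domain containing "travel" and then a "flight" domain that also contains "travel" records a link between the two, while an unrelated "loan" domain stays unlinked.
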
JP2005358328A (priority date 2005-12-12, filing date 2005-12-12): Related word extraction device and related word extraction method. Status: Active. Published as JP4791169B2 (en).

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2005358328A JP4791169B2 (en) 2005-12-12 2005-12-12 Related word extraction device and related word extraction method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2005358328A JP4791169B2 (en) 2005-12-12 2005-12-12 Related word extraction device and related word extraction method

Publications (3)

Publication Number Publication Date
JP2007164378A JP2007164378A (en) 2007-06-28
JP2007164378A5 JP2007164378A5 (en) 2008-07-17
JP4791169B2 JP4791169B2 (en) 2011-10-12

Family

ID=38247213

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2005358328A Active JP4791169B2 (en) 2005-12-12 2005-12-12 Related word extraction device and related word extraction method

Country Status (1)

Country Link
JP (1) JP4791169B2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107357851B (en) * 2017-06-28 2020-01-31 国信优易数据有限公司 information processing method and system

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004062446A (en) * 2002-07-26 2004-02-26 Ibm Japan Ltd Information gathering system, application server, information gathering method, and program
JP2004234078A (en) * 2003-01-28 2004-08-19 Oki Electric Ind Co Ltd Information retrieval system
JP2004280488A (en) * 2003-03-17 2004-10-07 Hitachi Ltd Documents management method, and documents management device
