WO2007085187A1 - Procédé d'extraction de données, procédé de production de fichiers d'index et moteur de recherche - Google Patents

Procédé d'extraction de données, procédé de production de fichiers d'index et moteur de recherche Download PDF

Info

Publication number
WO2007085187A1
WO2007085187A1 PCT/CN2007/000244 CN2007000244W WO2007085187A1 WO 2007085187 A1 WO2007085187 A1 WO 2007085187A1 CN 2007000244 W CN2007000244 W CN 2007000244W WO 2007085187 A1 WO2007085187 A1 WO 2007085187A1
Authority
WO
WIPO (PCT)
Prior art keywords
index file
information
condition
sub
search
Prior art date
Application number
PCT/CN2007/000244
Other languages
English (en)
Chinese (zh)
Inventor
Pengxi Zhu
Original Assignee
Huawei Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co., Ltd. filed Critical Huawei Technologies Co., Ltd.
Publication of WO2007085187A1 publication Critical patent/WO2007085187A1/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures

Definitions

  • the present invention relates to the field of data search, and in particular, to a data search method, an index file generation method, and a search engine.
  • Search engines use automatic crawlers, such as web crawlers (WEBCRAWLERS), search spiders (SPIDER), online robots (ROBOT), to traverse nodes on a wide area network or local area network (INTRANET), using full-text search techniques on each node.
  • WEBCRAWLERS web crawlers
  • SPIDER search spiders
  • ROBOT online robots
  • the captured information is analyzed, indexed, classified, and the corresponding index database is established for the user to query.
  • the search program reads the information in the index database and matches the user keyword, retrieves the corresponding or related information, and outputs it to the user through a certain organization.
  • Search engines are generally based on full-text search. Most of the current search engines use B-tree data structures to store index information. Each index node stores information about each word and connects to the word containing the word.
  • the identifier (ID) of the document for example, the address of the document containing the "word” is sorted by word frequency, and is connected to the index node in the form of a singly linked list or a doubly linked list.
  • the "word” refers to the smallest unit that can express information in a search engine.
  • FIG. 1 shows the structure of an index file in the prior art in which a search engine completes the completion of some documents. All information for all documents is stored in a B-tree, each node 110 on the B-tree It is a word. The word "I” points to a linked list 121. The linked list 121 holds all the documents containing "I” and the number of occurrences of "I" in the document.
  • the text "10" in the document 1 has the document 10298, the document 786; the word " China, pointing to another linked list 122, the linked list 122 holds all the documents containing "China” and the document "China,, the number of occurrences, the number of words "China” in the document 10298, document 786; The number of occurrences is indicated by the word frequency.
  • the number of occurrences of "I” in document 786 is 4, and the number of occurrences of "China” in document 786 is 1 time.
  • the search speed needs to be further improved.
  • index file mode the user needs to search for documents containing both "I” and “China,”, and enter keywords, then you can retrieve the document containing "I” as 10298 and document 786, containing " China,, has document 786 and document 26543, and the documents containing "I” and “China” are the intersection of (10298, 786) and (786, 26543).
  • the process of calculating the intersection affects the speed of the search.
  • the index file includes index information of a plurality of documents, and the amount of data is large, in the prior art, when the advanced search is performed, multiple keywords are used to search in multiple times, which may affect the search speed, and Loading the index file into memory each time brings a lot of physical overhead.
  • Embodiments of the present invention provide a data search method, an index file generation method, and a search engine, which can improve the search speed when performing advanced search.
  • a data search method includes:
  • the first index file Retrieving in the first index file, obtaining an identifier of the information conforming to the first sub-condition; the first index file corresponding to the plurality of information that needs to be retrieved;
  • a method for generating an index file includes:
  • the I file is linked by the identification of the information.
  • a search engine includes:
  • a search module for obtaining information
  • An indexing module configured to generate a first index file for the information acquired by the search module; and generate a second independent index file for each information;
  • An advanced search module configured to retrieve, in the first index file, an identifier of the information that meets the first sub-condition; according to the second sub-condition, in the second index file corresponding to the identifier of the information that meets the first sub-condition Search to obtain the search results.
  • the embodiment of the present invention creates a first index file for each piece of information, and separately generates a separate second index file for each piece of information, the search in the first index file can be reduced when performing advanced search.
  • the number of times, in combination with the first index file and the second index file for retrieval since the amount of information of the independent second index file for a single information is small, the search time can be reduced, and it takes a long time without being performed.
  • the process of calculating the intersection of information can improve the search speed in advanced retrieval.
  • the embodiment of the present invention may only need to load the first index file once and then call the independent second index file for a single information. Since the second index file has a small amount of data, physical cost can be saved.
  • FIG. 2 is a flow chart of generating a first index file in an embodiment of the present invention
  • FIG. 3 is a flow chart of generating a second index file in an embodiment of the present invention.
  • FIG. 4 is a structural diagram of a separate index file generated for a single document in an embodiment of the present invention.
  • Figure 5 is a flow chart of a data search method in an embodiment of the present invention.
  • Figure 6 is a block diagram of a search engine in an embodiment of the present invention.
  • Figure 7 is a block diagram of the query module shown in Figure 6.
  • a separate second index file is separately generated for each information, and when the data search is performed, the first index file is combined with The second index file is searched to improve data search The speed of the cable.
  • the information to be retrieved generally includes electronic information distributed on a server of an infinite number of nodes, and actually has many storage methods, such as: information storage methods such as documents, web pages or database records.
  • information storage methods such as documents, web pages or database records.
  • the storage form of the information that needs to be retrieved by the web page is taken as an example, but the present invention is not limited to the information of this form.
  • the first index file can generally also be referred to as a large index file.
  • the index file needs to store the index information with a certain data structure.
  • the general full-text search uses the B-tree.
  • the present invention does not limit the data structure of the index file, and any other feasible data structure is feasible, such as: Binary Search Tree (BST), Balance Tree (Adelson-Velskii and Landis Tree, AVL Tree) , heap and other data structures.
  • BST Binary Search Tree
  • Balance Tree Adelson-Velskii and Landis Tree
  • AVL Tree Address-Velskii and Landis Tree
  • heap other data structures.
  • FIG. 2 takes a B-tree structure as an example, and shows a flow of an embodiment for generating a first index file.
  • Step S210 determining whether the indexed webpage needs to be built, if yes, proceeding to step S220; otherwise, indicating that all the webpages that need to be retrieved have been analyzed, and the index information of all the webpages has been stored in the first index file, therefore, entering In step S280, the process ends.
  • Step S220 determining whether the webpage contains the next word, if not, proceeding to step S280, ending the analysis of the webpage; if so, executing step S230;
  • Step S230 it is determined whether there is a node corresponding to the word in the B tree of the first index file, if yes, step S240 is performed, if not, step S270 is performed;
  • Step S240 determining whether the identifier (ID) of the webpage exists in the linked list pointed to by the word, if yes, executing step S250, if not, executing step S260;
  • Step S250 adding 1 to the word corresponding to the webpage in the linked list pointed to by the word;
  • Step S260 adding the identifier of the webpage in the linked list pointed to by the word, and setting the corresponding word frequency to 1;
  • Step S270 adding a node corresponding to the word in the B tree of the first index file, and Add the identifier of the web page to the linked list, and set the frequency to 1.
  • first index file An embodiment of a method of generating a first index file is given above, and of course, it can be implemented by other methods well known to those skilled in the art, and the present invention is not limited thereto.
  • the first index file may adopt different data structures, and then it may have different generation steps. Since it is a technology known in the art, it will not be described herein.
  • each web page it is also necessary to generate an independent second index file for each web page.
  • the second index file is independent of each other, and each web page corresponds to a second index file of its own.
  • a second index file describes and records the words contained in the corresponding web page and the number of occurrences of the word. If the B-tree structure is still used, then a second index file is a B-tree and is associated with the first index file by the identity of the corresponding web page.
  • a B tree is taken as an example to show a flow of generating a separate second index file for each web page.
  • Step S310 determining whether a webpage contains the next word, if not, ending the generating step of the second index file for the webpage; if so, executing step S320;
  • Step S320 determining whether there is a node corresponding to the word in the B tree of the second index file, if yes, executing step S330, if not, executing step S360;
  • Step S330 determining whether the identifier (ID) of the webpage exists in the linked list of the first index file pointed to by the word, if yes, executing step S340, if not, executing step S350; step S340, pointing the word The word corresponding to the webpage in the linked list of the first index file is incremented by one;
  • Step S350 adding the identifier of the webpage to the linked list of the first index file pointed by the word, and setting the corresponding word frequency to 1;
  • Step S360 Add a node corresponding to the word in the B tree of the second index file, and add the identifier of the web page in the linked list of the first index file, and the frequency of the word is 1.
  • the index information storage structure map shown in Fig. 4 is formed for one web page. Therein, a plurality of index nodes 410 are included, and each index node corresponds to one word.
  • the first index file or the second index file adopts a B-tree.
  • a data structure other data structures well known to those skilled in the art may be used as the data storage form of the index file, which is not limited by the present invention.
  • a second index file is generated independently for each webpage, so that the first index file with a large amount of data can be avoided from being called multiple times, so the data can be improved.
  • the speed of the search it is easier to use multiple tasks (multi-process) to query multiple index files simultaneously, especially when performing advanced searches.
  • the advanced search is generally applicable to a more detailed and accurate search of a web page, and the search condition may include a single condition or a comprehensive condition.
  • the general condition is generally a combination of multiple sub-conditions, each of which is a single condition, and the sub-conditions can be connected by "and", "or", and "not".
  • performing data retrieval based on the generated index file includes the following steps:
  • Step S510 Generate a first index file for all the information that needs to be retrieved.
  • Step S520 generating an independent second index file for each information.
  • the search may be a general search or an advanced search:
  • the first index file is retrieved according to the input query conditions to obtain the relevant information required.
  • the search results can be sorted based on the number of times the keyword appears in the message and the result is output.
  • Step S530 decomposing the input query condition
  • Step S540 Retrieving the first sub-condition in the first index file, and acquiring an identifier of the information that satisfies the first sub-condition;
  • Step S550 according to the second sub-condition, performing a search in the second index file corresponding to the information satisfying the first condition.
  • the multi-task may be used to perform the search in the second index file corresponding to the information.
  • the second index file corresponding to the information satisfying the previous sub-condition may also be retrieved in sequence.
  • the input query condition when performing an advanced search, can be first decomposed into multiple sub-conditions, and the first index is retrieved in the large index file (first index file). Sub-conditions, the identifiers of all the documents satisfying the first sub-condition are obtained; on this basis, the retrieval for the second sub-conditions is performed sequentially or simultaneously in the small index file (second index file) corresponding to the document identifier. And so on, if there is a next sub-condition, according to the next sub-condition, the search is performed in the second index file corresponding to the document satisfying the previous sub-condition, thereby obtaining a more accurate query result.
  • the search engine when searching for documents that contain both "I” and “China", the search engine first finds all the documents contained in the file, ie, documents 10298, 786, and then starts 2 tasks (multi-process or multi-thread) Searching for the keywords of "China” in the index files of document 10298 and document 786 respectively, for a single document 10298, etc., the index information is relatively small, the search time is small, and it can be quickly determined that the document 10298 does not contain "China” "The word, document 786 contains "China,,, so both "I” and “China,” the document is 786.
  • the existing search method has only one index file, so when searching for two or more sub-conditions, only the sub-conditions can be queued, the index file is called in turn, and each sub-condition is calculated.
  • the intersection of the information sets can be used to obtain the result of satisfying the comprehensive query condition.
  • the first index file with a large amount of data and the intersection of the calculation information sets cause the search speed to be slow. Since the embodiment of the present invention changes the prior art to have only one index file, a plurality of second index files are provided, so that each time the index file is loaded is small, it is not necessary to load the first data amount each time.
  • the index file does not occupy too much memory, and the process of combining and retrieving the document IDs retrieved by each condition is omitted, so that the retrieval time can be saved and the retrieval speed can be improved. Moreover, multiple processes can be easily searched to further speed up the retrieval.
  • Step S560 the data search method may further sort the search results according to the number of times the keyword appears in the information and output the result.
  • an embodiment of the search engine of the present invention may include the following units:
  • the search module 610 is configured to obtain information.
  • the search module 610 can discover and collect information on the network through the corresponding code of the program, and perform crawling analysis according to the webpage link to join the database to realize the acquisition of the electronic information.
  • the indexing module 620 is configured to generate a first index file for the information acquired by the search module; and generate a separate second index file for each information.
  • the indexing module 620 is mainly used to understand the information collected by the search module 610 and extract the index items therefrom to generate corresponding description and expression information to represent the information, establish an index table of information, and form a unified physical index. Database, which enables the structuring of unstructured information.
  • the indexing module 620 generates an independent second index file for a single piece of information on the basis of generating an index file, which can reduce the number of times the first index file is called and can be searched by using multiple tasks, thereby improving retrieval. speed.
  • the query module 630 performs a query according to the input keyword and outputs the query result.
  • the query module 630 quickly detects the information in the first index file and the second index file according to the user's query, or in the first index file, performs correlation evaluation of the information and the keyword, and sorts the results to be output. Achieve better user feedback mechanisms.
  • the query module 630 scans each word in the document to create an inverted file in units of words, and continuously matches the search words according to the frequency and probability of occurrence of the keyword in the document. The relevance of the query of the document, sorting the documents containing such search terms, and outputting the search results.
  • the interface unit 640 is configured to input keywords and display query results.
  • the query module includes:
  • a query condition decomposition module 631 configured to decompose the input query condition, obtain the first sub-condition and the second sub-condition, and provide the advanced search module to the advanced search module;
  • the advanced search module 632 is configured to retrieve, in the first index file, an identifier of the information that meets the first sub-condition; according to the second sub-condition, the second index file corresponding to the identifier of the information that meets the first sub-condition Search in to obtain the search results;
  • the general retrieval module 633 is configured to retrieve in the first index file according to the input query condition, Obtaining search results;
  • the sorting module 634 is for sorting the search results obtained by the advanced search module 632 or the general search module 633.
  • the advanced retrieval module 632 can include:
  • the first index file retrieval module 6321 is configured to retrieve, in the first index file, an identifier of the information that meets the first sub-condition;
  • a second index file retrieval module 6322 configured to perform, by using a multitasking, respectively, in a second index file corresponding to the identifiers of the plurality of information that meet the first sub-conditions; or sequentially, in the information that meets the first sub-condition The identifier is retrieved corresponding to the second index file.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

L'invention concerne un procédé d'extraction de données, un procédé de production de fichiers d'index et un moteur de recherche. Le procédé de l'invention comporte les étapes consistant à: produire un premier fichier d'index par rapport à l'ensemble des données à extraire; produire un second fichier d'index par rapport à chaque donnée individuelle; et obtenir les données correspondantes selon les mots-clés introduits. Lors du traitement d'une recherche avancée, le procédé comporte aussi les étapes consistant à: séparer la condition de requête introduite; extraire une première sous-condition dans le premier fichier d'index pour obtenir une identification de données répondant à la première sous-condition; et extraire une deuxième sous-condition dans le second fichier d'index qui correspond aux données répondant à la première sous-condition. L'invention comporte une pluralité de seconds fichiers d'index, ce qui facilite l'interrogation coopérative par un processus à tâches multiples en vue d'augmenter la vitesse de recherche. La quantité de seconds fichiers d'index sollicités étant faible, la consommation matérielle est réduite et le calcul d'un ensemble de données de recoupement est superflu, ce qui augmente la vitesse de la recherche avancée.
PCT/CN2007/000244 2006-01-25 2007-01-23 Procédé d'extraction de données, procédé de production de fichiers d'index et moteur de recherche WO2007085187A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200610002759.9 2006-01-25
CN2006100027599A CN1858737B (zh) 2006-01-25 2006-01-25 一种数据搜索的方法和系统

Publications (1)

Publication Number Publication Date
WO2007085187A1 true WO2007085187A1 (fr) 2007-08-02

Family

ID=37297646

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2007/000244 WO2007085187A1 (fr) 2006-01-25 2007-01-23 Procédé d'extraction de données, procédé de production de fichiers d'index et moteur de recherche

Country Status (2)

Country Link
CN (1) CN1858737B (fr)
WO (1) WO2007085187A1 (fr)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102981933A (zh) * 2012-12-18 2013-03-20 武汉邮电科学研究院 物联网感知层的数据增量备份方法及系统
CN109783508A (zh) * 2018-12-29 2019-05-21 亚信科技(南京)有限公司 数据查询方法、装置、计算机设备和存储介质
CN111190893A (zh) * 2018-11-15 2020-05-22 华为技术有限公司 建立特征索引的方法和装置
CN112100399A (zh) * 2020-09-09 2020-12-18 杭州凡闻科技有限公司 基于知识体系的知识图谱模型的创建方法及图谱检索方法
CN113535788A (zh) * 2021-07-12 2021-10-22 中国海洋大学 一种面向海洋环境数据的检索方法、系统、设备及介质

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101833511B (zh) * 2010-03-29 2012-06-06 瑞斯康达科技发展股份有限公司 数据管理方法、装置和系统
CN101930472A (zh) * 2010-09-09 2010-12-29 南京中兴特种软件有限责任公司 一种支持分布式数据库基于并行查询的方法
CN102567415B (zh) * 2010-12-31 2013-11-06 百度在线网络技术(北京)有限公司 一种数据库的控制方法和装置
CN102831174B (zh) * 2012-07-27 2015-06-17 福建富士通信息软件有限公司 结构化信息快速检索方法及系统
CN104536962A (zh) * 2014-11-11 2015-04-22 珠海天琴信息科技有限公司 一种用于嵌入式系统中的数据查询方法及装置
CN105653654B (zh) * 2015-12-28 2021-11-30 上海瀚之友信息技术服务有限公司 一种抽奖资格索引系统及方法
CN106411530B (zh) * 2016-10-25 2021-08-06 国家电网公司 一种弱口令的快速比对查找方法
CN107766414B (zh) * 2017-09-06 2020-06-12 北京三快在线科技有限公司 多文档交集获取方法、装置、设备及可读存储介质
CN107704599A (zh) * 2017-10-16 2018-02-16 杭州安恒信息技术有限公司 一种实现edr数据快速检索的方法与装置
CN111666370B (zh) * 2020-07-28 2022-04-22 中国人民解放军国防科技大学 面向多源异构航天数据的语义索引方法和装置

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1302412A (zh) * 1997-07-22 2001-07-04 微软公司 应用搜索结果的自然语言处理以改进整体精度的信息检索系统的设备和方法
CN1430165A (zh) * 2001-12-29 2003-07-16 财团法人资讯工业策进会 搜寻引擎关键字的关联建立系统及方法
WO2005022402A1 (fr) * 2003-08-29 2005-03-10 Vortaloptics, Inc. Procede, dispositif et logiciel de demande et de presentation de resultats de recherche

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1536483A (zh) * 2003-04-04 2004-10-13 陈文中 网络信息抽取及处理的方法及系统

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1302412A (zh) * 1997-07-22 2001-07-04 微软公司 应用搜索结果的自然语言处理以改进整体精度的信息检索系统的设备和方法
CN1430165A (zh) * 2001-12-29 2003-07-16 财团法人资讯工业策进会 搜寻引擎关键字的关联建立系统及方法
WO2005022402A1 (fr) * 2003-08-29 2005-03-10 Vortaloptics, Inc. Procede, dispositif et logiciel de demande et de presentation de resultats de recherche

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102981933A (zh) * 2012-12-18 2013-03-20 武汉邮电科学研究院 物联网感知层的数据增量备份方法及系统
CN102981933B (zh) * 2012-12-18 2015-05-13 武汉邮电科学研究院 物联网感知层的数据增量备份方法及系统
CN111190893A (zh) * 2018-11-15 2020-05-22 华为技术有限公司 建立特征索引的方法和装置
CN111190893B (zh) * 2018-11-15 2023-05-16 华为技术有限公司 建立特征索引的方法和装置
CN109783508A (zh) * 2018-12-29 2019-05-21 亚信科技(南京)有限公司 数据查询方法、装置、计算机设备和存储介质
CN109783508B (zh) * 2018-12-29 2021-04-09 亚信科技(南京)有限公司 数据查询方法、装置、计算机设备和存储介质
CN112100399A (zh) * 2020-09-09 2020-12-18 杭州凡闻科技有限公司 基于知识体系的知识图谱模型的创建方法及图谱检索方法
CN112100399B (zh) * 2020-09-09 2023-12-22 杭州凡闻科技有限公司 基于知识体系的知识图谱模型的创建方法及图谱检索方法
CN113535788A (zh) * 2021-07-12 2021-10-22 中国海洋大学 一种面向海洋环境数据的检索方法、系统、设备及介质
CN113535788B (zh) * 2021-07-12 2024-03-05 中国海洋大学 一种面向海洋环境数据的检索方法、系统、设备及介质

Also Published As

Publication number Publication date
CN1858737B (zh) 2010-06-02
CN1858737A (zh) 2006-11-08

Similar Documents

Publication Publication Date Title
WO2007085187A1 (fr) Procédé d'extraction de données, procédé de production de fichiers d'index et moteur de recherche
Li et al. Pfp: parallel fp-growth for query recommendation
US7930288B2 (en) Knowledge extraction for automatic ontology maintenance
CN104794242B (zh) 一种搜索方法
US9275128B2 (en) Method and system for document indexing and data querying
US20100121790A1 (en) Method, apparatus and computer program product for categorizing web content
CN105183884A (zh) 一种基于大数据技术的搜索引擎系统及搜索引擎方法
US20110264997A1 (en) Scalable Incremental Semantic Entity and Relatedness Extraction from Unstructured Text
Cheng et al. A web services discovery approach based on mining underlying interface semantics
Elshater et al. godiscovery: Web service discovery made efficient
CN110162522A (zh) 一种分布式数据搜索系统及方法
Adamu et al. A survey on big data indexing strategies
JP4207438B2 (ja) Xml文書格納/検索装置及びそれに用いるxml文書格納/検索方法並びにそのプログラム
CN112883030A (zh) 数据收集方法、装置、计算机设备和存储介质
US20120130999A1 (en) Method and Apparatus for Searching Electronic Documents
Carta et al. Iterative zero-shot llm prompting for knowledge graph construction
CN103226601B (zh) 一种图片搜索的方法和装置
CN107229714B (zh) 一种基于分布式数据库的全文搜索引擎
Cheng et al. MISDA: web services discovery approach based on mining interface semantics
US8484221B2 (en) Adaptive routing of documents to searchable indexes
US10380195B1 (en) Grouping documents by content similarity
CN111625570B (zh) 列表数据资源检索方法及装置
Ren et al. A summary of research on web data acquisition methods based on distributed crawler
US10235432B1 (en) Document retrieval using multiple sort orders
CN114238576A (zh) 数据匹配方法、装置、计算机设备和存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07702173

Country of ref document: EP

Kind code of ref document: A1