TW201519071A - Technical documents capturing and patents analysis system and method - Google Patents

Info

Publication number
TW201519071A
Authority
TW
Taiwan
Prior art keywords
technical
module
information
technical document
reading
Prior art date
Application number
TW103136595A
Other languages
Chinese (zh)
Inventor
Chao-Chin Chang
Iou-Ming Lou
Original Assignee
Paitrix Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Paitrix Co Ltd
Publication of TW201519071A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481 Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34 Browsing; Visualisation therefor
    • G06F16/345 Summarisation for human users
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/93 Document management systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/103 Formatting, i.e. changing of presentation of documents
    • G06F40/106 Display of layout of documents; Previewing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/22 Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • G06V10/235 Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition based on user input or interaction
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2203/00 Indexing scheme relating to G06F3/00 - G06F3/048
    • G06F2203/048 Indexing scheme relating to G06F3/048
    • G06F2203/04803 Split screen, i.e. subdividing the display area or the window area into separate subareas
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2203/00 Indexing scheme relating to G06F3/00 - G06F3/048
    • G06F2203/048 Indexing scheme relating to G06F3/048
    • G06F2203/04806 Zoom, i.e. interaction techniques or interactors for controlling the zooming operation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00 Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/11 Patent retrieval

Abstract

Disclosed are a system and method for capturing technical documents and analyzing patents. The system may comprise a capture system and a reading and commenting system. The capture system selects the associated drawings from a set of technical documents and presents them, together with important information, on a single screen for readers to review. The reading and commenting system lets readers classify, manage, and export/import the set of technical documents. After reviewing the documents, readers may post comments on an information sharing platform. Other materials collected or generated during the technical analysis of the documents may also be attached to the platform.

Description

Technical document capture and patent analysis system and method

The present invention relates to a system and method for capturing technical documents and analyzing patents.

U.S. Patent No. 6,058,417 discloses a Method and Apparatus for Information Presentation and Management in an Online Trading Environment. Based on information supplied by a user, including a description of an item offered for sale and the locations from which images associated with the item can be retrieved, images are obtained from multiple websites. Thumbnail images corresponding to the retrieved images are then created, aggregated, and placed on a web page for presentation at a remote site. Similarly, when a user submits a query, the thumbnail images of the items that satisfy the query are displayed. Each of the created thumbnail images is based on an image specified by a user.

That patent discloses an interface with pre-arranged thumbnail images for an online trading environment. It does not disclose an interface for analyzing technical documents or managing their reading, nor the corresponding techniques and content. It therefore gives insufficient guidance on arranging the drawings and content of technical documents or on capturing their images.

U.S. Patent No. 5,963,966 discloses a technique for automatically capturing technical documents for electronic review and distribution. The technique uses optical character recognition (OCR) to split a technical document into a drawing file and a text file, which are then filed in appropriate locations for storage or display. However, this OCR technique scans the entire document from beginning to end, so it consumes substantial system resources and yields low throughput.

In the early stage of technology development, an R&D team often spends considerable manpower and time searching for and reading material. For example, the method for generating patent analysis data disclosed in Taiwan Patent Publication No. 200417882 searches an external database to retrieve a number of patents or classification codes, and then uses the patents, the classification codes, and data such as feature tables to analyze the patents and the relationships among them.

In the System and Method for Mining and Statistical Analyzing Patent Information of Taiwan Patent No. 567432, shown in FIG. 1, a user selects an analysis type and sets analysis conditions through the user interface of a client computer. An application server converts the analysis conditions into query conditions in a specified format. The database is searched according to the query conditions, and the query results are returned through the application server to the client computer, which displays the analysis results.

In the analysis of patent documents, as illustrated in the example of FIG. 2, information about the technical material is usually presented as text in an itemized list that includes, for example, the document title, document number, filing date, laid-open date, publication date, announcement date, author (inventor), and owner. To learn more, a reader must open each entry and read the abstract, or even open the attachment to read the body text and drawings, before knowing whether the content is relevant to the subject of interest. When a large volume of material must be processed, this costs considerable time and becomes a bottleneck for personnel performing large-scale technical (patent) document analysis.

How to integrate documents from multiple technical databases effectively, and how to analyze the content and trends of technical material quickly and effectively, is therefore one of the most important problems at present.

The techniques above do not disclose any relational use of external database data. They only disclose fetching external database data back to a local database or server for analysis. Nor do they provide a screen from which readers can quickly obtain the relevant information, or an interactive report and comment platform on which readers can share what earlier readers learned from the technical material and thereby read it more efficiently.

The disclosed embodiments may provide a management system and method for technical documents that supports classifying, capturing, reading, commenting on, and sharing information, as well as exchanging opinions. In these embodiments, a set of technical document data is preprocessed: the associated drawings in the set are selected and combined with the captured important information on a single screen, so that readers can read the material on a graphical screen. This reduces repeated operations on resource-consuming graphical file formats (such as PDF or TIFF), and the material can also be technically classified, managed, and exported/imported. Readers can further record their comments on the technical material on an information sharing platform after reading it.

In one embodiment, a technical document capture system is disclosed. The capture system includes a verification screening unit, a capture selection module, and a data capture module. The verification screening unit uses the relevance of a set of technical document data as the criterion for setting the capture parameters of that data. The capture selection module uses the capture parameters set by the verification screening unit to select related information in the technical document data. The data capture module extracts the selected important information and presents it on a single screen, together with the related information selected by the capture selection module, for the reader to read.

In another embodiment, a reading system for technical documents is disclosed. The reading system includes at least a graphical screen, a classification module, a management module, and an export/import module. Through the graphical screen, the classification module lets the reader classify a set of graphically presented technical document data by technology or by product and store the classification in a system. The management module lets the reader, after assessing the technical data, archive it in the system according to its attributes or delete selected technical data from the system. The export/import module lets the reader export selected data from the system or import it into the system.
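By way of non-limiting illustration only, the three modules of the reading system can be pictured as operations on a small in-memory store. The class name `ReadingStore`, its method names, and the JSON export format are assumptions made for this sketch and do not appear in the disclosure; the category labels follow the sub-category numerals A-1 and B-1 used in the drawings.

```python
import json
from collections import defaultdict
from typing import List


class ReadingStore:
    """Hypothetical stand-in for the classification, management, and export/import modules."""

    def __init__(self) -> None:
        # category name -> set of document identifiers filed under it
        self._by_category = defaultdict(set)

    def classify(self, doc_id: str, category: str) -> None:
        """Classification module: file a document under a technical or product category."""
        self._by_category[category].add(doc_id)

    def delete(self, doc_id: str) -> None:
        """Management module: remove a selected document from every category."""
        for ids in self._by_category.values():
            ids.discard(doc_id)

    def documents_in(self, category: str) -> List[str]:
        """Return the documents archived under one category, in sorted order."""
        return sorted(self._by_category.get(category, ()))

    def export_json(self) -> str:
        """Export module: serialize the whole classification as JSON text."""
        data = {c: sorted(ids) for c, ids in self._by_category.items()}
        return json.dumps(data, sort_keys=True)

    @classmethod
    def import_json(cls, text: str) -> "ReadingStore":
        """Import module: rebuild a store from previously exported JSON text."""
        store = cls()
        for category, ids in json.loads(text).items():
            for doc_id in ids:
                store.classify(doc_id, category)
        return store
```

A store exported with `export_json` and reloaded with `import_json` holds the same classification, which is the round trip the export/import module is described as providing.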

In addition to classification, archiving, deletion, and export/import, the reading system may further provide a hyperlink module that links to a database in the system to obtain the corresponding documents from that database.

In another embodiment, a comment system for technical documents is disclosed. The comment system includes at least a comment topic unit, a reading comment unit, and an attachment unit. The comment topic unit lets readers enter comment topics and displays the different topics. The reading comment unit lets readers record their comments after reading a technical document and displays the comments of different readers. The attachment unit attaches or generates other materials that readers collect or produce while analyzing and reading.
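As a purely illustrative sketch, and not the disclosed implementation, the three units of the comment system could be modeled with two small data classes. The names `Comment` and `CommentBoard`, the field names, and the idea of storing attachments as file names are all assumptions introduced here.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Dict, List


@dataclass
class Comment:
    """One reader's comment, with the kind of fields a comment entry might carry."""
    written_on: date
    reader: str                      # reader's name or code
    text: str                        # the comment itself
    attachments: List[str] = field(default_factory=list)  # hypothetical: file names


@dataclass
class CommentBoard:
    """Hypothetical container playing the role of the information sharing platform."""
    topics: Dict[str, List[Comment]] = field(default_factory=dict)

    def add_topic(self, topic: str) -> None:
        """Comment topic unit: register a topic so that it can be displayed."""
        self.topics.setdefault(topic, [])

    def add_comment(self, topic: str, comment: Comment) -> None:
        """Reading comment unit: record a reader's comment under a topic."""
        self.topics.setdefault(topic, []).append(comment)

    def attach(self, topic: str, index: int, name: str) -> None:
        """Attachment unit: attach collected or generated material to a comment."""
        self.topics[topic][index].attachments.append(name)

    def comments_for(self, topic: str) -> List[Comment]:
        """Return every reader's comments on a topic, oldest first."""
        return list(self.topics.get(topic, []))
```

Keeping comments grouped by topic means that displaying the different topics and displaying the comments of different readers are both simple lookups.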

In another embodiment, a technical document capture method is disclosed. The capture method includes: using the relevance of a set of technical document data as the criterion for setting the capture parameters of that data; selecting the association information in the technical document data using the set capture parameters; and extracting the selected important information and presenting it on a single screen, in a suitable arrangement, together with the selected association information. Users can thus read the technical documents with better visual presentation than in the prior art, which improves the efficiency of technical document analysis.

In another embodiment, a method for reading technical documents is disclosed. The reading method includes: extracting the important information and related information of a set of technical document data and presenting them on a screen for the reader to read; and, through the screen, performing any combination of classifying, archiving, deleting, and exporting/importing the set of technical document data.

In another embodiment, a method for reading and commenting on technical documents is disclosed. This method adds to the reading method of the preceding embodiment a step of providing an information sharing platform. The platform lets readers record their comments after reading the technical documents, attach other materials collected or generated during analysis and reading, and view the comments of different readers.

In the embodiments above, the capture system and the reading system may be integrated into a technical document capture and reading system. The reading system and the comment system may likewise be integrated into a technical document reading and commenting system, and the capture, reading, and comment systems may all be integrated into a technical document capture, reading, and commenting system.

When applied to design patent analysis, the invention can produce strong results. For example, by displaying the representative drawings of a set of patent documents, the invention lets a user speed up the screening of a patent map during the initial search stage. This reduces the user's need to open resource-intensive image file formats and improves the utilization of system resources.

The above and other objects and advantages of the present invention are described in detail below with reference to the following drawings, the detailed description of the embodiments, and the claims.

300‧‧‧Capture system

301‧‧‧Verification screening unit

301a‧‧‧Set capture parameters

302‧‧‧Capture selection module

303‧‧‧Data capture module

310‧‧‧Technical document data

320‧‧‧Associated drawing

330‧‧‧Screen

330a‧‧‧Graphical screen

350‧‧‧Hyperlink module

3201, 3202, 3203‧‧‧Relative positions

410‧‧‧Time selection module

420‧‧‧Qualification selection module

421‧‧‧Rectangle

422‧‧‧Region

423‧‧‧Rectangle after capture-range adjustment

424‧‧‧Intersection of two boundaries of the original image of the technical document

430‧‧‧Application status selection module

500‧‧‧Reading system

501‧‧‧Classification module

502‧‧‧Management module

503‧‧‧Export/import module

504‧‧‧Zoom module

520‧‧‧Graphical screen

800‧‧‧Comment system

801‧‧‧Comment topic unit

802‧‧‧Reading comment unit

803‧‧‧Attachment unit

804‧‧‧Attorney audit unit

901a‧‧‧Different input topics

902a‧‧‧Date

902b‧‧‧Reader's name or code

902c‧‧‧Abstract of the document read

902d‧‧‧Comments on the document read

950‧‧‧Blog-style display

1000‧‧‧Technical document capture and reading system

1100‧‧‧Technical document reading and commenting system

1200‧‧‧Technical document capture, reading, and commenting system

1310‧‧‧Use the relevance of a set of technical document data as the criterion for setting the capture parameters of that data

1320‧‧‧Select the association information in the technical document data using the set capture parameters

1330‧‧‧Extract the selected important information and present it on a screen together with the selected association information

1410‧‧‧Extract the important information and association information of a set of technical document data and present them on a screen for the reader to read

1420‧‧‧Through the screen, perform any combination of classifying, archiving, deleting, and exporting/importing the set of technical document data

1510‧‧‧Provide an information sharing platform on which readers record comments after reading the technical documents, attach other materials collected or generated during analysis and reading, or view the comments of different readers

164‧‧‧Display module

165‧‧‧Capture module

166‧‧‧Tracking module

167‧‧‧Task assignment module

169‧‧‧Recording module

168‧‧‧Report generation module

1691‧‧‧Search rule simulator

1692‧‧‧Scheme menu

1693‧‧‧Optical character recognition (OCR) module

1695‧‧‧Zoom module

1694‧‧‧Indicator module

1696‧‧‧Interface

171, 172‧‧‧Screen areas

1721, 1722, 1723, 1724‧‧‧Information blocks

183, 184, 185‧‧‧Information blocks

191‧‧‧Window

1911‧‧‧Option

192‧‧‧Information block

2001, 2002, P1, P2, P3‧‧‧Patent documents

211‧‧‧Screen area

X1, X2, X3, X4‧‧‧Basic units

2110, 2111‧‧‧Text content

A, B, C‧‧‧Data categories

A-1 to A-3, B-1 to B-3, C-1 to C-3‧‧‧Data sub-categories

G, H‧‧‧Databases

J‧‧‧Short axis

K‧‧‧Long axis

FIG. 1 is an exemplary flowchart illustrating the operation of a method for generating patent analysis data.

FIG. 2 shows a list mode used to present information about technical documents in a conventional example of patent document analysis.

FIG. 3A is an exemplary schematic diagram of a technical document capture system, consistent with certain disclosed embodiments.

FIG. 3B is an exemplary schematic diagram of a graphical screen, consistent with certain disclosed embodiments.

FIG. 4A is an exemplary schematic diagram further illustrating the selection modules included in the verification screening unit, consistent with certain disclosed embodiments.

FIG. 4B illustrates the capture parameters using a patent document as an example, consistent with certain disclosed embodiments.

FIG. 5 is an exemplary schematic diagram of a reading system for technical documents, consistent with certain disclosed embodiments.

FIG. 6 is an exemplary schematic diagram illustrating the classification of technical data through the classification module, consistent with certain disclosed embodiments.

FIG. 7 is an exemplary schematic diagram illustrating the archiving of technical data, through the management module, into two sub-databases of the system, consistent with certain disclosed embodiments.

FIG. 8 is an exemplary architecture diagram of a comment system for technical documents, consistent with certain disclosed embodiments.

FIG. 9 is an exemplary schematic diagram of an implementation of the units of the comment system, consistent with certain disclosed embodiments.

FIG. 10 is an exemplary schematic diagram of a technical document capture and reading system, consistent with certain disclosed embodiments.

FIG. 11 is an exemplary schematic diagram of a technical document reading and commenting system, consistent with certain disclosed embodiments.

FIG. 12 is an exemplary schematic diagram of a technical document capture, reading, and commenting system, consistent with certain disclosed embodiments.

FIG. 13 is an exemplary flowchart of a technical document capture method, consistent with certain disclosed embodiments.

FIG. 14 is an exemplary flowchart of a method for reading technical documents, consistent with certain disclosed embodiments.

FIG. 15 is an exemplary flowchart of a method for reading and commenting on technical documents, consistent with certain disclosed embodiments.

FIG. 16 is a schematic diagram of another embodiment of the present invention.

FIGS. 17A to 17D are schematic diagrams of document units at different display ratios.

FIG. 18 is a schematic diagram of a drag-and-drop operation.

FIG. 19 is a schematic diagram of an example interface for commenting on a patent document.

FIG. 20 is a schematic diagram of a tracking list used when analyzing multiple patent documents.

FIG. 21 is a schematic diagram of generating a drawing during optical character recognition (OCR).

The disclosed embodiments may draw on multiple source databases, such as technical literature databases. After a set of original technical document data is analyzed by the system, it is regrouped into data groups with corresponding associations in order to build an internal database. Once the association information of the set of technical documents (for example, patent documents, papers, or other related literature) has been extracted, readers can read the data on a graphical screen and can classify, archive, delete, and export/import it. Readers can also record comments after reading, view the comments of different readers, and attach or generate other materials collected or produced during analysis and reading. In this way, client-side users can analyze the content and trends of technical data quickly and effectively.

FIG. 3A is an exemplary schematic diagram of a technical document capture system, consistent with certain disclosed embodiments. As shown in FIG. 3A, the capture system 300 extracts association information 320 from technical document data 310, such as a patent document or a paper. The extracted association information 320 may include images or text. A graphical screen may also be provided, usually in a regular arrangement, on which the reader reads this related information. The association information 320 is the result of relevance operations and analysis performed on at least one internal database or technical document.

Relevance refers to the relationship between items of data (including text and drawings) in a technical document or in the internal database, for example the relationship between an application number and a filing date, or between a journal title and an inventor.

A relevance operation is any computation, comparison, or filtering among data in technical documents or the internal database, for example Boolean logic or weighting. Taking patent documents as an example, a high-relevance filter may select information such as the patent number, filing date, inventor, and owner of the same patent. A low-relevance filter may select information such as the inventor of a given patent number and may also select information such as the inventors of other patent numbers.
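The weighting and filtering described above can be sketched as follows. This is only one possible reading, offered for illustration: the `Record` fields, the integer weights, and the two thresholds are assumptions chosen for the example and are not values given in the disclosure.

```python
from dataclasses import dataclass
from typing import List, Mapping


@dataclass(frozen=True)
class Record:
    """Hypothetical record of fields taken from one patent document."""
    patent_no: str
    filing_date: str
    inventor: str
    owner: str


def relevance_score(query: Record, candidate: Record, weights: Mapping[str, int]) -> int:
    """Weighting: add up the weights of the fields on which the two records agree."""
    return sum(
        weight
        for name, weight in weights.items()
        if getattr(query, name) == getattr(candidate, name)
    )


def filter_by_relevance(
    query: Record,
    candidates: List[Record],
    weights: Mapping[str, int],
    threshold: int,
) -> List[Record]:
    """Filtering: keep the candidates whose score reaches the threshold."""
    return [c for c in candidates if relevance_score(query, c, weights) >= threshold]


# Assumed weights: a shared patent number counts most, a shared owner least.
WEIGHTS = {"patent_no": 6, "inventor": 3, "owner": 1}
HIGH_RELEVANCE = 6  # only records of the same patent pass
LOW_RELEVANCE = 3   # records of other patents by the same inventor also pass
```

With these assumed weights, raising the threshold narrows the result to information from the same patent, while lowering it also admits other patents that share an inventor, which mirrors the high-relevance and low-relevance filters in the text.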

In the example of FIG. 3A, the capture system 300 includes a verification screening unit 301 and a capture selection module 302. The verification screening unit 301 uses the relevance of the technical document data 310 as the criterion for setting the capture parameters of that data. The capture selection module 302 uses the capture parameters 301a finally set by the verification screening unit 301 to extract the association information 320 from the technical document data. Each associated drawing is the main drawing of a technical document, meaning either the front-page drawing or a drawing that represents the technical features. In this example, the term "association information" may refer to information that is extracted, calculated, converted, and/or derived from one or more fields of a technical document for use in processing that document. The term "related information" may refer to a field of the technical document, or to any part of the document that indicates an attribute, a parameter, a state, or other information about the document. For example, the association information may be the estimated starting position of a drawing on the front page of a patent file, while related information such as the patent number and owner information fields can be used to help determine that starting position. In the patent example, the related information is used to obtain the association information, the association information is in turn used to obtain the associated drawing, and the associated drawing is itself a kind of related information of the patent document.

The capture system 300 may further include a data capture module 303 that selects important information, such as the title, assignee, inventor, document number, or a specific mark of a patent document. This information is presented to the reader on a screen 330 together with the association information 320 captured by the capture selection module 302. An example of the screen 330 is the graphical screen 330a of FIG. 3B.

The example graphical screen 330a shows eight items of related information from a set of technical literature data together with their corresponding important information. For instance, the eight items may be the front-page main figures of eight US patents, and the corresponding important information may be the US patent number, the title, and/or the assignee. The graphical screen 330a may also arrange the related information and its corresponding important information in a regular layout, such as a table or a list. The layout may be continuous, as in a web page that the reader browses by scrolling, or discontinuous, displayed page by page with a number of images per page set according to the reader's habits. The graphical screen (or at least one document unit) may also provide a hyperlink module, indicated by reference numeral 350, that links to the system database to obtain further related information, such as the full text of the patent document. When the reader moves the cursor near one item of related information, a window pops up showing the abstract, title, author (inventor), owner (assignee), publication (announcement) date, or any combination of these, for the corresponding technical document.

The related (including association) information and/or corresponding important information of any block can be defined as a document unit. The related information in a document unit may be the main figure or a characteristic figure on the front page of a patent, and the corresponding important information may be the title, document (patent) number, author (inventor), owner (assignee), application serial number, filing/publication/announcement date, abstract, or any combination of these. Multiple document units are arranged as an m×n image table, where m and n are both integers greater than 1. The preferred range of m is 3 to 7, and the preferred range of n is 3 to 1000. For a better visual experience, n is preferably between 8 and 30.

For example, for a web page displaying 100 patent documents, the layout may be 4×25 document units, 5×20 document units, or any other regular arrangement. If the number of patent documents cannot fill an m×n grid exactly, for example 97, a 5×20 arrangement leaves 3 vacancies. Such an arrangement still falls within the m×n format. The corresponding important information can be placed close to the related information, for example above, below, to the right, or to the left of the related (association) information. Alternatively, when the cursor rests on a document unit or on its related (including association) information, a window pops up showing part of the corresponding important information. For example, in FIG. 3B the displayed important information includes the title, announcement number, and announcement date. When the cursor rests on the related information, the pop-up window shows the title, abstract, priority/filing/publication/announcement date, patent family, author (inventor), assignee, or any combination of these.
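
The grid arithmetic in this example can be sketched in a few lines of Python. This is only an illustration: the function name `grid_layout` is hypothetical, and treating m as the number of units per row is an assumption, since the text does not say which dimension m denotes.

```python
import math


def grid_layout(num_docs: int, m: int) -> tuple[int, int, int]:
    """Return (m, n, vacancies) for num_docs document units.

    Assumption: m is the number of document units per row and n is the
    number of rows needed; the description only speaks of "m x n".
    """
    if m < 1 or num_docs < 0:
        raise ValueError("m must be >= 1 and num_docs must be >= 0")
    n = math.ceil(num_docs / m)      # rows needed to hold every document
    vacancies = m * n - num_docs     # empty cells left in the last row
    return m, n, vacancies


if __name__ == "__main__":
    print(grid_layout(100, 4))  # the 4x25 layout, no vacancies
    print(grid_layout(97, 5))   # the 5x20 layout with 3 vacancies
```

With 97 documents and m = 5 the function yields a 5×20 grid with 3 vacancies, matching the figures quoted in the text.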

It is worth noting that zoom-in and zoom-out operations can also be performed on document units. There are three ways to implement them. First, when the cursor rests on a document unit, a larger image of that unit can be displayed. Second, the system determines the display ratio of the document unit's image from the values of m and/or n. Third, a zoom module lets the user adjust the display size of the document unit's image; see the description of FIG. 5 for details.

If the technical literature data 310 consists of patent documents, the capture system 300 may use criteria such as the patent's dates, application status, qualification, or content to capture and analyze patent information. Accordingly, the verification screening unit 301 may be decision logic that uses the qualification, application status, dates, content, and so on of the set of patent documents as the basis for setting their capture parameters. As shown in FIG. 4A, the verification screening unit 301 may further include a time selection module 410, a qualification selection module 420, an application status selection module 430, or any combination of these modules. FIG. 4A is an example schematic diagram of the verification screening unit 301 containing these selection modules, consistent with certain disclosed embodiments.

With patent documents as the technical literature data 310, the time selection module 410 in the verification screening unit 301 of FIG. 4A may use the date-related parts of a patent document, such as the filing date, publication date, announcement date, or priority date, as the basis for setting the capture parameters of a set of patent documents according to different periods. The capture parameters set in this way are, for example, the capture starting point, the capture range, and the relative offset of the technical literature data 310. FIG. 4B illustrates these capture parameters for one technical document, consistent with certain disclosed embodiments.

Referring to FIG. 4B, the capture starting point and capture range may be default values. For example, the intersection of two borders of the document's original image, indicated by reference numeral 424, or the geometric center of the document, may be set as the capture starting point or coordinate origin (0, 0). The capture range may be preset, for example, as a rectangle 421 of length X and width Y, or as a region 422 formed by any geometric figure with major axis K and minor axis J.

Referring to FIG. 4B, suppose the technical document is a patent document and its filing date is the criterion. The intersection of two borders of the document's original image is taken as the origin, and the default capture range is, for example, a 330×400 pixel rectangle. When the filing date falls in 2006, the capture starting point may be set at a relative offset Δd of +10 mil from the coordinate origin 420, with the capture range changed by ΔX = +10 pixels along the X axis and ΔY = +20 pixels along the Y axis. The capture range thus becomes a rectangle 423 of length X+ΔX and width Y+ΔY. If only a certain filing date is used as the criterion, the capture starting point is, for example, +5 mil from the intersection of the two borders of the original image, and the capture range is a 310×420 pixel rectangle. ΔX and ΔY may be positive or negative. One mil equals 1/1000 inch.
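
The adjustment for a 2006 filing date can be written as a small Python sketch. The names `CaptureParams`, `DEFAULT`, `adjust`, and `mil_to_inch` are hypothetical and introduced only for illustration; the 330×400 default and the +10 mil, +10 pixel, +20 pixel adjustments are taken from the example, and the starting offset of 0 mil is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CaptureParams:
    """Hypothetical container for the capture parameters of FIG. 4B."""
    offset_mil: float  # relative offset of the starting point from the origin
    x_px: int          # length X of the capture rectangle, in pixels
    y_px: int          # width Y of the capture rectangle, in pixels


# Default capture range from the example; zero starting offset is assumed.
DEFAULT = CaptureParams(offset_mil=0.0, x_px=330, y_px=400)


def adjust(base: CaptureParams, d_mil: float, dx_px: int, dy_px: int) -> CaptureParams:
    """Apply a relative offset and range deltas; deltas may be negative."""
    return CaptureParams(base.offset_mil + d_mil, base.x_px + dx_px, base.y_px + dy_px)


def mil_to_inch(mil: float) -> float:
    """One mil is 1/1000 inch."""
    return mil / 1000


if __name__ == "__main__":
    filed_2006 = adjust(DEFAULT, d_mil=10, dx_px=10, dy_px=20)
    print(filed_2006, mil_to_inch(filed_2006.offset_mil))
```

Under these assumptions a 2006 filing date moves the starting point by 10 mil (0.01 inch) and enlarges the rectangle from 330×400 to 340×420 pixels.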

Similarly, the qualification selection module 420 may be a qualification verification module for patent documents. It may use the parts of a patent document related to persons, territory, classification, or qualification as the basis for setting the capture parameters. Examples are the inventor, the assignee, the agent, technology or field classifications such as the International Patent Classification (IPC), US classification (USC), European classification (ECLA), Japanese classification (FI/F-term), and the Locarno Classification for industrial designs, the examiner, or the number of any of these. The capture parameters set in this way are, as before, the capture starting point, the capture range, and the relative offset. Taking the assignee as the criterion, if the assignee is the Industrial Technology Research Institute (ITRI), the relative offset Δd may be +20 mil from the coordinate origin, with ΔX of, for example, +50 pixels and ΔY of, for example, +60 pixels. ΔX and ΔY may be positive or negative.

Similarly, the application status selection module 430 may set or adjust the capture parameters of a patent document based on its application status (for example published, notice of allowance, number of divisional applications, or number of continuation applications), its Information Disclosure Statement (IDS), the number of family members, the word count of the abstract, or the country of application.

After the capture parameters have been set by the time selection module 410, the qualification selection module 420, the application status selection module 430, or any combination of them, the parameters from each selection module can be readjusted with different proportional weights, or combined through Boolean relations between capture regions (for example intersection (AND) or union (OR)), to set the final capture parameters of the patent document. Taking the earlier example of a 2006 filing date and ITRI as assignee, the total relative offset Δd is the offset for a 2006 filing date multiplied by the filing-date weight W1, plus the offset for ITRI as assignee multiplied by the assignee weight W2, where W1 + W2 = 100%. When W1 = W2 = 50%, the total relative offset Δd is 15 mil.

In the same way, a weight W3 applies to the capture range for a 2006 filing date and a weight W4 to the capture range for ITRI as assignee. When W3 = 40% and W4 = 60%, the total ΔX of the capture range is 34 and the total ΔY is 28. Therefore, for a patent document filed by ITRI in 2006, the capture starting point is +15 mil from the intersection of the two borders of the original image, and the capture range is a 334×328 pixel rectangle. In this way the capture starting point and the size of the capture range of a patent document can be determined.
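
The weighting step can be illustrated with a short Python sketch. The helper `weighted_sum` is a hypothetical name, not part of the disclosed system. The sketch reproduces the Δd and ΔX totals from the per-criterion values given earlier (+10 mil and +10 pixels for a 2006 filing date, +20 mil and +50 pixels for ITRI as assignee).

```python
def weighted_sum(values: list[float], weights: list[float]) -> float:
    """Combine per-criterion values with weights that must sum to 100%."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1.0 (100%)")
    return sum(v * w for v, w in zip(values, weights))


if __name__ == "__main__":
    # Relative offset: filing date 2006 (+10 mil), assignee ITRI (+20 mil),
    # with W1 = W2 = 50%.
    total_offset_mil = weighted_sum([10, 20], [0.5, 0.5])

    # Capture range along X: +10 px (filing date) and +50 px (assignee),
    # with W3 = 40% and W4 = 60%.
    total_dx_px = weighted_sum([10, 50], [0.4, 0.6])

    print(total_offset_mil, total_dx_px)
```

The same helper applies to any number of selection modules, provided each contributes one value and the weights sum to 100%.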

According to the present invention, after reading the graphical screen, or after following the hyperlink module 350, the reader may classify, archive, delete, or export/import the data. FIG. 5 is an example schematic diagram of a reading system for technical documents, consistent with certain disclosed embodiments.

Referring to the embodiment of FIG. 5, the reading system 500 may include a graphical screen 520 with m×n document units, where m and n are positive integers, usually both greater than 1. The reading system 500 may further include a classification module 501, a management module 502, an export/import module 503, a zoom module 504, or any combination of these modules. The graphical screen 520 can be obtained through the capture system 300. Through the graphical screen 520, the classification module 501 lets the reader classify a set of graphical technical literature data by technology or by product and store the classification in a system. The management module 502 lets the reader, after assessing the technical data, archive it in the system according to its attributes or delete selected technical data from the system. The export/import module 503 exports or imports the selected data. The zoom module 504 enlarges or reduces document units so that their images can be displayed at different sizes. Each module in the reading system 500 can be implemented in many ways.

In the reading system 500, the reader completes the classification of technical data by assigning it, according to its attributes, to detailed classification items. FIG. 6 is an example of classifying technical data through the classification module 501, consistent with certain disclosed embodiments. In the example of FIG. 6, the technical data is divided into three classes A, B, and C, which can be subdivided as needed. For example, class A is subdivided into A-1, A-2, and A-3, and class C into C-1 and C-2.

After assessing the technical data, the reader can also use the management module 502 to archive the data in sub-databases of the system according to its features. FIG. 7 shows technical data archived in two sub-databases G and H of the system, consistent with certain disclosed embodiments.

According to the present invention, the reader can record comments after reading a technical document, and the comments of different readers can be displayed. Accordingly, another embodiment of the present disclosure provides a comment system for technical documents. This comment system is an information-sharing platform for reading reports. It lets readers record their comments after reading a technical document, displays the comments of different readers, and can attach or generate other material collected or produced during analysis and reading, such as the prosecution history (file wrapper) of a patent document, news events, transaction information, and lawsuit information.

As shown in FIG. 8, in an embodiment of the present disclosure the comment system 800 includes at least a comment topic unit 801, a reading comment unit 802, an attachment unit 803, and/or an attorney audit unit 804. Through the comment topic unit 801, the reader can enter a comment topic, and different comment topics can be displayed on the comment topic unit 801. Through the reading comment unit 802, the reader can record comments after reading a technical document, and the comments of different readers can be displayed on the reading comment unit 802. Through the attachment unit 803, other material collected or produced by the reader during analysis and reading can be attached. The attorney audit unit 804 gives an attorney an interface for reviewing, for example by commenting in the comment topic unit 801 or the reading comment unit 802, so that analysis tables or data can be used when seeking legal advice or taking legal action. The interface can be designed so that an authorized person or attorney logs in with a specific account to inspect data or comments and to confirm, communicate, or interact. In litigation in particular, the relevant content can be protected under attorney-client privilege, fully safeguarding the client's legal rights.

Each unit in this comment system can be implemented in many ways. In the example of FIG. 9, the comment system 800 can automatically generate the date and the reader's name or code for a topic A on the reading comment unit 802. In a comment editing area 902a the reader can record comments on the technical document, such as a summary of the document and the content of the review. Comments from different readers can also be displayed on the reading comment unit 802. For example, the list for topic A contains three readers' comments, each of which may include basic information about the comment and the comment content, and may be displayed in a blog-like form. On the comment topic unit 801, the reader can enter a topic, and the entered topics A, B, C, and so on are listed. Other material collected or produced during analysis and reading can be added as attachments on the attachment unit 803, either as attached files or as hyperlinks. In the attachment list, attachment 1 is, for example, the prosecution history of a patent document, and attachment 2 is, for example, a news event or transaction information.

According to the present invention, the capture system 300 and the reading system 500 described above can be integrated into a capture and reading system 1000 for technical documents, as shown in FIG. 10. Similarly, the reading system 500 and the comment system 800 can be integrated into a reading and comment system 1100, as shown in FIG. 11. In one particular embodiment, the reading and comment system 1100 may include two comment systems 800: one is available to all system users for sharing all information, while the other is access-controlled and available only to specific system users. The capture system 300, the reading system 500, and the comment system 800 can also be integrated into a capture, reading, and comment system 1200, as shown in FIG. 12. The systems can be integrated according to actual needs and applications.

Following FIG. 3A and the description above, the present invention also discloses a capture method for technical documents, as in the example flow of FIG. 13. Referring to FIG. 13, first, the relevance of technical literature data is used as the basis for setting the capture parameters of that data, as shown in step 1310. Then, the association information in the technical literature data is selected using the set capture parameters, as shown in step 1320. Finally, the selected important information is captured and presented on a screen together with the selected association information, as shown in step 1330.

As mentioned above, the capture parameters include at least the capture starting point, the capture range, and the relative offset of the set of technical literature data. The screen can also provide a hyperlink function that links to the system database to obtain further related information, or the complete document or full text of the technical document (or patent).

Following FIG. 5A and the description above, the present invention also discloses a reading method for technical documents, as in the example flow of FIG. 14. Referring to FIG. 14, first, the important information and association information of a set of technical literature data are captured and presented on a screen for the reader, as shown in step 1410. Then, through this screen, the set of technical literature data can be classified, archived, deleted, exported/imported, or processed by any combination of these functions, as shown in step 1420.

Following step 1420 of FIG. 14, the present invention may further include a step of providing an information-sharing platform. The platform lets the reader record comments after reading a technical document and attach other material collected or produced during analysis and reading, or displays the comments of different readers, as in step 1510 of the example flow of FIG. 15. Accordingly, the present invention also discloses a reading and comment method for technical documents.

Thus, the present invention provides a system platform for technical documents that forms a network of technical data files, effectively accumulates and shares the team's knowledge, and speeds up the search and analysis of technical literature data. Moreover, because the present invention converts patent documents into a different form, it can greatly reduce the system load caused by repeated attempts, or by several users simultaneously attempting, to open files in a graphical format.

Please refer to FIG. 16, a schematic diagram of another embodiment of the present invention, which discloses a system for managing patent documents. The system may be implemented as computer program code stored on a non-transitory computer-readable medium and executable by a processor or any electronic device. The system has a display module 164, a capture module 165, a tracking module 166, a task assignment module 167, a zoom module 1695, a pointer module 1694, an optical character recognition (OCR) module 1693, a scheme menu 1692, a search rule simulator 1691, a recording module 169, and a report generation module 168. Note that although FIG. 16 shows many modules, one or more of them may be removed according to design needs, and other modules not shown in the figure may be added. The modules may be encoded as software and executed on one machine or on several machines to provide the functions described below. The modules communicate with one another via an interface 1696, which may be, but is not limited to, a mechanism, a virtual machine, or an operating system. The modules are described one by one below.

Please refer to FIGS. 17A to 17D, which show multiple patent documents presented in document units 172 and displayed in a screen area 171 at different display ratios. A zoom module (not shown) cooperates with a display module to present the content shown in FIGS. 17A to 17D. According to the user's instructions, the zoom module zooms in or out on the patent documents in the document units, displaying them at different display ratios. At different display ratios, figures or fonts may be displayed at different sizes, and, as the user directs, different amounts of information may be displayed.

For example, FIG. 17A shows the information displayed by two information blocks 1721 and 1722 within one document unit. Block 1721 may display a figure or part of a figure, such as the related information mentioned earlier. Block 1722 may display important information, such as a textual combination of the filing date, application number, assignee, and inventor. FIG. 17B shows a larger display ratio: each document unit occupies a larger area while the screen area 171 keeps its original display ratio. In FIG. 17B, in addition to blocks 1721 and 1722, the document unit 172 also displays another information block 1723, which shows, for example, part or all of the abstract of a patent document. Note that blocks 1721, 1722, and 1723 may contain different information under different designs and are not limited to the examples given here.

FIGS. 17C and 17D show patent documents displayed at a larger and a smaller display ratio, respectively. In FIG. 17C, in addition to the other four blocks 1721, 1722, 1723, and 1724, the document unit 172 can display an additional information block 1725 containing, for example, other users' comments. In contrast, in FIG. 17D each document unit 172 displays only one information block 1721. Note that even when a document unit displays only one information block, several pieces of information may be combined in that block, for example a patent title together with part of the abstract or of the claims. Alternatively, a minimized figure or part of a figure may be displayed as the information block of a document unit.
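
One way to picture the relationship between display ratio and displayed information blocks is a lookup from zoom level to block list. The Python sketch below is illustrative only: the four discrete levels, their ordering (FIG. 17D, 17A, 17B, 17C from smallest to largest), and the names `BLOCKS_BY_LEVEL`, `zoom`, and `visible_blocks` are assumptions, since the description does not specify how ratios map to blocks.

```python
# Hypothetical mapping from zoom level to the information blocks shown
# in one document unit, following FIGS. 17D, 17A, 17B, and 17C.
BLOCKS_BY_LEVEL = {
    0: ["1721"],                                  # FIG. 17D: figure only
    1: ["1721", "1722"],                          # FIG. 17A: figure + important information
    2: ["1721", "1722", "1723"],                  # FIG. 17B: adds the abstract block
    3: ["1721", "1722", "1723", "1724", "1725"],  # FIG. 17C: adds, e.g., user comments
}

MIN_LEVEL, MAX_LEVEL = min(BLOCKS_BY_LEVEL), max(BLOCKS_BY_LEVEL)


def zoom(level: int, step: int) -> int:
    """Move the zoom level by step (+1 zoom in, -1 zoom out), clamped to range."""
    return max(MIN_LEVEL, min(MAX_LEVEL, level + step))


def visible_blocks(level: int) -> list[str]:
    """Return the information blocks displayed at the given zoom level."""
    return list(BLOCKS_BY_LEVEL[zoom(level, 0)])


if __name__ == "__main__":
    level = 1                       # start at the FIG. 17A view
    level = zoom(level, +1)         # user zooms in once
    print(level, visible_blocks(level))
```

A real implementation would more likely derive the visible blocks from the pixel size available to each document unit than from fixed levels.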

Instructions to zoom in or out can be implemented in a gesture module, which converts one or more touch gestures on a touch panel into a corresponding zoom instruction. For example, when two fingers press the touch panel simultaneously and slide apart, the gesture module interprets the gesture as zoom-in, and the documents are displayed at a larger display ratio. Devices such as the iPad and notebook computers running Windows 8 use touch panels. To keep the specification concise, how the gesture module is built is not described in detail here.

Another kind of zoom instruction can be produced by an indicator module, which translates the operation of an indicator device into a zoom instruction. For example, the indicator device may be a mouse. When a user presses the mouse wheel over displayed information, or drags a zoom control, these operations are translated into a corresponding zoom instruction that tells the display module how to display the patent documents.
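As a rough illustration of the indicator module, the sketch below turns mouse wheel steps into a new display ratio. The multiplicative step factor and the clamping limits are assumptions; the text says only that the operation is translated into a zoom instruction.

```python
def wheel_to_ratio(current_ratio: float,
                   wheel_steps: int,
                   step_factor: float = 1.25,
                   min_ratio: float = 0.25,
                   max_ratio: float = 4.0) -> float:
    """Return the display ratio after a number of wheel steps.

    Positive steps zoom in, negative steps zoom out. step_factor and the
    min/max limits are hypothetical tuning values.
    """
    new_ratio = current_ratio * step_factor ** wheel_steps
    # Clamp so that repeated scrolling cannot make units vanish or overflow.
    return max(min_ratio, min(max_ratio, new_ratio))


if __name__ == "__main__":
    ratio = 1.0
    for steps in (1, 1, -3):
        ratio = wheel_to_ratio(ratio, steps)
        print(steps, round(ratio, 3))
```

A multiplicative step keeps each wheel notch feeling like the same relative change at any zoom level, which is why it is used here instead of adding a fixed amount.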

Besides zooming in and out, a user can also use the gesture module and/or the indicator module to drag one or more patent documents in order to classify them. Fig. 18 shows the classification operation. Two selected document units, corresponding to two patent documents, are dragged and dropped onto a category 183, which is a category or subcategory under a classification scheme, as are categories 184 and 185. After document units are dropped into a category, they may be replaced by other document units or removed from the screen area. In other words, in classification mode a user can easily sort all the patent documents into their relevant categories. In another mode, a patent document can belong to several categories. For example, a patent document can be classified under "useful", "needs close reading", and "LED module". The first category, "useful", indicates the basis or value of the patent document in a particular analysis, for example a search for prior art to invalidate a published patent. Other parallel options, such as "irrelevant", can be marked with an icon and listed beside the category. The second category, "needs close reading", groups the document with other patent documents to be read later. The third category, "LED module", means the patent document is classified under a category related to LED module technology. Note that the ways of defining and using categories are not limited to those described here; other variations can be derived according to different needs.
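The two classification modes described above can be sketched with a small in-memory structure. The class name `Classifier`, the `exclusive` flag, and the document identifiers are hypothetical, and treating the first mode as strictly single-category is an assumption drawn from the contrast with the multi-category mode.

```python
from collections import defaultdict
from typing import DefaultDict, List, Set


class Classifier:
    """Records which documents have been dropped onto which categories."""

    def __init__(self, exclusive: bool = False) -> None:
        # exclusive=True: a document lives in one category at a time.
        # exclusive=False: a document may belong to several categories.
        self.exclusive = exclusive
        self._categories: DefaultDict[str, Set[str]] = defaultdict(set)

    def drop(self, doc_id: str, category: str) -> None:
        """Handle a drag-and-drop of one document onto one category."""
        if self.exclusive:
            for members in self._categories.values():
                members.discard(doc_id)
        self._categories[category].add(doc_id)

    def categories_of(self, doc_id: str) -> List[str]:
        return sorted(name for name, members in self._categories.items()
                      if doc_id in members)

    def documents_in(self, category: str) -> List[str]:
        return sorted(self._categories.get(category, set()))


if __name__ == "__main__":
    tags = Classifier()
    for category in ("useful", "needs close reading", "LED module"):
        tags.drop("P1", category)
    print(tags.categories_of("P1"))
```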

In Fig. 19, one or more users can use a comment module to comment on the patent documents they are reading. For example, a user selects a document unit 192, and a pop-up window 191 is displayed in which the user can enter comments on the patent document associated with document unit 192. The comment interface can be designed in many different ways. For example, one of several options 1911 and/or a text area can let a user enter a name, one or more paragraphs of plain text, or rich text with images and attachments, such as a web page URL, a PDF file, or an image file.

When one or more users process many patent documents through the system, a tracking module is provided to track the users' progress in viewing, analyzing, and/or commenting on the patent documents. Often, after reading a patent document, a user is prompted by clues in it to search for more patent documents. A capture module is therefore provided to obtain more patent documents for viewing, analysis, and comment. The tracking module can be designed to record not only the order in which a user views patent documents, for example patent document P1, then P2, then P3, but also how a particular patent document entered the analysis, for example that new patent documents were searched for and captured while a particular patent document was being read. The trail can be presented as a tree diagram, a list, or a network with links to the patent documents. The trail also helps users avoid missing important documents when studying a large number of patent documents. For example, Fig. 20 shows an example trail of how a user analyzed and processed multiple patent documents. The patent documents 2001 in the left column were found under a first search rule. The patent documents 2002 were found after the user read the second patent document. The search rules can be recorded, and the hierarchy shows how the searches were carried out within a logical structure.
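One way to picture the trail kept by the tracking module is a parent-linked record per document, rendered as an indented tree. The sketch below is an assumption about a possible data model, not the design in the specification; `Trail`, `TrailEntry`, and the rule labels are hypothetical names.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class TrailEntry:
    doc_id: str
    search_rule: str
    found_while_reading: Optional[str] = None  # parent document, if any


class Trail:
    """Records reading order and how each document entered the analysis."""

    def __init__(self) -> None:
        self.entries: Dict[str, TrailEntry] = {}
        self.reading_order: List[str] = []

    def record_found(self, doc_id: str, search_rule: str,
                     while_reading: Optional[str] = None) -> None:
        # Keep only the first discovery of a document.
        self.entries.setdefault(
            doc_id, TrailEntry(doc_id, search_rule, while_reading))

    def record_read(self, doc_id: str) -> None:
        self.reading_order.append(doc_id)

    def unread(self) -> List[str]:
        """Documents that were found but never read."""
        read = set(self.reading_order)
        return [doc_id for doc_id in self.entries if doc_id not in read]

    def as_tree(self, parent: Optional[str] = None, depth: int = 0) -> List[str]:
        """Render the trail as indented lines, one per document."""
        lines: List[str] = []
        for entry in self.entries.values():
            if entry.found_while_reading == parent:
                lines.append("  " * depth + f"{entry.doc_id} [{entry.search_rule}]")
                lines.extend(self.as_tree(entry.doc_id, depth + 1))
        return lines


if __name__ == "__main__":
    trail = Trail()
    trail.record_found("P1", "rule 1")
    trail.record_found("P2", "rule 1")
    trail.record_read("P1")
    trail.record_read("P2")
    trail.record_found("P3", "rule 2", while_reading="P2")
    print("\n".join(trail.as_tree()))
    print("unread:", trail.unread())
```

The `unread` helper reflects the stated benefit of the trail: documents that were found but not yet read are easy to list, so they are less likely to be missed.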

When several users work together on a project analyzing particular patent documents, the system provides a task assignment module for assigning tasks. For example, different sets of patent documents, retrieved by a capture module or under different search rules, are assigned to different users. The task assignment module can be designed to collect execution information dynamically and present it to the project manager so that the progress of an analysis task can be controlled. An authorization module can be designed to control the access rights of a group of users. As described earlier, a comment module lets the users in a team share comments on the different patent documents.

A report usually has to be compiled once an analysis task is finished. The system can also include a report generation module to generate the report, for example by combining the analysis time with the person who performed the analysis, recording the search conditions under which patent documents were captured and viewed, and gathering useful information from the comment module. This function reduces workload and improves efficiency.

In addition, existing patent databases usually provide drawings in image format and the description in text format. Fig. 21 shows how an optical character recognition (OCR) module can be used to associate the text of a patent specification in a patent database with a drawing. In screen area 211, an image is composed of basic units X1, X2, X3, and X4. Through OCR processing, numeral-like text such as "511", "40", and "Fig.2" is extracted from the image file of the drawing. This text is then used to find the related passages in the patent specification. The related text 2111 can be attached directly beside the numeral in the drawing, or the related text 2110 can be displayed in a pop-up window. The text 2110 and 2111, together with its source location, for example column 7, lines 14-15, can be copied and pasted into a report maintained by a comment module and the report generation module mentioned earlier, so that the report is easier to read.
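The mapping step that follows OCR can be sketched as a text lookup. The example below assumes the OCR stage has already produced a list of strings and does not call any OCR library; `map_numerals_to_text` is a hypothetical name, the sentence splitter is deliberately naive, and figure labels such as "Fig.2" are skipped for brevity.

```python
import re
from typing import Dict, List


def map_numerals_to_text(ocr_tokens: List[str],
                         specification: str) -> Dict[str, List[str]]:
    """Map each reference numeral to the sentences that mention it."""
    # Naive split on sentence-ending punctuation followed by whitespace.
    sentences = [s.strip()
                 for s in re.split(r"(?<=[.!?])\s+", specification)
                 if s.strip()]
    mapping: Dict[str, List[str]] = {}
    for token in ocr_tokens:
        # Accept numerals such as "40" or "301a"; skip other OCR output.
        if not re.fullmatch(r"\d+[a-zA-Z]?", token):
            continue
        # Match the numeral as a whole token, so "40" does not hit "140".
        pattern = re.compile(
            r"(?<![\w.])" + re.escape(token) + r"(?!\w|\.\d)")
        mapping[token] = [s for s in sentences if pattern.search(s)]
    return mapping


if __name__ == "__main__":
    spec = ("The housing 40 holds the lens 511. "
            "A cover 140 protects the housing 40. "
            "The lens 511 focuses light.")
    for numeral, found in map_numerals_to_text(["511", "40", "Fig.2"], spec).items():
        print(numeral, found)
```

A real system would also need to keep the position of each sentence (such as a column and line number) so that the source location can be pasted into a report alongside the text.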

Different analysis work has different requirements and different task types. The system can therefore provide a scheme menu so that a user or a team can select a scheme before carrying out an analysis. For example, a prior-art search to invalidate a published patent differs considerably in character from producing a patent map. Each scheme comes with its own presentation: how the report is generated, how a document unit is displayed, what types of information the blocks in a document unit show, what comment areas are included, and so on. Different interfaces are thus provided to the user through combinations of the modules described above.

In addition, a search simulator can provide a preview of patent document information under different search rules. With this design, a user can check which search rule is more suitable before the patent documents are actually captured and presented, for example to avoid bringing in too many patent documents. This speeds up the analysis and improves efficiency.
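A search simulator of this kind could, for instance, report a hit count and a few sample titles per rule before anything is captured. The sketch below runs candidate rules over a small in-memory index; the record fields, the rule names, and `preview_rules` itself are hypothetical, and a real system would query a patent database instead.

```python
from typing import Callable, Dict, Iterable, List

Record = Dict[str, str]
Rule = Callable[[Record], bool]


def preview_rules(index: Iterable[Record],
                  rules: Dict[str, Rule],
                  sample_size: int = 3) -> Dict[str, dict]:
    """Return hit count and sample titles for each candidate search rule."""
    records: List[Record] = list(index)
    preview: Dict[str, dict] = {}
    for name, rule in rules.items():
        hits = [record for record in records if rule(record)]
        preview[name] = {
            "count": len(hits),
            "sample_titles": [record["title"] for record in hits[:sample_size]],
        }
    return preview


if __name__ == "__main__":
    # Made-up records used purely to exercise the function.
    index = [
        {"title": "LED module with heat sink", "abstract": "An LED module ..."},
        {"title": "Touch panel controller", "abstract": "A controller ..."},
        {"title": "LED driver circuit", "abstract": "A driver for LED ..."},
    ]
    rules = {
        "title contains LED": lambda r: "LED" in r["title"],
        "title contains module": lambda r: "module" in r["title"],
    }
    for name, info in preview_rules(index, rules).items():
        print(name, info)
```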

Although patent documents are used as the embodiment in the examples above, note that other types of documents handled under the same or similar logic, such as technical documents, commercial advertising documents, or other types of documents, should also be regarded as falling within the scope of the invention.

In addition, although patent documents are used to explain how embodiments of the invention operate, other equivalent document types can also be used with the system. For example, the system can be used to capture trademarks from a trademark database. Trademark documents sometimes contain drawings, such as logos or identifiers used to distinguish products. These drawings can be treated as the related information mentioned earlier. Another example is a drug database, which contains many pharmaceutical products with drawings and related text descriptions, such as drug names and side effects. Such documents, without being limited to those described here, can be used with the system to beneficial effect. Beyond patent documents, trademarks, trade dress documents, and other intellectual-property documents, anything that uses the same or similar methods and functions to achieve the same or similar effects, as described in the foregoing text and drawings, can also be regarded as falling within the equivalent scope.

The examples above disclose that embodiments of the invention can be implemented as computer-readable instructions. These instructions can be stored on a non-transitory computer-readable medium and executed by one or more processors. Note, however, that as technology advances such instructions can be written into many kinds of electronic devices and can be embodied partly in hardware, with the remainder implemented as software or firmware. Other variations in implementation should also be regarded as falling within the scope protected by the invention.

The detailed description of the preferred embodiments above is intended to describe the features and spirit of the invention more clearly, not to limit the scope of the invention to the preferred embodiments disclosed. On the contrary, the intent is to cover various modifications and equivalent arrangements within the scope of the claims of the invention. The scope of the claims should therefore be given the broadest interpretation in light of the above description, so that it covers all possible modifications and equivalent arrangements.

300‧‧‧capture system

301‧‧‧verification and screening unit

301a‧‧‧configured capture parameters

302‧‧‧capture selection module

303‧‧‧data capture module

310‧‧‧technical document data

Claims (10)

1. A reading system for technical documents, the reading system comprising at least: a graphical interface divided into a plurality of document units, each of the document units displaying, in a screen area, document information of a technical document, the document information including important information of the technical document, or at least one related drawing, or both the important information and the at least one related drawing; a display module for displaying the important information of the technical document, or the at least one related drawing, or both, which are displayed in the document unit within the screen area; and a zoom module for zooming the display of the technical document in the document unit in or out at different display ratios; wherein when the document unit displays more technical document information at a first display ratio than at a second display ratio, the first display ratio is greater than the second display ratio.

2. The reading system for technical documents of claim 1, further comprising a gesture module for translating a first touch gesture into a zoom instruction that instructs the zoom module to zoom in or out.

3. The reading system for technical documents of claim 2, further comprising a classification module, wherein the gesture module translates a second touch gesture into a drag-and-drop instruction indicating that the technical document displayed in the document unit is dragged to a category, thereby classifying the plurality of technical documents.

4. The reading system for technical documents of claim 1, further comprising an indicator module for translating an indicator device operation into a zoom instruction that instructs the zoom module to zoom in or out.

5. The reading system for technical documents of claim 4, further comprising a classification module, wherein the indicator module translates a second indicator operation into a drag-and-drop instruction indicating that the technical document displayed in the document unit is dragged to a category, thereby classifying the technical document.

6. The reading system for technical documents of any one of claims 1 to 5, further comprising an optical character recognition (OCR) processing module for applying optical character recognition to the at least one related drawing of the technical document and mapping numeral text in the at least one related drawing to text content in the specification of the technical document.

7. The reading system for technical documents of claim 6, wherein when the numeral text is selected, the text content is displayed in a pop-up window.

8. The reading system for technical documents of claim 1, further comprising a scheme menu that lets a user select a scheme for analyzing the technical document and for performing different analyses, the scheme menu optionally including a patentability prior-art search and patent map generation.

9. The reading system for technical documents of claim 1, further comprising a search rule simulator for previewing the technical document information under different search rules.

10. The reading system for technical documents of claim 1, further comprising a tracking module for tracking the user's progress in sequentially viewing and capturing the technical documents.
TW103136595A 2013-10-23 2014-10-23 Technical documents capturing and patents analysis system and method TW201519071A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US14/060,617 US20140195904A1 (en) 2013-01-06 2013-10-23 Technical documents capturing and patents analysis system and method

Publications (1)

Publication Number Publication Date
TW201519071A true TW201519071A (en) 2015-05-16

Family

ID=51061979

Family Applications (1)

Application Number Title Priority Date Filing Date
TW103136595A TW201519071A (en) 2013-10-23 2014-10-23 Technical documents capturing and patents analysis system and method

Country Status (2)

Country Link
US (1) US20140195904A1 (en)
TW (1) TW201519071A (en)

Also Published As

Publication number Publication date
US20140195904A1 (en) 2014-07-10
