WO2006061899A8 - 文字列照合装置および文字列照合プログラム - Google Patents

文字列照合装置および文字列照合プログラム

Info

Publication number
WO2006061899A8
WO2006061899A8 PCT/JP2004/018348 JP2004018348W WO2006061899A8 WO 2006061899 A8 WO2006061899 A8 WO 2006061899A8 JP 2004018348 W JP2004018348 W JP 2004018348W WO 2006061899 A8 WO2006061899 A8 WO 2006061899A8
Authority
WO
WIPO (PCT)
Prior art keywords
state
transition table
character string
state transition
transition
Prior art date
Application number
PCT/JP2004/018348
Other languages
English (en)
French (fr)
Other versions
WO2006061899A1 (ja
Inventor
Mitsunori Kori
Original Assignee
Mitsubishi Electric Corp
Mitsunori Kori
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corp, Mitsunori Kori filed Critical Mitsubishi Electric Corp
Priority to PCT/JP2004/018348 priority Critical patent/WO2006061899A1/ja
Priority to US11/792,564 priority patent/US8032479B2/en
Priority to BRPI0419214-1A priority patent/BRPI0419214B1/pt
Priority to JP2007531511A priority patent/JP4535130B2/ja
Priority to CNB2004800445705A priority patent/CN100524301C/zh
Publication of WO2006061899A1 publication Critical patent/WO2006061899A1/ja
Publication of WO2006061899A8 publication Critical patent/WO2006061899A8/ja

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • G06F16/90344Query processing by using string matching techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Stored Programmes (AREA)
  • Document Processing Apparatus (AREA)

Abstract

 正規表現で記述された照合条件に基づいて状態遷移表を生成する状態遷移表生成部と、前記状態遷移表生成部により生成された状態遷移表に基づいて遷移するオートマトンとを備えるとともに、前記オートマトンは、前記照合条件に基づいて生成された状態遷移表において、現状態と入力文字の組に対する次の遷移先状態が存在しない場合、入力文字を読み進めずに初期状態へ遷移する。  また、正規表現で記述された照合条件に基づいて状態遷移表を生成する状態遷移表生成部と、前記状態遷移表生成部により生成された状態遷移表に基づいて遷移するオートマトンとを備えるとともに、前記状態遷移表生成部は、前記照合条件に基づいて生成された状態遷移表において、現状態と入力文字の組に対する次の遷移先状態が存在しない場合、入力文字を読み進めずに所定の状態へ遷移する除外文字を設定して状態遷移表を作成する。
PCT/JP2004/018348 2004-12-09 2004-12-09 文字列照合装置および文字列照合プログラム WO2006061899A1 (ja)

Priority Applications (5)

Application Number Priority Date Filing Date Title
PCT/JP2004/018348 WO2006061899A1 (ja) 2004-12-09 2004-12-09 文字列照合装置および文字列照合プログラム
US11/792,564 US8032479B2 (en) 2004-12-09 2004-12-09 String matching system and program therefor
BRPI0419214-1A BRPI0419214B1 (pt) 2004-12-09 2004-12-09 Sistema e método de correspondência de sequência
JP2007531511A JP4535130B2 (ja) 2004-12-09 2004-12-09 文字列照合装置および文字列照合プログラム
CNB2004800445705A CN100524301C (zh) 2004-12-09 2004-12-09 字符串对照装置

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2004/018348 WO2006061899A1 (ja) 2004-12-09 2004-12-09 文字列照合装置および文字列照合プログラム

Publications (2)

Publication Number Publication Date
WO2006061899A1 WO2006061899A1 (ja) 2006-06-15
WO2006061899A8 true WO2006061899A8 (ja) 2007-08-30

Family

ID=36577729

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2004/018348 WO2006061899A1 (ja) 2004-12-09 2004-12-09 文字列照合装置および文字列照合プログラム

Country Status (5)

Country Link
US (1) US8032479B2 (ja)
JP (1) JP4535130B2 (ja)
CN (1) CN100524301C (ja)
BR (1) BRPI0419214B1 (ja)
WO (1) WO2006061899A1 (ja)

Families Citing this family (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4810915B2 (ja) * 2005-07-28 2011-11-09 日本電気株式会社 データ検索装置及び方法、並びにコンピュータ・プログラム
US20070226362A1 (en) * 2006-03-21 2007-09-27 At&T Corp. Monitoring regular expressions on out-of-order streams
US8903840B2 (en) * 2006-08-31 2014-12-02 International Business Machines Corporation System and method for launching a specific program from a simple click on a string of characters
WO2008084594A1 (ja) * 2007-01-12 2008-07-17 Nec Corporation パターンマッチング装置及び方法
US7630982B2 (en) 2007-02-24 2009-12-08 Trend Micro Incorporated Fast identification of complex strings in a data stream
US20090006316A1 (en) * 2007-06-29 2009-01-01 Wenfei Fan Methods and Apparatus for Rewriting Regular XPath Queries on XML Views
JP5224953B2 (ja) 2008-07-17 2013-07-03 インターナショナル・ビジネス・マシーンズ・コーポレーション 情報処理装置、情報処理方法およびプログラム
FR2939535B1 (fr) * 2008-12-10 2013-08-16 Canon Kk Procede et systeme de traitement pour la configuration d'un processseur exi
US8862603B1 (en) * 2010-11-03 2014-10-14 Netlogic Microsystems, Inc. Minimizing state lists for non-deterministic finite state automatons
US9858051B2 (en) * 2011-06-24 2018-01-02 Cavium, Inc. Regex compiler
US8990259B2 (en) 2011-06-24 2015-03-24 Cavium, Inc. Anchored patterns
US8719331B2 (en) 2011-08-02 2014-05-06 Cavium, Inc. Work migration in a processor
JP5554304B2 (ja) * 2011-09-16 2014-07-23 株式会社東芝 オートマトン決定化方法、オートマトン決定化装置およびオートマトン決定化プログラム
US8818783B2 (en) * 2011-09-27 2014-08-26 International Business Machines Corporation Representing state transitions
US9455996B2 (en) * 2011-10-03 2016-09-27 New York University Generating progressively a perfect hash data structure, such as a multi-dimensional perfect hash data structure, and using the generated data structure for high-speed string matching
CN102542038A (zh) * 2011-12-27 2012-07-04 浪潮通信信息系统有限公司 一种通用可配置的标准局数据入库方法
WO2013137864A1 (en) * 2012-03-13 2013-09-19 Hewlett-Packard Development Company, L.P. Submatch extraction
US9558299B2 (en) 2012-04-30 2017-01-31 Hewlett Packard Enterprise Development Lp Submatch extraction
US8725749B2 (en) * 2012-07-24 2014-05-13 Hewlett-Packard Development Company, L.P. Matching regular expressions including word boundary symbols
US8793251B2 (en) * 2012-07-31 2014-07-29 Hewlett-Packard Development Company, L.P. Input partitioning and minimization for automaton implementations of capturing group regular expressions
US8938454B2 (en) * 2012-10-10 2015-01-20 Polytechnic Institute Of New York University Using a tunable finite automaton for regular expression matching
US9268881B2 (en) 2012-10-19 2016-02-23 Intel Corporation Child state pre-fetch in NFAs
US9117170B2 (en) 2012-11-19 2015-08-25 Intel Corporation Complex NFA state matching method that matches input symbols against character classes (CCLs), and compares sequence CCLs in parallel
US9665664B2 (en) 2012-11-26 2017-05-30 Intel Corporation DFA-NFA hybrid
US9304768B2 (en) 2012-12-18 2016-04-05 Intel Corporation Cache prefetch for deterministic finite automaton instructions
US9251440B2 (en) * 2012-12-18 2016-02-02 Intel Corporation Multiple step non-deterministic finite automaton matching
US9268570B2 (en) 2013-01-23 2016-02-23 Intel Corporation DFA compression and execution
US20140271326A1 (en) 2013-03-15 2014-09-18 3D Systems, Inc. Powder Distribution for Laser Sintering Systems
US9086688B2 (en) * 2013-07-09 2015-07-21 Fisher-Rosemount Systems, Inc. State machine function block with user-definable actions on a transition between states
WO2015084360A1 (en) * 2013-12-05 2015-06-11 Hewlett-Packard Development Company, L.P. Regular expression matching
US9275336B2 (en) 2013-12-31 2016-03-01 Cavium, Inc. Method and system for skipping over group(s) of rules based on skip group rule
US9544402B2 (en) 2013-12-31 2017-01-10 Cavium, Inc. Multi-rule approach to encoding a group of rules
US9667446B2 (en) 2014-01-08 2017-05-30 Cavium, Inc. Condition code approach for comparing rule and packet data that are provided in portions
US11782983B1 (en) * 2020-11-27 2023-10-10 Amazon Technologies, Inc. Expanded character encoding to enhance regular expression filter capabilities

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4764863A (en) * 1985-05-09 1988-08-16 The United States Of America As Represented By The Secretary Of Commerce Hardware interpreter for finite state automata
JP2702927B2 (ja) * 1987-06-15 1998-01-26 株式会社日立製作所 文字列検索装置
US5309358A (en) * 1992-02-18 1994-05-03 International Business Machines Corporation Method for interchange code conversion of multi-byte character string characters
JP2994926B2 (ja) 1993-10-29 1999-12-27 松下電器産業株式会社 有限状態機械作成方法とパターン照合機械作成方法とこれらを変形する方法および駆動方法
JP4118363B2 (ja) 1996-06-27 2008-07-16 富士通株式会社 スパースな状態遷移表に基づく複数記号列の照合装置および方法
US5995963A (en) * 1996-06-27 1999-11-30 Fujitsu Limited Apparatus and method of multi-string matching based on sparse state transition list
JP4056962B2 (ja) 1996-06-27 2008-03-05 富士通株式会社 スパースな状態遷移表に基づく複数記号列の照合装置および方法
JP4021832B2 (ja) 1996-06-27 2007-12-12 富士通株式会社 スパースな状態遷移表に基づく複数記号列の照合装置および方法
JP3231673B2 (ja) 1996-11-21 2001-11-26 シャープ株式会社 文字,文字列検索方法及び該方法に用いる記録媒体
EP1436936A4 (en) * 2001-09-12 2006-08-02 Safenet Inc RECOGNITION OF FORMS OF HIGH-SPEED DATA FLOW
EP1436718B1 (en) * 2001-09-12 2007-09-19 SafeNet, Inc. Method of generating a DFA state machine that groups transitions into classes in order to conserve memory
US7346511B2 (en) * 2002-12-13 2008-03-18 Xerox Corporation Method and apparatus for recognizing multiword expressions
US7552051B2 (en) * 2002-12-13 2009-06-23 Xerox Corporation Method and apparatus for mapping multiword expressions to identifiers using finite-state networks
US7305391B2 (en) * 2003-02-07 2007-12-04 Safenet, Inc. System and method for determining the start of a match of a regular expression
WO2004107404A2 (en) * 2003-05-23 2004-12-09 Sensory Networks, Inc. Apparatus and method for large hardware finite state machine with embedded equivalence classes
US20050273450A1 (en) * 2004-05-21 2005-12-08 Mcmillen Robert J Regular expression acceleration engine and processing model
US7539681B2 (en) * 2004-07-26 2009-05-26 Sourcefire, Inc. Methods and systems for multi-pattern searching
US7356663B2 (en) * 2004-11-08 2008-04-08 Intruguard Devices, Inc. Layered memory architecture for deterministic finite automaton based string matching useful in network intrusion detection and prevention systems and apparatuses

Also Published As

Publication number Publication date
JP4535130B2 (ja) 2010-09-01
CN101076798A (zh) 2007-11-21
CN100524301C (zh) 2009-08-05
JPWO2006061899A1 (ja) 2009-09-03
WO2006061899A1 (ja) 2006-06-15
US20080109431A1 (en) 2008-05-08
US8032479B2 (en) 2011-10-04
BRPI0419214A (pt) 2008-04-15
BRPI0419214B1 (pt) 2021-09-21

Similar Documents

Publication Publication Date Title
WO2006061899A8 (ja) 文字列照合装置および文字列照合プログラム
WO2006017444A3 (en) Gaming machine with environment aware audio configuration
WO2006006028A8 (en) Method, apparatus and computer program product to utilize context ontology in mobile device application personalization
WO2006026733A3 (en) A method of designing a probe card apparatus with desired compliance characteristics
WO2007001765A3 (en) Using language models to expand wildcards
WO2008013720A3 (en) Method and apparatus for font subsetting
WO2005038626A3 (en) Adventure figure system and method
WO2005110033A3 (en) Video game including time dilation effect and a storage medium sotring software for the video game
WO2006010737A3 (en) Methods, apparatus and software for validating entries made on a form
WO2007050234A3 (en) System for obtaining reviews using selections created by user base
WO2005052733A3 (en) Electronic device and user interface and input method therefor
WO2006078912A3 (en) Automatic dynamic contextual data entry completion system
EP1426877A3 (en) Importing and exporting hierarchically structured data
WO2007095375A3 (en) Method and apparatus for configuring nodes in a wireless network
EA200601710A1 (ru) Способ, аппаратура и устройство хранения программ, пригодные для автоматического проектирования бурильной колонны на основе требований геометрии и траектории скважины
WO2002037472A3 (en) User interface for the administration of an external database
WO2008051783A3 (en) Context-free grammar
WO2004059435A3 (en) Using shared files in a game console or computer for cross-game state sharing
WO2003081476A3 (en) Method and data structure for a low memory overhead database
EA200300742A1 (ru) Объектно-ориентированная система компьютерного моделирования углеводородных резервуаров
WO2008146807A1 (ja) オントロジ処理装置、オントロジ処理方法、及びオントロジ処理プログラム
WO2007146809A3 (en) Identifying content of interest
WO2007092194A3 (en) System and method of analyzing freeform mathematical responses
WO2008051791A3 (en) Pattern-based file relationship inference
WO2006012207A3 (en) Report generating method and apparatus

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 2007531511

Country of ref document: JP

AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 11792564

Country of ref document: US

Ref document number: 1020077012822

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 200480044570.5

Country of ref document: CN

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 04822547

Country of ref document: EP

Kind code of ref document: A1

WWW Wipo information: withdrawn in national office

Ref document number: 4822547

Country of ref document: EP

ENP Entry into the national phase

Ref document number: PI0419214

Country of ref document: BR

WWP Wipo information: published in national office

Ref document number: 11792564

Country of ref document: US