WO2009017131A1 - System, method, and program for generating non-deterministic finite automaton not including ε-transition - Google Patents

System, method, and program for generating non-deterministic finite automaton not including ε-transition

Info

Publication number
WO2009017131A1
Authority
WO
WIPO (PCT)
Prior art keywords
nfa
syntax tree
storage unit
initial setting
transition
Prior art date
Application number
PCT/JP2008/063604
Other languages
English (en)
French (fr)
Inventor
Norio Yamagaki
Original Assignee
Nec Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nec Corporation
Priority to US12/452,987 (US20100138367A1)
Priority to JP2009525412A (JP5381710B2)
Publication of WO2009017131A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/903 Querying
    • G06F16/90335 Query processing
    • G06F16/90344 Query processing by using string matching techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computing arrangements using knowledge-based models
    • G06N5/01 Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Devices For Executing Special Programs (AREA)

Abstract

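 The aim is to speed up the conversion of a regular expression, via a syntax tree, into an NFA that contains no ε-transitions. The system comprises an initial setting means 21 and an NFA conversion means 22. The initial setting means 21 receives, from an input device 1, a syntax tree that is generated from a regular expression written using only characters and two metacharacters ('|' for alternation and '*' for zero-or-more matching) and that uses the concatenation symbol '・' and the empty symbol 'Φ' as nodes; it stores the syntax tree in a syntax tree storage unit 31 and performs the initial setting of the NFA to be generated; once the setting is complete, it stores the NFA in an NFA storage unit 32 and sends the syntax tree data read from the syntax tree storage unit 31 to the NFA conversion means 22. The NFA conversion means 22 takes the syntax tree data received from the initial setting means 21, reads the NFA from the NFA storage unit 32, applies five conversion patterns to each node of the syntax tree to convert it directly into an NFA that contains no ε-transitions, and, once the conversion is complete, stores the generated NFA in the NFA storage unit 32 and outputs it to an output device 4.
PCT/JP2008/063604 2007-08-02 2008-07-29 System, method, and program for generating non-deterministic finite automaton not including ε-transition WO2009017131A1 (ja)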
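
The abstract does not spell out the five conversion patterns, so they are not reproduced here. As a rough illustration of what "converting a syntax tree directly into an ε-free NFA" can look like, the sketch below uses the textbook position (Glushkov) construction instead, which is a different, well-known technique and not the patented method. The `Node` class, the node kind names (`"char"`, `"alt"`, `"star"`, `"cat"`, `"empty"`) and the functions `build_nfa` and `accepts` are hypothetical names chosen for this example; treating the 'Φ' node as "matches the empty string" is also an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    # kind is one of: "char", "alt" ('|'), "star" ('*'), "cat" (concatenation),
    # "empty" (assumed here to stand for the patent's empty symbol)
    kind: str
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    char: str = ""


def build_nfa(root):
    """Build an epsilon-free NFA; state 0 is the start state.

    Returns (transitions, accepting), where transitions maps
    (state, character) to a set of next states.
    """
    symbols = [None]  # symbols[p] is the character of position p (p >= 1)
    follow = {}       # follow[p]: positions that may directly follow p

    def visit(n):
        # Returns (nullable, first positions, last positions) for subtree n.
        if n.kind == "empty":
            return True, set(), set()
        if n.kind == "char":
            symbols.append(n.char)
            p = len(symbols) - 1
            follow[p] = set()
            return False, {p}, {p}
        if n.kind == "star":
            _, first, last = visit(n.left)
            for p in last:            # loop back from the end to the start
                follow[p] |= first
            return True, first, last
        l_null, l_first, l_last = visit(n.left)
        r_null, r_first, r_last = visit(n.right)
        if n.kind == "alt":
            return l_null or r_null, l_first | r_first, l_last | r_last
        if n.kind == "cat":
            for p in l_last:          # the right part may follow the left part
                follow[p] |= r_first
            first = l_first | r_first if l_null else l_first
            last = l_last | r_last if r_null else r_last
            return l_null and r_null, first, last
        raise ValueError(f"unknown node kind: {n.kind}")

    nullable, first, last = visit(root)
    follow[0] = first
    transitions = {}
    for p, nexts in follow.items():
        for q in nexts:
            transitions.setdefault((p, symbols[q]), set()).add(q)
    accepting = set(last) | ({0} if nullable else set())
    return transitions, accepting


def accepts(nfa, text):
    """Simulate the NFA on text by tracking the set of active states."""
    transitions, accepting = nfa
    current = {0}
    for ch in text:
        current = set().union(*(transitions.get((s, ch), set()) for s in current))
    return bool(current & accepting)


# Syntax tree for (a|b)*ab
tree = Node("cat",
            Node("cat",
                 Node("star", Node("alt", Node("char", char="a"), Node("char", char="b"))),
                 Node("char", char="a")),
            Node("char", char="b"))
nfa = build_nfa(tree)
```

With this sketch, `accepts(nfa, "abab")` is true and `accepts(nfa, "ba")` is false. Every transition consumes exactly one input character, so no ε-closure step is needed during matching, which is the property the abstract is after.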

Priority Applications (2)

Application Number Priority Date Filing Date Title
US12/452,987 US20100138367A1 (en) 2007-08-02 2008-07-29 SYSTEM, METHOD, AND PROGRAM FOR GENERATING NON-DETERMINISTIC FINITE AUTOMATON NOT INCLUDING e-TRANSITION
JP2009525412A JP5381710B2 (ja) 2007-08-02 2008-07-29 System, method, and program for generating non-deterministic finite automaton not including ε-transition

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2007201510 2007-08-02
JP2007-201510 2007-08-02

Publications (1)

Publication Number Publication Date
WO2009017131A1 (ja) 2009-02-05

Family

ID=40304361

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2008/063604 WO2009017131A1 (ja) 2007-08-02 2008-07-29 System, method, and program for generating non-deterministic finite automaton not including ε-transition

Country Status (3)

Country Link
US (1) US20100138367A1 (ja)
JP (1) JP5381710B2 (ja)
WO (1) WO2009017131A1 (ja)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
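JP2012524338A (ja) * 2009-04-17 2012-10-11 Alcatel-Lucent Variable-stride stream segmentation and multi-pattern matching
CN104426909A (zh) * 2013-08-30 2015-03-18 Cavium, Inc. Generating a non-deterministic finite automaton (NFA) graph for regular expression patterns with advanced features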
US9904630B2 (en) 2014-01-31 2018-02-27 Cavium, Inc. Finite automata processing based on a top of stack (TOS) memory
US10002326B2 (en) 2014-04-14 2018-06-19 Cavium, Inc. Compilation of finite automata based on memory hierarchy
US10110558B2 (en) 2014-04-14 2018-10-23 Cavium, Inc. Processing of finite automata based on memory hierarchy

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5232191B2 (ja) * 2010-03-19 2013-07-10 Toshiba Corporation Information processing apparatus, information processing method, and program
US8589405B1 (en) 2010-07-16 2013-11-19 Netlogic Microsystems, Inc. Token stitcher for a content search system having pipelined engines
US8572106B1 (en) 2010-07-16 2013-10-29 Netlogic Microsystems, Inc. Memory management in a token stitcher for a content search system having pipelined engines
US8700593B1 (en) * 2010-07-16 2014-04-15 Netlogic Microsystems, Inc. Content search system having pipelined engines and a token stitcher
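CN102075511B (zh) * 2010-11-01 2014-05-14 NSFOCUS Information Technology Co., Ltd. (北京神州绿盟信息安全科技股份有限公司) Data matching device and method, and network intrusion detection device and method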
US9398033B2 (en) 2011-02-25 2016-07-19 Cavium, Inc. Regular expression processing automaton
US9858051B2 (en) * 2011-06-24 2018-01-02 Cavium, Inc. Regex compiler
US9203805B2 (en) 2011-11-23 2015-12-01 Cavium, Inc. Reverse NFA generation and processing
US9426166B2 (en) 2013-08-30 2016-08-23 Cavium, Inc. Method and apparatus for processing finite automata
US9426165B2 (en) 2013-08-30 2016-08-23 Cavium, Inc. Method and apparatus for compilation of finite automata
US10242125B2 (en) * 2013-12-05 2019-03-26 Entit Software Llc Regular expression matching
US9419943B2 (en) 2013-12-30 2016-08-16 Cavium, Inc. Method and apparatus for processing of finite automata
US9602532B2 (en) 2014-01-31 2017-03-21 Cavium, Inc. Method and apparatus for optimizing finite automata processing
US9438561B2 (en) 2014-04-14 2016-09-06 Cavium, Inc. Processing of finite automata based on a node cache
US9875045B2 (en) * 2015-07-27 2018-01-23 International Business Machines Corporation Regular expression matching with back-references using backtracking
CN107193776A (zh) * 2017-05-24 2017-09-22 Nanjing University A novel conversion algorithm for regular expression matching
US9996328B1 (en) * 2017-06-22 2018-06-12 Archeo Futurus, Inc. Compiling and optimizing a computer code by minimizing a number of states in a finite machine corresponding to the computer code
US10481881B2 (en) * 2017-06-22 2019-11-19 Archeo Futurus, Inc. Mapping a computer code to wires and gates
US11669751B2 (en) * 2020-11-27 2023-06-06 At&T Intellectual Property I, L.P. Prediction of network events via rule set representations of machine learning models

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
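JP2003242179A (ja) * 2002-02-05 2003-08-29 Internatl Business Mach Corp <Ibm> Character string matching method, and document processing apparatus and program using the same
WO2008081932A1 (ja) * 2006-12-28 2008-07-10 Nec Corporation System for generating finite automaton for character string matching, generation method therefor, and generation program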

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
HISAKAZU TAJIMI: "Programming Technique 36 grep(7", UNIX MAGAZINE, vol. 15, no. 10, 1 October 2000 (2000-10-01), pages 58 - 66 *
NORIO YAMAGAKI ET AL.: "Seiki Hyogen Kensaku Engine ni Okeru Multibyte Shorika ni Kansuru Kento", PROCEEDINGS OF THE 2007 IEICE GENERAL CONFERENCE, 7 March 2007 (2007-03-07), pages 188 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
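JP2012524338A (ja) * 2009-04-17 2012-10-11 Alcatel-Lucent Variable-stride stream segmentation and multi-pattern matching
CN104426909A (zh) * 2013-08-30 2015-03-18 Cavium, Inc. Generating a non-deterministic finite automaton (NFA) graph for regular expression patterns with advanced features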
US9785403B2 (en) 2013-08-30 2017-10-10 Cavium, Inc. Engine architecture for processing finite automata
US9823895B2 (en) 2013-08-30 2017-11-21 Cavium, Inc. Memory management for finite automata processing
CN104426909B (zh) * 2013-08-30 2019-01-04 Cavium, LLC Method and computer system for compiling a pattern into a non-deterministic finite automaton (NFA) graph
US10466964B2 (en) 2013-08-30 2019-11-05 Cavium, Llc Engine architecture for processing finite automata
US9904630B2 (en) 2014-01-31 2018-02-27 Cavium, Inc. Finite automata processing based on a top of stack (TOS) memory
US10002326B2 (en) 2014-04-14 2018-06-19 Cavium, Inc. Compilation of finite automata based on memory hierarchy
US10110558B2 (en) 2014-04-14 2018-10-23 Cavium, Inc. Processing of finite automata based on memory hierarchy

Also Published As

Publication number Publication date
JP5381710B2 (ja) 2014-01-08
US20100138367A1 (en) 2010-06-03
JPWO2009017131A1 (ja) 2010-10-21

Similar Documents

Publication Publication Date Title
WO2009017131A1 (ja) System, method, and program for generating non-deterministic finite automaton not including ε-transition
TWI267025B (en) Method, apparatus, machine accessible medium, and system for regular expression searching
WO2010138818A8 (en) Specifying a parser using a properties file
TW201246071A (en) Unrolling quantifications to control in-degree and/or out degree of automaton
CN106970820A (zh) Code storage method and code storage device
KR101012504B1 (ko) Method for automatically extracting relation triples by traversing dependency grammar parse trees
WO2007115031A3 (en) Method and system for processing xml-type telecommunications documents
CN101697126A (zh) ETL implementation method for incremental data of Excel files
WO2008149675A1 (ja) Waveform generation device, waveform generation method, and program
WO2008081932A1 (ja) System for generating finite automaton for character string matching, generation method therefor, and generation program
US10742772B2 (en) Method for transforming data for low volume transmission of meta model based protocol which monitors power amount data of new renewable energy, and system for transmitting data for low volume transmission of meta model based protocol which monitors power amount data of new renewable energy
CN101771675B (zh) Method and device for feature matching of data packets
WO2006107770A3 (en) Method and system for inspecting a home for energy efficiency
CN103336706A (zh) System and method for converting pseudocode into a programming language
JP2001137788A5 (ja)
CN207382479U (zh) Remote control
CN206541292U (zh) Automatic deployment system for virtual machines
CN101151612A (zh) Method and system for random access to documents
Jiang et al. Applying improved AO* based on DPSO algorithm in the optimal test sequencing problem of large scale complicated electronic system
WO2009103776A3 (en) Method and apparatus for correct mappings of uml to owl ontology format
JP5366709B2 (ja) Information processing apparatus, common character string output method, and program
JP2009169798A (ja) Data processing system, data conversion method, and program for the method
Nguyen et al. Information Technology Skills Extractor for Job Descriptions in vku-ITSkills Dataset Using Natural Language Processing
Pham et al. A Study on Vietnamese Semantic Analysis using BERT-Based PreTrained Language Model
Dhounsi GENERAL STUDY ON POLYNOMIALS ASSOCIATED WITH HUMBERT POLYNOMIALS

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08791839

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 12452987

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2009525412

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08791839

Country of ref document: EP

Kind code of ref document: A1