CN102760125A - Barcode filtering and matching engine technology - Google Patents

Barcode filtering and matching engine technology Download PDF

Info

Publication number
CN102760125A
CN102760125A CN2011101046051A CN201110104605A CN102760125A CN 102760125 A CN102760125 A CN 102760125A CN 2011101046051 A CN2011101046051 A CN 2011101046051A CN 201110104605 A CN201110104605 A CN 201110104605A CN 102760125 A CN102760125 A CN 102760125A
Authority
CN
China
Prior art keywords
data
barcode
search
keyword
technology
Prior art date
Application number
CN2011101046051A
Other languages
Chinese (zh)
Inventor
苏捷
Original Assignee
上海真石信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海真石信息技术有限公司 filed Critical 上海真石信息技术有限公司
Priority to CN2011101046051A priority Critical patent/CN102760125A/en
Publication of CN102760125A publication Critical patent/CN102760125A/en

Links

Abstract

The invention relates to a barcode filtering and matching engine technology, which is the technology for processing barcode data with a filtering method and requesting matched data contained in a search term from mass data in a database. The technology comprises the steps as follows: analyzing primary characteristic property based on data information flow analysis of keywords, comparing according to the existing keyword library (a built-in keyword library), intercepting a network data packet in an equals manner during internal filtering, carrying out keyword comparison on the content of the data packet, calculating whether the content of the data packet is similar to the internal data library according to the corresponding MD5 value of the data information flow, obtaining and merging the data, and carrying out optimization on the data by combining dichotomy and Hash, so as to achieve rapid and precise matching of the input data. According to the technology, a method for searching the result by using the barcode or keywords of the search term is realized, and can lighten the equipment; professional equipment is not needed; and the function of fast scanning barcode and fast querying the corresponding merchandise information can be realized just by a wireless mobile terminal and the like.

Description

条形码滤浊匹配引擎技术 Barcode filter turbidity matching engine technology

一、技术领域 First, the technical field

[0001] 本发明有关一种从条形码数据经滤浊方法处理后,再从数据库海量数据中请求包括搜索项内所匹配的数据的技术方法。 [0001] The present invention method for processing voiced after filtration, and then the request relates to a bar code data from the mass data from the database comprises technical methods within the search item data matching. 搜索引擎可实现一种使用搜索项的条形码或关键字来搜索结果的方法。 Search engines can achieve a method of using a bar code or keyword search terms to the search results.

二、背景技术· Second, the technical background *

[0002] 条形码(barcode)是将宽度不等的多个黑条和空白,按照一定的编码规则排列,用以表达一组信息的图形标识符。 [0002] barcode (Barcode) is a plurality of varying widths of black bars and spaces, arranged according to certain encoding rules for expressing a set of graphical information identifier. 常见的条形码是由反射率相差很大的黑条(简称条)和白条(简称空)排成的平行线图案。 Common bar code is very different from the reflectance of the black bar (the bar) and white bars (the blank) are arranged in a pattern of parallel lines. 条形码可以标出物品的生产国、制造厂家、商品名称、生产日期、图书分类号、邮件起止地点、类别、日期等许多信息,因而在商品流通、图书管理、邮政管理、银行系统等许多领域都得到了广泛的应用。 Many bar code information can be marked goods producer, manufacturer, product name, date of production, book classification number, e-mail starting and ending location, category, date, etc., which are in many areas of commodity circulation, library management, postal management, banking systems, etc. It has been widely used.

三、发明内容 III. SUMMARY OF THE INVENTION

[0003] I、发明目的: [0003] I, object of the invention:

[0004] 以往只有企业级用户和拥有大型条形码查询设备用户才能查询商品条形码信息,现在提供该方法可以轻量化设备,做到不需要专业设备仅仅只需无线移动终端等就能实现条形码快速扫描条码,快速查询对应商品信息的功能。 [0004] the past, only large enterprise customers and has a bar code queries the device user can query product bar code information, which can now provide lightweight device, just simply do not need specialized equipment such as wireless mobile terminals will be able to achieve rapid bar code scanning bar code , quickly find the product information corresponding to the function.

[0005] 2、技术解决方案: [0005] 2, technology solutions:

[0006] 基于条形码关键字的数据信息流分析,分析其主要特征性质,根据现有关键字库(内建关键字库)进行比较,内部过滤采用equals方式对网络数据包进行侦听,对数据包的内容进行关键字比对,依据数据信息流的对应MD5值,计算出其是否与内部数据库相似,并获取其数据后合并,对数据进行二分法(dichotomy)和哈希(Hash)相互结合的优化,从而达到输入数据快速精确匹配性。 [0006] Keyword barcode data stream based on analysis, wherein the main properties, compared according to the prior keyword library (built keyword library), using an internal filter network packets equals embodiment of listening, the data contents of the package keyword matching, according to the corresponding data stream MD5 value, which is similar to calculate whether an internal database, and its data acquisition after the merger, dichotomous data (dichotomy) and hash (the hash) bonded to each other optimized to achieve the fast and accurate matching of the input data.

[0007] 3、附图说明: [0007] 3. Brief Description:

[0008] 图I是条形码滤浊匹配引擎技术的流程图。 [0008] Figure I is a bar code matching filter flowchart cloud engine technology.

四、具体实施方式 IV DETAILED DESCRIPTION

[0009] 条形码滤浊精确搜索引擎终端技术: [0009] Turbidity was filtered accurate barcode terminal search engine technology:

[0010] I.网络输入搜索项: [0010] I. Network enter a search term:

[0011] I. I过滤关键字: [0011] I. I filter Keywords:

[0012] 基于关键字的数据信息流分析,分析其主要特征性质根据现有关键字库(内建关键字库)进行比较,内部过滤采用equals方式对网络数据包进行侦听,对数据包的内容进行关键字比对,过滤到数据信息流的对应MD5值(Message Digest Algorithm),计算出其是否与内部数据库相似,判断是否拦截其数据。 [0012] The keyword-based data stream analysis, the main characteristic properties (built keyword library) comparing the keyword library according to the prior internal filter using the network data packets equals embodiment listening, the data packet SUMMARY keyword match, corresponding filtered value of the data stream is MD5 (message Digest Algorithm), which is similar to calculate whether an internal database, it is determined whether data interception.

[0013] I. 2合并数据流: [0013] I. 2 combined data stream:

[0014] 把已过滤关键字的数据信息流合并,采用MD5 (Message Digest Algorithm)混合连接方式,合并数据流。 [0014] The filtered data streams are combined keyword using MD5 (Message Digest Algorithm) mixed connection, merging the data stream.

[0015] I. 3数据流提交服务器设备 [0015] I. 3 Submit stream server device

[0016] 已合并好后的数据流可以由无线终端设备或PC硬件设备或服务器方式,采用Method加密方式(POST)到服务器设备。 [0016] The data stream may well have been combined by the wireless terminal device PC or hardware device or a server mode, encryption using Method (POST) to the server device. 采用更为Security的加密方式。 More use of encryption Security. 比如:通过非POST提交数据,信息数据可能会暴露在表现层,并被保存在其他介质的缓存中,一但查看缓存就造成不安全性。 For example: the data submitted by non-POST, information and data may be exposed to the presentation layer, and is stored in the cache of other media, but to see a cache causing insecurity. 除此之外,使用非POST提交数据还可能会造成Cross-site requestforgery 攻击。 In addition, the use of non-POST to submit data also may cause Cross-site requestforgery attack.

[0017] 2.服务器设备数据交互: [0017] 2. The server apparatus data exchange:

[0018] 2. I建立数据库连接: [0019] 建立各个表所需要的表空间,根据设备差异性采用SQL硬件标准。 [0018] 2. I database connection is established: [0019] various tables needed to establish the table space, according to the difference of the hardware device uses SQL standard. 对于数据库内部查询权限进行授权。 Authorize permissions for internal query the database. 连接到数据库。 Connect to the database. (建立相关查询表空间。) (Establishment of relevant look-up table space.)

[0020] 2. 2数据库查询优化: [0020] 2.2 database query optimization:

[0021] 对于需要查询的各个数据进行二分法(dichotomy)和哈希(Hash)相互结合的优化。 [0021] for dichotomy (Dichotomy) for each data to be queried and hash (the Hash) optimization combined with each other. 从而达到输入数据精确匹配性,缩短查询所使用时间的目的。 So as to achieve an exact match of the input data, to shorten the time of the query object.

[0022] 二分法: [0022] dichotomy:

[0023] 二分查找又称折半查找,它是一种效率较高的查找方法。 [0023] Also known as binary search binary search, it is a high efficient search method.

[0024] 二分查找要求:线性表是有序表,即表中结点按关键字有序,并且要用向量作为表的存储结构。 [0024] The binary search requires: linear table is sorted list, i.e., the node table ordered by keyword, and use as a vector table storage structure. 不妨设有序表是递增有序的。 You may assume an ordered list is incremental and orderly.

[0025]哈希: [0025] Hash:

[0026] HASH主要用于信息安全领域中加密算法,它把一些不同长度的信息转化成杂乱的128位的编码,这些编码值叫做HASH值.也可以说,hash就是找到一种数据内容和数据存放地址之间的映射关系。 [0026] HASH mainly used in the field of information security encryption algorithm that combines a number of different length information is converted into 128-bit messy coding, the coding values ​​is called HASH value can also be said that, hash is to find a data content and data storing the mapping between address.

[0027] 2. 3验证数据正确性: [0027] 2.3 verify the correctness of data:

[0028] 数据再次由MD5方式进行二次奇偶校验验证数据正确性。 [0028] The second parity data is performed again to verify the correctness of the data by the MD5 mode.

[0029] MD5 :对MD5算法简要的叙述可以为:MD5以512位分组来处理输入的信息,且每一分组又被划分为16个32位子分组,经过了一系列的处理后,算法的输出由四个32位分组组成,将这四个32位分组级联后将生成一个128位散列值。 [0029] MD5: Brief description of the MD5 algorithm may be: MD5 to 512 to process information input packet, and each packet has been divided into 16 groups of 32 seats, after a series of processing, the output of the algorithm composed of four 32-bit packets to generate a 128-bit hash value of these four 32-bit block concatenation.

[0030] 2. 4内部查错机制: [0030] 2.4 Internal troubleshooting mechanisms:

[0031] 当连接或优化或验证出错时,内部服务器设备将自动记录错误发生的时间错误的编号,形成日志(LogData)。 [0031] When a connection error or optimization or validation, the internal server device will automatically record the number of times the error occurred error, forming a log (LogData). 方便查询,并会启动相应的内部查错机制,修复常见错误,保证数据查询稳定性和强壮性。 Convenient query, and will start the appropriate internal mechanisms troubleshooting, repair common errors, data query to ensure stability and robustness.

[0032] 2. 5内码转换: [0032] 2.5 code conversion:

[0033] 根据信息流查询方平台的不同语言要求,统一采用UTF8内码格式。 [0033] Depending on the request for information stream query languages ​​party platform, the uniform application UTF8 code format. 在输出到表现层时,可以减少乱码的发生频率,做到最优化输出和提高输出效率的作用。 When output to the presentation layer, can reduce the frequency of occurrence of distortion, so as to optimize and enhance the role of the output of the output efficiency.

[0034] UTF8 :UTF-8是UNICODE的一种变长字符编码又称万国码,由Ken Thompson于1992年创建。 [0034] UTF8: UTF8 UNICODE is a variable-length character encoding, also known as Unicode, created by Ken Thompson in 1992. 现在已经标准化为RFC 3629。 Now standardized as RFC 3629. UTF-8用I到6个字节编码UNICODE字符。 UTF-8 with I to 6 bytes UNICODE character encoding. 用在网页上可以同一页面显示中文简体繁体及其它语言(如日文,韩文) With the same page on a web page can display Simplified Chinese Traditional and other languages ​​(such as Japanese, Korean)

[0035] 2. 6获得查询结果: [0035] 2.6 to obtain query results:

[0036] 将经过校验和转换后的字符信息数据流做为查询结果返回到目标查询终端上。 [0036] The query result as the stream passes the checksum character information data and convert the query to return to the target terminal. 采用网络或WIFI或其他各种无线或有线(根据设备需求)的方式来传输。 Or WIFI network, or using various other wireless or wired transmitted (according to the needs of the device) manner.

[0037] 3.终端查询结果解析: [0037] 3. Terminal Analytical Results:

[0038] 3. I网络检测校验: [0038] 3. I network checks parity:

[0039] 由终端对信息数据流进行奇偶校验。 [0039] The parity of the data stream of information by the terminal. (采用的都是将数据流视为16位整数流进行重复叠加计算。为了计算检验和,首先把检验和字段置为O。然后,对有效数据范围内中每个16位进行二进制反码求和,结果存在检验和字段中,数据长度为奇数则补一字节O。当收到数据后,同样对有效数据范围中每个16位数进行二进制反码的求和。接收方在计算过程中包含了发送方存在首部中的检验和,首部在传输过程中没有发生任何差错,接收方计算的结果应该为全0或全I。若结果不是全0或全1,表示数据错误。) (Using the data stream are considered to be repeated 16-bit stream of integers overlay calculation. To calculate the checksum, the checksum field is first set to O. Then, the valid data range of each 16-bit binary code seeking trans and the result in the checksum field, the data length of a byte is an odd number, then fill O. Upon receipt of the data, equally valid data range of each 16-bit sums of the binary one. in the calculation process of the recipient contains the sender is present in the header checksum, the header of any error does not occur during transmission, the results of the recipient computing be all 0s or all I. If the result is not all zeros or all 1, data indicating the error.)

[0040] 3. 2输出校验: [0040] 3.2 Output Calibration:

[0041] 测试将要输出的字符是否为UTF8格式,若不是UTF8格式请求出错,内部则再次尝试使用“2. I建立数据库连接”的方式。 Character [0041] The test will be whether the UTF8 format output, if not UTF8 format request error, try again using an internal "2. I establish a database connection" approach. (UTF8注解请见2. 5内码转换)。 (UTF8 annotation code conversion see 2.5).

[0042] 3. 3数据转换: [0042] 3.3 Data Conversion:

[0043] 将数据流信息采用国际通用的extensible Markup Language方式进行转换,传递到终端表现层。 [0043] The data stream information using the international common extensible Markup Language conversion mode, the terminal is transmitted to the presentation layer.

[0044] 3. 4终端输出: [0044] 3.4 Output Terminal:

[0045] 使用已转换好的extensible Markup Language方式数据进行数据展现。 [0045] uses the translated good way extensible Markup Language data for data show. 要将按照一定规则编译出来的条形码转换成有意义的信息,需要经历扫描和译码两个过程。 To convert compiled according to certain rules of the bar code information into meaningful, we need to go through the process of scanning and decoding two. 物体的颜色是由其反射光的类型决定的,白色物体能反射各种波长的可见光,黑色物体则吸收各种波长的可见光,所以当条形码扫描器光源发出的光在条形码上反射后,反射光照射到条码扫描器内部的光电转换器上,光电转换器根据强弱不同的反射光信号,转换成相应的电信号。 Color of an object is determined by the type of the reflected light, white object can reflect various wavelengths of visible light, a black object absorbs the visible light of various wavelengths, so that when the light is reflected on the bar code the bar code scanner light source, the reflected light irradiated onto the interior of the bar code scanner photoelectric conversion, photoelectric conversion depending on the strength of the reflected optical signal into a corresponding electrical signal.

Claims (2)

1.本发明是,有关一种从条形码数据经滤浊方法处理后,再从数据库海量数据中请求包括搜索项内所匹配的数据的技术方法。 1. The present invention, relates to a method of treating turbid after filtration, and then request from the bar code data from the database comprises the mass art methods data within the search item data matching. 搜索引擎可实现一种使用搜索项的条形码或关键字来搜索结果的方法。 Search engines can achieve a method of using a bar code or keyword search terms to the search results.
2.根据权利要求I所述的条形码滤浊匹配搜索引擎的技术,其特征是:通过无线终端使用条形码或关键字来搜索结果的方法。 The bar code matched filter turbidity search engine technology to claim I, wherein: the search results by using a bar code or a keyword by the wireless terminal method. 该发明可以对条形码或相关关键字信息来进行搜索查询。 The invention can be barcode or keyword information related to search queries. 从该搜索引擎方法获得的数据信息流可以返回到目标查询终端设备上。 Data stream obtained from the method may return to a search engine query the target terminal apparatus.
CN2011101046051A 2011-04-26 2011-04-26 Barcode filtering and matching engine technology CN102760125A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011101046051A CN102760125A (en) 2011-04-26 2011-04-26 Barcode filtering and matching engine technology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011101046051A CN102760125A (en) 2011-04-26 2011-04-26 Barcode filtering and matching engine technology

Publications (1)

Publication Number Publication Date
CN102760125A true CN102760125A (en) 2012-10-31

Family

ID=47054583

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011101046051A CN102760125A (en) 2011-04-26 2011-04-26 Barcode filtering and matching engine technology

Country Status (1)

Country Link
CN (1) CN102760125A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020026358A1 (en) * 1999-04-22 2002-02-28 Miller Michael R. System, method and article of manufacture for alerting a user to a promotional offer for a product based on user-input bar code information
CN101625681A (en) * 2008-07-11 2010-01-13 江苏怡丰通信设备有限公司 Quick commodity information query method
CN102007416A (en) * 2008-02-14 2011-04-06 三星电子株式会社 Bio-disc reading apparatus and assay method using same

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020026358A1 (en) * 1999-04-22 2002-02-28 Miller Michael R. System, method and article of manufacture for alerting a user to a promotional offer for a product based on user-input bar code information
CN102007416A (en) * 2008-02-14 2011-04-06 三星电子株式会社 Bio-disc reading apparatus and assay method using same
CN101625681A (en) * 2008-07-11 2010-01-13 江苏怡丰通信设备有限公司 Quick commodity information query method

Similar Documents

Publication Publication Date Title
Harris et al. 4store: The design and implementation of a clustered RDF store
Leavitt Will NoSQL databases live up to their promise?
US7127467B2 (en) Managing expressions in a database system
US6148298A (en) System and method for aggregating distributed data
Dennis et al. Trade facilitation and export diversification
US6167393A (en) Heterogeneous record search apparatus and method
US8782017B2 (en) Representing and manipulating RDF data in a relational database management system
US5884304A (en) Alternate key index query apparatus and method
US20070174304A1 (en) Querying social networks
US20110191361A1 (en) System and method for building a cloud aware massive data analytics solution background
CN1716958B (en) System safety realizing method and relative system using sub form automatic machine
US7366735B2 (en) Efficient extraction of XML content stored in a LOB
US20060236224A1 (en) Method and apparatus for processing markup language information
Manku et al. Detecting near-duplicates for web crawling
US7398265B2 (en) Efficient query processing of XML data using XML index
US20090106271A1 (en) Secure search of private documents in an enterprise content management system
US8537160B2 (en) Generating distributed dataflow graphs
US7873663B2 (en) Methods and apparatus for converting a representation of XML and other markup language data to a data structure format
JP5002751B2 (en) 2-stage data validation and mapping for database access
US9639578B2 (en) System and method for investigating large amounts of data
Elliott et al. A complete translation from SPARQL into efficient SQL
US20110035390A1 (en) Message Descriptions
US8346737B2 (en) Encoding of hierarchically organized data for efficient storage and processing
CN1282332C (en) A method of fast data packet filtering
US8572127B2 (en) Structure based storage, query, update and transfer of tree-based documents

Legal Events

Date Code Title Description
C06 Publication
C10 Entry into substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)