WO2018023484A1 - Method and system of implementing search of different parts of speech in big data - Google Patents

Method and system of implementing search of different parts of speech in big data Download PDF

Info

Publication number
WO2018023484A1
WO2018023484A1 PCT/CN2016/093042 CN2016093042W WO2018023484A1 WO 2018023484 A1 WO2018023484 A1 WO 2018023484A1 CN 2016093042 W CN2016093042 W CN 2016093042W WO 2018023484 A1 WO2018023484 A1 WO 2018023484A1
Authority
WO
WIPO (PCT)
Prior art keywords
keyword
speech
search
big data
user
Prior art date
Application number
PCT/CN2016/093042
Other languages
French (fr)
Chinese (zh)
Inventor
王晓光
Original Assignee
王晓光
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 王晓光 filed Critical 王晓光
Priority to PCT/CN2016/093042 priority Critical patent/WO2018023484A1/en
Publication of WO2018023484A1 publication Critical patent/WO2018023484A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A method and system for implementing the search of different parts of speech in big data. The method comprises the following steps: receiving a keyword input by a user (101); querying the part of speech of the keyword according to the keyword (102); and performing a search according to the part of speech of the keyword to obtain a search result (103). The method can implement good user experience.

Description

不同词性在大数据搜索中的实现方法及系统  Method and system for implementing different part of speech in big data search 技术领域Technical field
本发明涉及大数据领域,尤其涉及一种不同词性在大数据搜索中的实现方法及系统。The present invention relates to the field of big data, and in particular to a method and system for implementing different part of speech in big data search.
背景技术Background technique
大数据(big data),或称巨量资料,指的是需要新处理模式才能具有更强的决策力、洞察力和流程优化能力的海量、高增长率和多样化的信息资产。在维克托?迈尔-舍恩伯格及肯尼斯?库克耶编写的《大数据时代》中大数据指不用随机分析法(抽样调查)这样的捷径,而采用所有数据进行分析处理。大数据的4V特点:Volume(大量)、Velocity(高速)、Variety(多样)、Value(价值)。Big data (big Data), or massive data, refers to the massive, high-growth, and diverse information assets that require new processing models to have greater decision-making, insight, and process optimization capabilities. In the Big Data Era written by Victor Meyer Schonberg and Kenneth Cooke, big data refers to the use of all data for analysis without the use of random analysis (sample survey). 4V features of big data: Volume, Velocity, Variety, Value.
现有大数据在搜索时不准确,用户体验度低。 Existing big data is inaccurate in search and has a low user experience.
技术问题technical problem
提供一种不同词性在大数据搜索中的实现方法,其解决了现有技术用户体验度低的缺点。A method for implementing different part of speech in big data search is provided, which solves the shortcoming of low user experience in the prior art.
技术解决方案Technical solution
一方面,提供一种不同词性在大数据搜索中的实现方法,所述方法包括如下步骤:In one aspect, a method for implementing different part of speech in a big data search is provided, the method comprising the following steps:
接收用户输入的关键词;Receiving keywords input by the user;
依据该关键词查询出该关键词的词性;Querying the part of speech of the keyword according to the keyword;
依据关键词的词性实现搜索得到搜索结果。The search results are obtained by searching according to the part of speech of the keyword.
可选的,所述方法还包括:Optionally, the method further includes:
依据关键词出现的次数对搜索结果排序。Sort the search results based on the number of occurrences of the keyword.
可选的,所述方法还包括:Optionally, the method further includes:
获取用户的历史搜索结果,在搜索结果中屏蔽与历史搜索结果相同的内容。Get the user's historical search results and block the same content as the historical search results in the search results.
第二方面,提供一种不同词性在大数据搜索中的实现系统,所述系统包括:In a second aspect, a system for implementing different part of speech in a big data search is provided, the system comprising:
接收单元,用于接收用户输入的关键词;a receiving unit, configured to receive a keyword input by a user;
查询单元,用于依据该关键词查询出该关键词的词性;a query unit, configured to query the part of speech of the keyword according to the keyword;
搜索单元,用于依据关键词的词性实现搜索得到搜索结果。The search unit is configured to perform search according to the part of speech of the keyword to obtain the search result.
可选的,所述系统还包括:Optionally, the system further includes:
排序单元,用于依据关键词出现的次数对搜索结果排序。A sorting unit for sorting search results according to the number of occurrences of keywords.
可选的,所述系统还包括:Optionally, the system further includes:
屏蔽单元,用于获取用户的历史搜索结果,在搜索结果中屏蔽与历史搜索结果相同的内容。The shielding unit is configured to obtain a historical search result of the user, and block the same content as the historical search result in the search result.
有益效果Beneficial effect
本发明具体实施方式提供的技术方案接收用户输入的关键词,依据该关键词查询出该关键词的词性,依据关键词的词性实现搜索得到搜索结果,所以其具有搜索准确,体验度高的优点。The technical solution provided by the specific embodiment of the present invention receives a keyword input by a user, queries the part of speech of the keyword according to the keyword, and performs search according to the part of speech of the keyword to obtain a search result, so that the search has the advantages of accurate search and high experience. .
附图说明DRAWINGS
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments or the description of the prior art will be briefly described below. Obviously, the drawings in the following description are only It is a certain embodiment of the present invention, and other drawings can be obtained from those skilled in the art without any creative work.
图1为本发明提供的一种不同词性在大数据搜索中的实现方法的流程图;1 is a flow chart of a method for implementing different part of speech in big data search according to the present invention;
图2为本发明提供的一种不同词性在大数据搜索中的实现系统的结构图。FIG. 2 is a structural diagram of a system for implementing different part of speech in big data search according to the present invention.
本发明的实施方式Embodiments of the invention
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention. It is obvious that the described embodiments are only a part of the embodiments of the present invention, but not all embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
参阅图1,图1为本发明第一较佳实施方式提供的一种不同词性在大数据搜索中的实现方法的流程图,该方法由服务器来完成,该方法如图1所示,包括如下步骤:Referring to FIG. 1 , FIG. 1 is a flowchart of a method for implementing different part of speech in a big data search according to a first preferred embodiment of the present invention. The method is implemented by a server. The method is as shown in FIG. 1 and includes the following. step:
步骤S101、接收用户输入的关键词;Step S101: Receive a keyword input by a user;
步骤S102、依据该关键词查询出该关键词的词性;Step S102, querying the part of speech of the keyword according to the keyword;
步骤S103、依据关键词的词性实现搜索得到搜索结果。Step S103: Perform a search according to the part of speech of the keyword to obtain a search result.
本发明具体实施方式提供的技术方案接收用户输入的关键词,依据该关键词查询出该关键词的词性,依据关键词的词性实现搜索得到搜索结果,所以其具有搜索准确,体验度高的优点。The technical solution provided by the specific embodiment of the present invention receives a keyword input by a user, queries the part of speech of the keyword according to the keyword, and performs search according to the part of speech of the keyword to obtain a search result, so that the search has the advantages of accurate search and high experience. .
可选的,上述方法在步骤S103之后还可以包括:Optionally, after the step S103, the foregoing method may further include:
依据关键词出现的次数对搜索结果排序。Sort the search results based on the number of occurrences of the keyword.
可选的,上述方法在步骤S103之后还可以包括:Optionally, after the step S103, the foregoing method may further include:
获取用户的历史搜索结果,在搜索结果中屏蔽与历史搜索结果相同的内容。Get the user's historical search results and block the same content as the historical search results in the search results.
参阅图2,图2为本发明第二较佳实施方式提供的一种不同词性在大数据搜索中的实现系统,该系统包括:Referring to FIG. 2, FIG. 2 is a schematic diagram of a system for implementing different part of speech in a big data search according to a second preferred embodiment of the present invention. The system includes:
接收单元201,用于接收用户输入的关键词;The receiving unit 201 is configured to receive a keyword input by the user;
查询单元202,用于依据该关键词查询出该关键词的词性;The query unit 202 is configured to query the part of speech of the keyword according to the keyword;
搜索单元203,用于依据关键词的词性实现搜索得到搜索结果。The searching unit 203 is configured to perform a search according to the part of speech of the keyword to obtain a search result.
本发明具体实施方式提供的技术方案接收用户输入的关键词,依据该关键词查询出该关键词的词性,依据关键词的词性实现搜索得到搜索结果,所以其具有搜索准确,体验度高的优点。The technical solution provided by the specific embodiment of the present invention receives a keyword input by a user, queries the part of speech of the keyword according to the keyword, and performs search according to the part of speech of the keyword to obtain a search result, so that the search has the advantages of accurate search and high experience. .
可选的,上述系统还可以包括:Optionally, the above system may further include:
排序单元204,用于依据关键词出现的次数对搜索结果排序。The sorting unit 204 is configured to sort the search results according to the number of occurrences of the keywords.
可选的,上述系统还可以包括:Optionally, the above system may further include:
屏蔽单元205,用于获取用户的历史搜索结果,在搜索结果中屏蔽与历史搜索结果相同的内容。The masking unit 205 is configured to acquire a historical search result of the user, and block the same content as the historical search result in the search result.
需要说明的是,对于前述的各方法实施方式或实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明并不受所描述的动作顺序的限制,因为根据本发明,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述实施方式或实施例均属于优选实施例,所涉及的动作和单元并不一定是本发明所必须的。It should be noted that, for the foregoing method embodiments or embodiments, for the sake of simple description, they are all expressed as a series of action combinations, but those skilled in the art should know that the present invention is not subject to the described action sequence. Limitations, as certain steps may be performed in other sequences or concurrently in accordance with the present invention. In the following, those skilled in the art should also understand that the embodiments or examples described in the specification are preferred embodiments, and the actions and units involved are not necessarily required by the present invention.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the above embodiments, the descriptions of the various embodiments are different, and the details that are not detailed in a certain embodiment can be referred to the related descriptions of other embodiments.
本发明实施例方法中的步骤可以根据实际需要进行顺序调整、合并和删减。The steps in the method of the embodiment of the present invention may be sequentially adjusted, merged, and deleted according to actual needs.
本发明实施例装置中的单元可以根据实际需要进行合并、划分和删减。本领域的技术人员可以将本说明书中描述的不同实施例以及不同实施例的特征进行结合或组合。The units in the apparatus of the embodiment of the present invention may be combined, divided, and deleted according to actual needs. Those skilled in the art can combine or combine the different embodiments described in the specification and the features of the different embodiments.
通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到本发明可以用硬件实现,或固件实现,或它们的组合方式来实现。当使用软件实现时,可以将上述功能存储在计算机可读介质中或作为计算机可读介质上的一个或多个指令或代码进行传输。计算机可读介质包括计算机存储介质和通信介质,其中通信介质包括便于从一个地方向另一个地方传送计算机程序的任何介质。存储介质可以是计算机能够存取的任何可用介质。以此为例但不限于:计算机可读介质可以包括随机存取存储器(Random Access Memory,RAM)、只读存储器(Read-Only Memory,ROM)、电可擦可编程只读存储器(Electrically Erasable Programmable Read-Only Memory,EEPROM)、只读光盘(Compact Disc Read-Only Memory,CD-ROM)或其他光盘存储、磁盘存储介质或者其他磁存储设备、或者能够用于携带或存储具有指令或数据结构形式的期望的程序代码并能够由计算机存取的任何其他介质。此外。任何连接可以适当的成为计算机可读介质。例如,如果软件是使用同轴电缆、光纤光缆、双绞线、数字用户线(Digital Subscriber Line,DSL)或者诸如红外线、无线电和微波之类的无线技术从网站、服务器或者其他远程源传输的,那么同轴电缆、光纤光缆、双绞线、DSL或者诸如红外线、无线和微波之类的无线技术包括在所属介质的定影中。如本发明所使用的,盘(Disk)和碟(disc)包括压缩光碟(CD)、激光碟、光碟、数字通用光碟(DVD)、软盘和蓝光光碟,其中盘通常磁性的复制数据,而碟则用激光来光学的复制数据。上面的组合也应当包括在计算机可读介质的保护范围之内。Through the description of the above embodiments, those skilled in the art can clearly understand that the present invention can be implemented in hardware, firmware implementation, or a combination thereof. When implemented in software, the functions described above may be stored in or transmitted as one or more instructions or code on a computer readable medium. Computer readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one location to another. A storage medium may be any available media that can be accessed by a computer. Taking this as an example, but not limited to: the computer readable medium may include random access memory (Random) Access Memory, RAM), Read-Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (Electrically Erasable Programmable Read-Only Memory, EEPROM), Compact Disc Read-Only Memory, CD-ROM, or other optical disc storage, magnetic storage medium or other magnetic storage device, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer. Also. Any connection may suitably be a computer readable medium. For example, if the software is using coaxial cable, fiber optic cable, twisted pair, digital subscriber line (Digital Subscriber Line, DSL) or wireless technology such as infrared, radio and microwave transmission from a website, server or other remote source, then coaxial cable, fiber optic cable, twisted pair, DSL or such as infrared, wireless and microwave Wireless technology is included in the fixing of the associated medium. As used in the present invention, a disk and a disc include a compact disc (CD), a laser disc, a compact disc, a digital versatile disc (DVD), a floppy disk, and a Blu-ray disc, wherein the disc is usually magnetically copied, and the disc is The laser is used to optically replicate the data. Combinations of the above should also be included within the scope of the computer readable media.
总之,以上所述仅为本发明技术方案的较佳实施例而已,并非用于限定本发明的保护范围。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。 In summary, the above description is only a preferred embodiment of the technical solution of the present invention, and is not intended to limit the scope of the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and scope of the present invention are intended to be included within the scope of the present invention.

Claims (6)

  1. 一种不同词性在大数据搜索中的实现方法,其特征在于,所述方法包括如下步骤: A method for implementing different part of speech in big data search, characterized in that the method comprises the following steps:
    接收用户输入的关键词;Receiving keywords input by the user;
    依据该关键词查询出该关键词的词性;Querying the part of speech of the keyword according to the keyword;
    依据关键词的词性实现搜索得到搜索结果。The search results are obtained by searching according to the part of speech of the keyword.
  2. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method of claim 1 further comprising:
    依据关键词出现的次数对搜索结果排序。Sort the search results based on the number of occurrences of the keyword.
  3. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method of claim 1 further comprising:
    获取用户的历史搜索结果,在搜索结果中屏蔽与历史搜索结果相同的内容。Get the user's historical search results and block the same content as the historical search results in the search results.
  4. 一种不同词性在大数据搜索中的实现系统,其特征在于,所述系统包括:A system for implementing different part of speech in big data search, characterized in that the system comprises:
    接收单元,用于接收用户输入的关键词;a receiving unit, configured to receive a keyword input by a user;
    查询单元,用于依据该关键词查询出该关键词的词性;a query unit, configured to query the part of speech of the keyword according to the keyword;
    搜索单元,用于依据关键词的词性实现搜索得到搜索结果。The search unit is configured to perform search according to the part of speech of the keyword to obtain the search result.
  5. 根据权利要求4所述的系统,其特征在于,所述系统还包括:The system of claim 4, wherein the system further comprises:
    排序单元,用于依据关键词出现的次数对搜索结果排序。A sorting unit for sorting search results according to the number of occurrences of keywords.
  6. 根据权利要求4所述的系统,其特征在于,所述系统还包括:The system of claim 4, wherein the system further comprises:
    屏蔽单元,用于获取用户的历史搜索结果,在搜索结果中屏蔽与历史搜索结果相同的内容。The shielding unit is configured to obtain a historical search result of the user, and block the same content as the historical search result in the search result.
PCT/CN2016/093042 2016-08-03 2016-08-03 Method and system of implementing search of different parts of speech in big data WO2018023484A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/093042 WO2018023484A1 (en) 2016-08-03 2016-08-03 Method and system of implementing search of different parts of speech in big data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/093042 WO2018023484A1 (en) 2016-08-03 2016-08-03 Method and system of implementing search of different parts of speech in big data

Publications (1)

Publication Number Publication Date
WO2018023484A1 true WO2018023484A1 (en) 2018-02-08

Family

ID=61073071

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/093042 WO2018023484A1 (en) 2016-08-03 2016-08-03 Method and system of implementing search of different parts of speech in big data

Country Status (1)

Country Link
WO (1) WO2018023484A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1871603A (en) * 2003-08-21 2006-11-29 伊迪利亚公司 System and method for processing a query
CN104021198A (en) * 2014-06-16 2014-09-03 北京理工大学 Relational database information retrieval method and device based on ontology semantic index
CN104281698A (en) * 2014-10-15 2015-01-14 国云科技股份有限公司 Efficient big data query method
CN105786963A (en) * 2016-01-25 2016-07-20 汇智明德(北京)教育科技有限公司 Corpus searching method and system
CN106294645A (en) * 2016-08-03 2017-01-04 王晓光 Different part of speech realization method and systems in big data search

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1871603A (en) * 2003-08-21 2006-11-29 伊迪利亚公司 System and method for processing a query
CN104021198A (en) * 2014-06-16 2014-09-03 北京理工大学 Relational database information retrieval method and device based on ontology semantic index
CN104281698A (en) * 2014-10-15 2015-01-14 国云科技股份有限公司 Efficient big data query method
CN105786963A (en) * 2016-01-25 2016-07-20 汇智明德(北京)教育科技有限公司 Corpus searching method and system
CN106294645A (en) * 2016-08-03 2017-01-04 王晓光 Different part of speech realization method and systems in big data search

Similar Documents

Publication Publication Date Title
WO2017161578A1 (en) Method and system for data capturing
WO2018023484A1 (en) Method and system of implementing search of different parts of speech in big data
WO2018027464A1 (en) Implementation method and system for different parts of speech in big data search
WO2018023483A1 (en) Method and system for implementing real-time search of different languages in big data
WO2018023482A1 (en) Method and system for implementing voice search
WO2018027343A1 (en) Method and system for implementing voice search
WO2018027342A1 (en) Application method and system for synonym in big data search
WO2018023481A1 (en) Method and system for applying synonym in big data search
WO2018027341A1 (en) Category-based keyword searching method and system in big data
WO2018023480A1 (en) Keyword classification-based search method and system in big data
WO2018035663A1 (en) Infant detection method and system for internet of things quilt cover
WO2018027344A1 (en) Method and system for implementing real-time search of different languages in big data
WO2018035697A1 (en) Method and system for searching for house listings on internet
WO2018027470A1 (en) Method and system for sharing big data in wechat
WO2018027462A1 (en) Implementation method and system for search and comparison
WO2018027455A1 (en) Method and system for sharing big data in social network
WO2018027469A1 (en) Application method and system of keyword in big data storage
WO2018027456A1 (en) Method and system for specifying application to be shared in big data
WO2018027457A1 (en) Method and system for sharing mobile big data
WO2018027458A1 (en) Method and system for sharing big data in real time
WO2018027468A1 (en) Classified storage method and system for big data
WO2018027465A1 (en) Method and system for real-time backup based on big data
WO2018027463A1 (en) Application method and system for keyword analysis in big data
WO2018027460A1 (en) Method and system for algorithm comparison
WO2018027466A1 (en) Method and system for storing big data in distributed system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16911082

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 27/06/2019)

122 Ep: pct application non-entry in european phase

Ref document number: 16911082

Country of ref document: EP

Kind code of ref document: A1