WO2018032249A1 - Audio data fetching method and system - Google Patents
Audio data fetching method and system Download PDFInfo
- Publication number
- WO2018032249A1 WO2018032249A1 PCT/CN2016/095298 CN2016095298W WO2018032249A1 WO 2018032249 A1 WO2018032249 A1 WO 2018032249A1 CN 2016095298 W CN2016095298 W CN 2016095298W WO 2018032249 A1 WO2018032249 A1 WO 2018032249A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- search
- keyword
- search results
- audio
- unit
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
Definitions
- the present invention relates to the field of big data, and in particular, to a method and system for capturing audio data.
- Big data refers to a collection of data that cannot be captured, managed, and processed by conventional software tools within a certain time frame. It requires a new processing model to have stronger decision-making power, insight and process optimization capabilities to adapt to massive and high growth. Rate and diversified information assets, there are various big data available, such as image data. How to search for the desired data from the image data is a problem worth studying. The existing technical solutions cannot achieve effective image data. search for.
- the application provides a method for capturing audio data. It solves the shortcomings of the prior art technical solution that the effective search of picture data cannot be realized.
- a method of capturing audio data comprising the steps of:
- Baidu search and Google search are opened in the big picture data to search according to the keyword;
- the two search results are displayed side by side in the form of a text index corresponding to the audio.
- the method further includes:
- search results are the same in both search results, the same search results are displayed on either page.
- the method further includes:
- a crawling system for audio data comprising:
- An obtaining unit for obtaining a keyword to be searched An obtaining unit for obtaining a keyword to be searched
- a search unit for opening a Baidu search and a Google search in the image big data according to the keyword, respectively searching according to the keyword;
- the paging unit is configured to display the two search results in the left and right pages in the manner of the text index corresponding to the audio.
- system further includes:
- system further includes:
- a blocking unit that shields the promoted picture.
- the technical solution provided by the present invention acquires a keyword to be searched, and according to the keyword, the Baidu search and the Google search are respectively searched according to the keyword, and the two search results are displayed on the left and right by the text index corresponding to the audio, so Has the advantage of effective search.
- FIG. 1 is a flowchart of a method for capturing audio data according to a first preferred embodiment of the present invention
- FIG. 2 is a structural diagram of a capture system for audio data according to a second preferred embodiment of the present invention.
- FIG. 1 is a schematic diagram of a method for capturing audio data according to a first preferred embodiment of the present invention. The method is as shown in FIG.
- Step S101 Acquire a keyword to be searched
- Step S102 Open a Baidu search and a Google search in the big picture data according to the keyword, and perform a search according to the keyword respectively;
- Step S103 Display the two search results in left and right page by way of the text index corresponding to the audio.
- the technical solution provided by the present invention acquires a keyword to be searched, and according to the keyword, the Baidu search and the Google search are respectively searched according to the keyword, and the two search results are displayed on the left and right by the text index corresponding to the audio, so Has the advantage of effective search.
- the foregoing method may further include:
- search results are the same in both search results, the same search results are displayed on either page.
- the foregoing method may further include:
- FIG. 2 is a schematic diagram of a data capture system according to a second preferred embodiment of the present invention.
- the system includes:
- An obtaining unit 201 configured to acquire a keyword to be searched
- the searching unit 202 is configured to perform Baidu search and Google search in the big picture data according to the keyword, and perform search according to the keyword respectively;
- the paging unit 203 is configured to display the two search results in a left and right page by way of a text index corresponding to the audio.
- the technical solution provided by the present invention acquires a keyword to be searched, and according to the keyword, the Baidu search and the Google search are respectively searched according to the keyword, and the two search results are displayed on the left and right by the text index corresponding to the audio, so Has the advantage of effective search.
- the above system may further include:
- the allocating unit 204 is configured to display the same search result on any one of the pages if the two search results have the same search result.
- the above system may further include:
- the shielding unit 205 is configured to block the promoted picture.
- the program may be stored in a computer readable storage medium, and the storage medium may include: Flash drive, read-only memory (English: Read-Only Memory, referred to as: ROM), random accessor (English: Random Access Memory, referred to as: RAM), disk or CD.
- ROM Read-Only Memory
- RAM Random Access Memory
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
An audio data fetching method and system. The method comprises the following steps: acquiring a keyword for searching (101); launching, according to the keyword, a Baidu search and a Google search to perform searches according to the keyword (102); and displaying, by means of indexing with text corresponding to audio, the two search results respectively on left and right pages (103). The technical solution of the present invention has the advantage of search effectiveness.
Description
本发明涉及大数据领域,尤其涉及一种音频数据的抓取方法及系统。The present invention relates to the field of big data, and in particular, to a method and system for capturing audio data.
大数据(big
data),指无法在一定时间范围内用常规软件工具进行捕捉、管理和处理的数据集合,是需要新处理模式才能具有更强的决策力、洞察发现力和流程优化能力来适应海量、高增长率和多样化的信息资产,现有的大数据有多样,例如图片数据,如何从图片数据中搜索出想要的数据是一个很值得研究的问题,现有的技术方案无法实现图片数据的有效搜索。Big data (big
Data) refers to a collection of data that cannot be captured, managed, and processed by conventional software tools within a certain time frame. It requires a new processing model to have stronger decision-making power, insight and process optimization capabilities to adapt to massive and high growth. Rate and diversified information assets, there are various big data available, such as image data. How to search for the desired data from the image data is a problem worth studying. The existing technical solutions cannot achieve effective image data. search for.
本申请提供一种音频数据的抓取方法。其解决现有技术的技术方案无法实现图片数据的有效搜索的缺点。The application provides a method for capturing audio data. It solves the shortcomings of the prior art technical solution that the effective search of picture data cannot be realized.
一方面,提供一种音频数据的抓取方法,所述方法包括如下步骤:In one aspect, a method of capturing audio data is provided, the method comprising the steps of:
获取需搜索的关键字;Get the keywords to search for;
依据该关键字在图片大数据内开通百度搜索和谷歌搜索分别依据该关键字进行搜索;According to the keyword, Baidu search and Google search are opened in the big picture data to search according to the keyword;
将两个搜索结果以音频对应的文字索引的方式左右分页显示。The two search results are displayed side by side in the form of a text index corresponding to the audio.
可选的,所述方法还包括:Optionally, the method further includes:
如两个搜索结果中具有相同的搜索结果,将相同的搜索结果在任一个分页显示。If the search results are the same in both search results, the same search results are displayed on either page.
可选的,所述方法还包括:Optionally, the method further includes:
将推广的图片屏蔽。Block the promoted image.
第二方面,提供一种音频数据的抓取系统,所述系统包括:In a second aspect, a crawling system for audio data is provided, the system comprising:
获取单元,用于获取需搜索的关键字;An obtaining unit for obtaining a keyword to be searched;
搜索单元,用于依据该关键字在图片大数据内开通百度搜索和谷歌搜索分别依据该关键字进行搜索;a search unit for opening a Baidu search and a Google search in the image big data according to the keyword, respectively searching according to the keyword;
分页单元,用于将两个搜索结果以音频对应的文字索引的方式左右分页显示。The paging unit is configured to display the two search results in the left and right pages in the manner of the text index corresponding to the audio.
可选的,所述系统还包括:Optionally, the system further includes:
分配单元,用于如两个搜索结果中具有相同的搜索结果,将相同的搜索结果在任一个分页显示。An allocation unit for having the same search result as in two search results, displaying the same search result on any one of the pages.
可选的,所述系统还包括:Optionally, the system further includes:
屏蔽单元,用于将推广的图片屏蔽。A blocking unit that shields the promoted picture.
本发明提供的技术方案获取需搜索的关键字,依据该关键字开通百度搜索和谷歌搜索分别依据该关键字进行搜索,将两个搜索结果以音频对应的文字索引的方式左右分页显示,所以其具有有效搜索的优点。The technical solution provided by the present invention acquires a keyword to be searched, and according to the keyword, the Baidu search and the Google search are respectively searched according to the keyword, and the two search results are displayed on the left and right by the text index corresponding to the audio, so Has the advantage of effective search.
为了更清楚地说明本发明实施例的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the description of the embodiments will be briefly described below. It is obvious that the drawings in the following description are some embodiments of the present invention. Those skilled in the art can also obtain other drawings based on these drawings without paying any creative work.
图1为本发明第一较佳实施方式提供的一种音频数据的抓取方法的流程图;1 is a flowchart of a method for capturing audio data according to a first preferred embodiment of the present invention;
图2为本发明第二较佳实施方式提供的一种音频数据的抓取系统的结构图。2 is a structural diagram of a capture system for audio data according to a second preferred embodiment of the present invention.
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention. It is obvious that the described embodiments are a part of the embodiments of the present invention, but not all embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
请参考图1,图1是本发明第一较佳实施方式提出的一种音频数据的抓取方法,该方法如图1所示,包括如下步骤:Please refer to FIG. 1. FIG. 1 is a schematic diagram of a method for capturing audio data according to a first preferred embodiment of the present invention. The method is as shown in FIG.
步骤S101、获取需搜索的关键字;Step S101: Acquire a keyword to be searched;
步骤S102、依据该关键字在图片大数据内开通百度搜索和谷歌搜索分别依据该关键字进行搜索;Step S102: Open a Baidu search and a Google search in the big picture data according to the keyword, and perform a search according to the keyword respectively;
步骤S103、将两个搜索结果以音频对应的文字索引的方式左右分页显示。Step S103: Display the two search results in left and right page by way of the text index corresponding to the audio.
本发明提供的技术方案获取需搜索的关键字,依据该关键字开通百度搜索和谷歌搜索分别依据该关键字进行搜索,将两个搜索结果以音频对应的文字索引的方式左右分页显示,所以其具有有效搜索的优点。The technical solution provided by the present invention acquires a keyword to be searched, and according to the keyword, the Baidu search and the Google search are respectively searched according to the keyword, and the two search results are displayed on the left and right by the text index corresponding to the audio, so Has the advantage of effective search.
可选的,上述方法在步骤S103之后还可以包括:Optionally, after the step S103, the foregoing method may further include:
如两个搜索结果中具有相同的搜索结果,将相同的搜索结果在任一个分页显示。If the search results are the same in both search results, the same search results are displayed on either page.
可选的,上述方法在步骤S103之后还可以包括:Optionally, after the step S103, the foregoing method may further include:
将推广的图片屏蔽。Block the promoted image.
请参考图2,图2是本发明第二较佳实施方式提出的一种音频数据的抓取系统,该系统包括:Please refer to FIG. 2. FIG. 2 is a schematic diagram of a data capture system according to a second preferred embodiment of the present invention. The system includes:
获取单元201,用于获取需搜索的关键字;An obtaining unit 201, configured to acquire a keyword to be searched;
搜索单元202,用于依据该关键字在图片大数据内开通百度搜索和谷歌搜索分别依据该关键字进行搜索;The searching unit 202 is configured to perform Baidu search and Google search in the big picture data according to the keyword, and perform search according to the keyword respectively;
分页单元203,用于将两个搜索结果以音频对应的文字索引的方式左右分页显示。The paging unit 203 is configured to display the two search results in a left and right page by way of a text index corresponding to the audio.
本发明提供的技术方案获取需搜索的关键字,依据该关键字开通百度搜索和谷歌搜索分别依据该关键字进行搜索,将两个搜索结果以音频对应的文字索引的方式左右分页显示,所以其具有有效搜索的优点。The technical solution provided by the present invention acquires a keyword to be searched, and according to the keyword, the Baidu search and the Google search are respectively searched according to the keyword, and the two search results are displayed on the left and right by the text index corresponding to the audio, so Has the advantage of effective search.
可选的,上述系统还可以包括:Optionally, the above system may further include:
分配单元204,用于如两个搜索结果中具有相同的搜索结果,将相同的搜索结果在任一个分页显示。The allocating unit 204 is configured to display the same search result on any one of the pages if the two search results have the same search result.
可选的,上述系统还可以包括:Optionally, the above system may further include:
屏蔽单元205,用于将推广的图片屏蔽。The shielding unit 205 is configured to block the promoted picture.
需要说明的是,对于前述的各个方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明并不受所描述的动作顺序的限制,因为依据本发明,某一些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定是本发明所必须的。It should be noted that, for the foregoing various method embodiments, for the sake of simple description, they are all expressed as a series of action combinations, but those skilled in the art should understand that the present invention is not limited by the described action sequence. Because certain steps may be performed in other sequences or concurrently in accordance with the present invention. In addition, those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions and modules involved are not necessarily required by the present invention.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详细描述的部分,可以参见其他实施例的相关描述。In the above embodiments, the descriptions of the various embodiments are different, and the parts that are not described in detail in a certain embodiment can be referred to the related descriptions of other embodiments.
本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序可以存储于一计算机可读存储介质中,存储介质可以包括:闪存盘、只读存储器(英文:Read-Only
Memory ,简称:ROM)、随机存取器(英文:Random Access Memory,简称:RAM)、磁盘或光盘等。A person skilled in the art may understand that all or part of the various steps of the foregoing embodiments may be performed by a program to instruct related hardware. The program may be stored in a computer readable storage medium, and the storage medium may include: Flash drive, read-only memory (English: Read-Only
Memory, referred to as: ROM), random accessor (English: Random Access Memory, referred to as: RAM), disk or CD.
以上对本发明实施例所提供的内容下载方法及相关设备、系统进行了详细介绍,本文中应用了具体个例对本发明的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法及其核心思想;同时,对于本领域的一般技术人员,依据本发明的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本发明的限制。The content downloading method and the related device and system provided by the embodiments of the present invention are described in detail above. The principles and implementation manners of the present invention are described in the specific examples. The description of the above embodiments is only used to help understand the present invention. The method of the invention and its core idea; at the same time, for the person of ordinary skill in the art, according to the idea of the present invention, there are some changes in the specific embodiment and the scope of application. In summary, the content of the specification should not be understood. To limit the invention.
Claims (6)
- 一种音频数据的抓取方法,其特征在于,所述方法包括如下步骤: A method for capturing audio data, characterized in that the method comprises the following steps:获取需搜索的关键字;Get the keywords to search for;依据该关键字在图片大数据内开通百度搜索和谷歌搜索分别依据该关键字进行搜索;According to the keyword, Baidu search and Google search are opened in the big picture data to search according to the keyword;将两个搜索结果以音频对应的文字索引的方式左右分页显示。The two search results are displayed side by side in the form of a text index corresponding to the audio.
- 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method of claim 1 further comprising:如两个搜索结果中具有相同的搜索结果,将相同的搜索结果在任一个分页显示。If the search results are the same in both search results, the same search results are displayed on either page.
- 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method of claim 1 further comprising:将推广的图片屏蔽。Block the promoted image.
- 一种音频数据的抓取系统,其特征在于,所述系统包括:A crawling system for audio data, characterized in that the system comprises:获取单元,用于获取需搜索的关键字;An obtaining unit for obtaining a keyword to be searched;搜索单元,用于依据该关键字在图片大数据内开通百度搜索和谷歌搜索分别依据该关键字进行搜索;a search unit for opening a Baidu search and a Google search in the image big data according to the keyword, respectively searching according to the keyword;分页单元,用于将两个搜索结果以音频对应的文字索引的方式左右分页显示。The paging unit is configured to display the two search results in the left and right pages in the manner of the text index corresponding to the audio.
- 根据权利要求4所述的系统,其特征在于,所述系统还包括:The system of claim 4, wherein the system further comprises:分配单元,用于如两个搜索结果中具有相同的搜索结果,将相同的搜索结果在任一个分页显示。An allocation unit for having the same search result as in two search results, displaying the same search result on any one of the pages.
- 根据权利要求4所述的系统,其特征在于,所述系统还包括:The system of claim 4, wherein the system further comprises:屏蔽单元,用于将推广的图片屏蔽。 A blocking unit that shields the promoted picture.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2016/095298 WO2018032249A1 (en) | 2016-08-15 | 2016-08-15 | Audio data fetching method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2016/095298 WO2018032249A1 (en) | 2016-08-15 | 2016-08-15 | Audio data fetching method and system |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018032249A1 true WO2018032249A1 (en) | 2018-02-22 |
Family
ID=61196087
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2016/095298 WO2018032249A1 (en) | 2016-08-15 | 2016-08-15 | Audio data fetching method and system |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2018032249A1 (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102004782A (en) * | 2010-11-25 | 2011-04-06 | 北京搜狗科技发展有限公司 | Search result sequencing method and search result sequencer |
US8078606B2 (en) * | 2003-06-27 | 2011-12-13 | At&T Intellectual Property I, L.P. | Rank-based estimate of relevance values |
CN105117476A (en) * | 2015-09-08 | 2015-12-02 | 刘珉恺 | Search method based on network platform |
CN105683966A (en) * | 2016-01-30 | 2016-06-15 | 深圳市博信诺达经贸咨询有限公司 | Searching method and searching system based on big data |
CN105849730A (en) * | 2016-03-25 | 2016-08-10 | 马岩 | Data capture method and system |
CN106294802A (en) * | 2016-08-15 | 2017-01-04 | 马岩 | The grasping means of voice data and system |
-
2016
- 2016-08-15 WO PCT/CN2016/095298 patent/WO2018032249A1/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8078606B2 (en) * | 2003-06-27 | 2011-12-13 | At&T Intellectual Property I, L.P. | Rank-based estimate of relevance values |
CN102004782A (en) * | 2010-11-25 | 2011-04-06 | 北京搜狗科技发展有限公司 | Search result sequencing method and search result sequencer |
CN105117476A (en) * | 2015-09-08 | 2015-12-02 | 刘珉恺 | Search method based on network platform |
CN105683966A (en) * | 2016-01-30 | 2016-06-15 | 深圳市博信诺达经贸咨询有限公司 | Searching method and searching system based on big data |
CN105849730A (en) * | 2016-03-25 | 2016-08-10 | 马岩 | Data capture method and system |
CN106294802A (en) * | 2016-08-15 | 2017-01-04 | 马岩 | The grasping means of voice data and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2017128362A1 (en) | Searching method and system based on big data | |
WO2017161578A1 (en) | Method and system for data capturing | |
WO2018032249A1 (en) | Audio data fetching method and system | |
WO2018032254A1 (en) | Method and system for fetching trusted video in big data | |
WO2018032253A1 (en) | Secure search method and system for big data of images | |
WO2018032250A1 (en) | Text data search method and system for big data | |
WO2018027928A1 (en) | Forum big data capturing method and system | |
WO2018032245A1 (en) | Data search method and system for comment data of social networking software | |
WO2018032251A1 (en) | Method and system for applying security level to data fetching of big data | |
WO2018032252A1 (en) | Secure search method and system for big data on forums | |
WO2018032246A1 (en) | Search method and system for big data in local area network | |
WO2018027927A1 (en) | Webpage data searching method and system | |
WO2018032248A1 (en) | Image search application method and system for search in big data | |
WO2017128357A1 (en) | Big data-based method and system for webpage crawling | |
WO2018006254A1 (en) | Local area network mail data-based fetching method and system | |
WO2018006218A1 (en) | Local mail data-based fetching method and system | |
WO2018006217A1 (en) | Network mail data-based fetching method and system | |
WO2018006256A1 (en) | Local mail data collection method and system | |
WO2018157330A1 (en) | Big data partitioning method and system | |
WO2018006255A1 (en) | Network mail data collection method and system | |
WO2018014316A1 (en) | Method and system for collecting email data of local area network | |
WO2018032247A1 (en) | Search method and system for big data of videos | |
WO2018027463A1 (en) | Application method and system for keyword analysis in big data | |
WO2018157332A1 (en) | Statistical method and system applied to big data | |
WO2018027460A1 (en) | Method and system for algorithm comparison |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16913013 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 16913013 Country of ref document: EP Kind code of ref document: A1 |