WO2015172359A1 - Object search method and apparatus - Google Patents
Object search method and apparatus
- Publication number
- WO2015172359A1 (PCT/CN2014/077566)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- user
- target object
- input
- gesture
- image area
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
- G06F3/0488—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
- G06F3/04883—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures for inputting data by handwriting, e.g. gesture or text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/242—Query formulation
- G06F16/2428—Query predicate definition using graphical user interfaces, including menus and forms
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/53—Querying
- G06F16/532—Query formulation, e.g. graphical querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/0482—Interaction with lists of selectable items, e.g. menus
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/04842—Selection of displayed objects or displayed text elements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2203/00—Indexing scheme relating to G06F3/00 - G06F3/048
- G06F2203/038—Indexing scheme relating to G06F3/038
- G06F2203/0381—Multimodal input, i.e. interface arrangements enabling the user to issue commands by simultaneous use of input devices of different nature, e.g. voice plus gesture on digitizer
Definitions
- The embodiments of the present invention provide an object search method and apparatus.
- The preferred embodiments of the present invention are described below in conjunction with the accompanying drawings. It should be understood that the preferred embodiments described herein are only for the purpose of illustration and explanation and are not intended to limit the invention. Where no conflict arises, the embodiments in the present application and the features in the embodiments can be combined with each other.
- Step 102: Determine, according to the voice input, a target object name that the user desires to search for and a feature category of the target object.
- Step 202: Determine, according to the voice input, a target object name that the user desires to search for and a feature category of the target object.
- After determining, in step 202, the target object name that the user desires to search for and the feature category of the target object, the terminal directly sends the category information of the feature category, the target object name, and the user-selected image area to the server; the server searches based on the received information and returns the search result to the terminal.
- The user can also enter the photographing mode and perform a photographing operation to obtain an image; correspondingly, the image currently photographed by the user is obtained as the image region selected by the user.
- There is no strict order between the operation of selecting the image region and the operation of inputting the feature category and the target object name.
- Step 302: The terminal determines, according to the voice input of the user, a target object name that the user desires to search for and a feature category of the target object.
- The feature information of the feature category of the image region selected by the user may be extracted by the terminal, or by the server, or by the terminal for some feature categories and by the server for the others. The feature categories whose feature information is extracted by the terminal can therefore be set as preset feature categories, and the above judgment is performed in this step.
- The first searching unit 404 is specifically configured to send the feature information and the target object name to the server, and to receive a search result returned by the server, where the search result is obtained by the server searching, according to the feature information, for the target object represented by the target object name.
- A second receiving unit 501, configured to receive a voice input and a gesture input of the user.
- The second determining unit 502 is further configured to: acquire the image area that the user selects from a specified image by using the gesture input, as the image area selected by the user; or acquire the image that the user obtains by photographing by using the gesture input, as the image area selected by the user.
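The terminal-side flow described in these excerpts can be illustrated with a minimal sketch. It assumes the voice input has already been converted to text by some speech recognizer, and that the gesture input arrives as a list of touch points. The category vocabulary, the set of preset (terminal-extractable) categories, the phrase pattern, the request field names, and the `extract_features` callable are all hypothetical and do not come from the patent text.

```python
import re
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical vocabularies; the excerpts above do not enumerate them.
FEATURE_CATEGORIES = ("color", "shape", "texture", "price", "brand")
# Hypothetical "preset feature categories" the terminal extracts by itself.
TERMINAL_EXTRACTABLE = {"color", "shape"}

# Assumed phrase shape: "search for a <name> of this <category>".
_QUERY = re.compile(
    r"(?:search for|find)\s+(?:an?\s+|the\s+)?(?P<name>[\w ]+?)"
    r"\s+(?:of|with|in)\s+this\s+(?P<category>\w+)",
    re.IGNORECASE,
)


@dataclass(frozen=True)
class VoiceQuery:
    target_name: str
    feature_category: str


@dataclass(frozen=True)
class Region:
    left: int
    top: int
    right: int
    bottom: int


def parse_voice(transcript: str) -> Optional[VoiceQuery]:
    """Determine the target object name and feature category from recognized speech."""
    match = _QUERY.search(transcript)
    if match is None:
        return None
    category = match.group("category").lower()
    if category not in FEATURE_CATEGORIES:
        return None
    return VoiceQuery(match.group("name").strip().lower(), category)


def region_from_gesture(points: list[tuple[int, int]]) -> Region:
    """Take the bounding box of the traced touch points as the selected image area."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return Region(min(xs), min(ys), max(xs), max(ys))


def build_search_request(
    query: VoiceQuery,
    region: Region,
    extract_features: Callable[[Region, str], object],
) -> dict:
    """Judge whether the category is preset, then build the message for the server."""
    if query.feature_category in TERMINAL_EXTRACTABLE:
        # Terminal extracts the feature information and sends it with the name.
        info = extract_features(region, query.feature_category)
        return {"target_name": query.target_name, "feature_info": info}
    # Otherwise the server extracts: send category, name, and the image area.
    return {
        "target_name": query.target_name,
        "feature_category": query.feature_category,
        "image_region": region,
    }
```

For a transcript such as "Search for a wallet of this color", this sketch would extract the color feature on the terminal and send it with the name "wallet". A category outside the preset set, such as "brand", would instead be forwarded to the server together with the selected image area.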
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- User Interface Of Digital Computer (AREA)
- Image Analysis (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
BR112016017262-0A BR112016017262B1 (pt) | 2014-05-15 | 2014-05-15 | Object search method and terminal communicatively coupled to a server |
CN201480003299.4A CN104854539B (zh) | 2014-05-15 | 2014-05-15 | Object search method and apparatus |
KR1020167020862A KR101864240B1 (ko) | 2014-05-15 | 2014-05-15 | Object search method and apparatus |
PCT/CN2014/077566 WO2015172359A1 (zh) | 2014-05-15 | 2014-05-15 | Object search method and apparatus |
JP2016550858A JP6316447B2 (ja) | 2014-05-15 | 2014-05-15 | Object search method and apparatus |
EP14892023.4A EP3001333A4 (en) | 2014-05-15 | 2014-05-15 | Method and apparatus for searching objects |
US14/902,227 US10311115B2 (en) | 2014-05-15 | 2014-05-15 | Object search method and apparatus |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2014/077566 WO2015172359A1 (zh) | 2014-05-15 | 2014-05-15 | Object search method and apparatus |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2015172359A1 (zh) | 2015-11-19 |
Family
ID=53852833
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2014/077566 WO2015172359A1 (zh) | Object search method and apparatus | 2014-05-15 | 2014-05-15 |
Country Status (7)
Country | Link |
---|---|
US (1) | US10311115B2 (zh) |
EP (1) | EP3001333A4 (zh) |
JP (1) | JP6316447B2 (zh) |
KR (1) | KR101864240B1 (zh) |
CN (1) | CN104854539B (zh) |
BR (1) | BR112016017262B1 (zh) |
WO (1) | WO2015172359A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10788902B2 (en) | 2016-06-22 | 2020-09-29 | Sony Corporation | Information processing device and information processing method |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101561628B1 (ko) * | 2013-12-30 | 2015-10-20 | 주식회사 케이티 | Search apparatus and search method for providing image information of smart glasses |
US10444977B2 (en) * | 2014-12-05 | 2019-10-15 | Verizon Patent And Licensing Inc. | Cellphone manager |
KR20170052364A (ko) * | 2015-11-04 | 2017-05-12 | 삼성전자주식회사 | Display apparatus and control method thereof |
CN107515868A (zh) * | 2016-06-15 | 2017-12-26 | 北京陌上花科技有限公司 | Search method and apparatus |
KR102055733B1 (ko) * | 2017-02-24 | 2019-12-13 | 권오민 | Method for providing image advertisements online |
KR102469717B1 (ko) * | 2017-08-01 | 2022-11-22 | 삼성전자주식회사 | Electronic device for providing search results for an object and control method thereof |
CN110119461B (zh) * | 2018-01-25 | 2022-01-14 | 阿里巴巴(中国)有限公司 | Method and apparatus for processing query information |
KR102630662B1 (ko) | 2018-04-02 | 2024-01-30 | 삼성전자주식회사 | Method for executing an application and electronic device supporting the same |
CN108874910B (zh) * | 2018-05-28 | 2021-08-17 | 思百达物联网科技(北京)有限公司 | Vision-based small-target recognition system |
CN108984730A (zh) * | 2018-07-12 | 2018-12-11 | 三星电子(中国)研发中心 | Search method and search device |
WO2020062392A1 (zh) | 2018-09-28 | 2020-04-02 | 上海寒武纪信息科技有限公司 | Signal processing apparatus, signal processing method, and related products |
KR102688902B1 (ko) | 2018-12-05 | 2024-07-26 | 제주대학교 산학협력단 | Cosmetic composition containing, as an active ingredient, an extract of unripe citrus fruit comprising citrus bio-gel |
JP7275795B2 (ja) * | 2019-04-15 | 2023-05-18 | コニカミノルタ株式会社 | Operation reception device, control method, image forming system, and program |
CN110765294B (zh) * | 2019-10-25 | 2021-03-12 | 深圳追一科技有限公司 | Image search method and apparatus, terminal device, and storage medium |
CN113093406A (zh) * | 2021-04-14 | 2021-07-09 | 陈祥炎 | Smart glasses |
CN116628327A (zh) * | 2023-02-16 | 2023-08-22 | 百度在线网络技术(北京)有限公司 | Search method and apparatus, electronic device, and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
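CN1952935A (zh) * | 2006-09-22 | 2007-04-25 | 南京搜拍信息技术有限公司 | Search system and search method making comprehensive use of image and text information |
CN101930457A (zh) * | 2010-08-13 | 2010-12-29 | 百度在线网络技术(北京)有限公司 | Method, device, and system enabling a user to quickly select an object and search |
CN102411627A (zh) * | 2010-12-16 | 2012-04-11 | 微软公司 | Image search including facial images |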
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH08166866A (ja) | 1994-10-14 | 1996-06-25 | Hitachi Ltd | Editing support system equipped with an interactive interface |
JPH10198695A (ja) * | 1997-01-13 | 1998-07-31 | Sharp Corp | Information processing apparatus |
US6513063B1 (en) * | 1999-01-05 | 2003-01-28 | Sri International | Accessing network-based electronic information through scripted online interfaces using spoken input |
JP3823129B2 (ja) * | 2001-12-07 | 2006-09-20 | 株式会社シガメック | Image search system and image search method |
JP2006107109A (ja) * | 2004-10-05 | 2006-04-20 | Canon Inc | Information management apparatus and information management method |
JP2007026316A (ja) * | 2005-07-20 | 2007-02-01 | Yamaha Motor Co Ltd | Image management apparatus, computer program for image management, and recording medium on which the program is recorded |
US7457825B2 (en) | 2005-09-21 | 2008-11-25 | Microsoft Corporation | Generating search requests from multimodal queries |
CN101071431A (zh) * | 2007-01-31 | 2007-11-14 | 腾讯科技(深圳)有限公司 | Method and system for image search using key graphics as search conditions |
CN100578508C (zh) | 2008-01-14 | 2010-01-06 | 上海博康智能信息技术有限公司 | Interactive image search system and method |
US20090287626A1 (en) | 2008-05-14 | 2009-11-19 | Microsoft Corporation | Multi-modal query generation |
US9978365B2 (en) * | 2008-10-31 | 2018-05-22 | Nokia Technologies Oy | Method and system for providing a voice interface |
US20100281435A1 (en) | 2009-04-30 | 2010-11-04 | At&T Intellectual Property I, L.P. | System and method for multimodal interaction using robust gesture processing |
US9087059B2 (en) * | 2009-08-07 | 2015-07-21 | Google Inc. | User interface for presenting search results for multiple regions of a visual query |
US8788434B2 (en) * | 2010-10-28 | 2014-07-22 | Google Inc. | Search with joint image-audio queries |
JP5794036B2 (ja) * | 2011-08-22 | 2015-10-14 | セイコーエプソン株式会社 | Image search apparatus, image search method, and program |
EP2783305A4 (en) | 2011-11-24 | 2015-08-12 | Microsoft Technology Licensing Llc | MULTIMODAL INTERACTIVE IMAGE SEARCH |
US9152376B2 (en) | 2011-12-01 | 2015-10-06 | At&T Intellectual Property I, L.P. | System and method for continuous multimodal speech and gesture interaction |
CN103246682A (zh) * | 2012-02-13 | 2013-08-14 | 联想(北京)有限公司 | Data search method and data search apparatus |
CN103020184B (zh) | 2012-11-29 | 2016-05-25 | 北京百度网讯科技有限公司 | Method and system for obtaining search results using a captured image |
2014
Date | Application | Publication | Status |
---|---|---|---|
2014-05-15 | KR1020167020862A | KR101864240B1 (ko) | Active, IP Right Grant |
2014-05-15 | PCT/CN2014/077566 | WO2015172359A1 (zh) | Active, Application Filing |
2014-05-15 | CN201480003299.4A | CN104854539B (zh) | Active |
2014-05-15 | JP2016550858A | JP6316447B2 (ja) | Active |
2014-05-15 | US14/902,227 | US10311115B2 (en) | Active |
2014-05-15 | EP14892023.4A | EP3001333A4 (en) | Not active, Ceased |
2014-05-15 | BR112016017262-0A | BR112016017262B1 (pt) | Active, IP Right Grant |
Non-Patent Citations (1)
Title |
---|
See also references of EP3001333A4 * |
Also Published As
Publication number | Publication date |
---|---|
EP3001333A4 (en) | 2016-08-24 |
EP3001333A1 (en) | 2016-03-30 |
KR20160104054A (ko) | 2016-09-02 |
JP2017513090A (ja) | 2017-05-25 |
US10311115B2 (en) | 2019-06-04 |
BR112016017262B1 (pt) | 2022-09-27 |
KR101864240B1 (ko) | 2018-06-04 |
CN104854539B (zh) | 2018-08-14 |
CN104854539A (zh) | 2015-08-19 |
BR112016017262A2 (zh) | 2017-08-08 |
JP6316447B2 (ja) | 2018-04-25 |
US20160147882A1 (en) | 2016-05-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2015172359A1 (zh) | Object search method and apparatus | |
US11120078B2 (en) | Method and device for video processing, electronic device, and storage medium | |
US10956793B1 (en) | Content tagging | |
US10810253B2 (en) | Information display method and device | |
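KR102467236B1 (ko) | Real-time tracking-compensated image effects | |
JP6410930B2 (ja) | Scheme for retrieving and associating content items with real-world objects using augmented reality and object recognition | |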
US20210303855A1 (en) | Augmented reality item collections | |
US20170161382A1 (en) | System to correlate video data and contextual data | |
US11769500B2 (en) | Augmented reality-based translation of speech in association with travel | |
US11983461B2 (en) | Speech-based selection of augmented reality content for detected objects | |
EP4173256A1 (en) | Travel-based augmented reality content for images | |
US20210392097A1 (en) | Bidirectional bridge for web view | |
CN106250421A (zh) | Photographing processing method and terminal | |
US11798550B2 (en) | Speech-based selection of augmented reality content | |
WO2021195404A1 (en) | Speech-based selection of augmented reality content for detected objects | |
WO2021252235A1 (en) | Software development kit engagement monitor | |
WO2016192284A1 (zh) | Method and apparatus for obtaining candidate address information in a map | |
WO2016082470A1 (zh) | Image processing method and apparatus, and computer storage medium | |
US20200097568A1 (en) | Fashion by trend user interfaces | |
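WO2018103544A1 (zh) | Method and apparatus for presenting business object data in an image | |
CN106250510B (zh) | Search method, apparatus, and system | |
KR102335972B1 (ko) | Method and apparatus for providing search recommendation information | |
CN114827744A (zh) | Bullet-screen comment processing method and apparatus | |
WO2015139204A1 (zh) | Picture management method and device |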
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| WWE | WIPO information: entry into national phase | Ref document number: 2014892023; Country of ref document: EP |
| 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 14892023; Country of ref document: EP; Kind code of ref document: A1 |
| WWE | WIPO information: entry into national phase | Ref document number: 14902227; Country of ref document: US |
| ENP | Entry into the national phase | Ref document number: 20167020862; Country of ref document: KR; Kind code of ref document: A |
| ENP | Entry into the national phase | Ref document number: 2016550858; Country of ref document: JP; Kind code of ref document: A |
| REG | Reference to national code | Ref country code: BR; Ref legal event code: B01A; Ref document number: 112016017262; Country of ref document: BR |
| NENP | Non-entry into the national phase | Ref country code: DE |
| ENP | Entry into the national phase | Ref document number: 112016017262; Country of ref document: BR; Kind code of ref document: A2; Effective date: 20160726 |