WO2017164510A3 - Method for tagging multimedia content based on voice data, and system using same - Google Patents
Method for tagging multimedia content based on voice data, and system using same
- Publication number
- WO2017164510A3 (application PCT/KR2017/001103)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- multimedia content
- voice
- voice data
- tag
- search
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/48—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/43—Querying
- G06F16/432—Query formulation
- G06F16/433—Query formulation using audio data
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Mathematical Physics (AREA)
- Library & Information Science (AREA)
- Telephonic Communication Services (AREA)
Abstract
Disclosed are a method for tagging multimedia content based on voice data, which generates a voice tag from the voice data of multimedia content and tags the multimedia content with the generated voice tag, and a system using the method. The method comprises the steps of: a server generating a voice tag based on extracted voice keyword information; and the server tagging multimedia content with the generated voice tag. Consequently, a search service that lets a mobile terminal user search for desired multimedia content can be provided to the user. In addition, in a search for a specific search term, a reliable search result can be obtained by searching, among the voice tags generated from voice data, for those associated with the specific search term.
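The two server-side steps named in the abstract (generating a voice tag from extracted voice keyword information, then tagging the multimedia content with it) and the tag-based search can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the names `MultimediaContent`, `TagServer`, `generate_voice_tags`, `tag_content`, and `search` are hypothetical, a tag is simply a lowercased keyword, and keyword extraction from audio (speech recognition) is assumed to have happened already.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class MultimediaContent:
    """Hypothetical record for one piece of multimedia content."""
    content_id: str
    title: str
    voice_tags: set = field(default_factory=set)


class TagServer:
    """Hypothetical server holding content and an inverted index of voice tags."""

    def __init__(self):
        self.contents = {}             # content_id -> MultimediaContent
        self.index = defaultdict(set)  # voice tag -> set of content_ids

    def generate_voice_tags(self, keywords):
        # Assumption: a voice tag is the trimmed, lowercased keyword;
        # empty keywords are dropped.
        return {k.strip().lower() for k in keywords if k.strip()}

    def tag_content(self, content, keywords):
        # Step 1: generate voice tags from the extracted keyword information.
        tags = self.generate_voice_tags(keywords)
        # Step 2: attach the tags to the content and index them for search.
        content.voice_tags |= tags
        self.contents[content.content_id] = content
        for tag in tags:
            self.index[tag].add(content.content_id)
        return tags

    def search(self, term):
        # Look up content whose voice tags match the normalized search term.
        ids = self.index.get(term.strip().lower(), set())
        return [self.contents[i] for i in sorted(ids)]
```

In this sketch, calling `tag_content` with the keywords extracted from a clip's audio and then calling `search` with a user's search term returns the clips carrying a matching voice tag. The inverted index is one simple design choice for the lookup; the abstract does not specify how tags are stored or matched.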
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020160036059A KR101832050B1 (ko) | 2016-03-25 | 2016-03-25 | Voice data-based multimedia content tagging method and system using same |
KR10-2016-0036059 | 2016-03-25 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2017164510A2 (fr) | 2017-09-28 |
WO2017164510A3 (fr) | 2018-08-02 |
Family
ID=59900594
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/KR2017/001103 WO2017164510A2 (fr) | 2016-03-25 | 2017-02-02 | Method for tagging multimedia content based on voice data, and system using same |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR101832050B1 (fr) |
WO (1) | WO2017164510A2 (fr) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102523135B1 (ko) * | 2018-01-09 | 2023-04-21 | 삼성전자주식회사 | Electronic device and method for displaying subtitles by the electronic device |
CN109215657A (zh) * | 2018-11-23 | 2019-01-15 | 四川工大创兴大数据有限公司 | Voice robot for grain depot monitoring and application thereof |
KR20220138512A (ko) | 2021-04-05 | 2022-10-13 | 이피엘코딩 주식회사 | Method for image learning and recognition using voice tagging on a mobile device |
WO2023233421A1 (fr) * | 2022-05-31 | 2023-12-07 | Humanify Technologies Pvt Ltd | System and method for tagging multimedia content |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007156286A (ja) * | 2005-12-08 | 2007-06-21 | Hitachi Ltd | Information recognition device and information recognition program |
KR20090062371A (ko) * | 2007-12-13 | 2009-06-17 | 주식회사 그래텍 | System and method for providing additional information |
KR20130060226A (ko) * | 2010-04-30 | 2013-06-07 | 나우 테크놀로지스 (아이피) 리미티드 | Content management apparatus |
KR20130141094A (ko) * | 2012-06-15 | 2013-12-26 | 휴텍 주식회사 | Method for managing web content search using voice tags, and computer-readable recording medium storing a web content search management program therefor |
KR101356006B1 (ko) * | 2012-02-06 | 2014-02-12 | 한국과학기술원 | Method and apparatus for voice-based multimedia content tagging with section setting |
- 2016-03-25: KR application KR1020160036059A, patent KR101832050B1 (ko), status: active, IP Right Grant
- 2017-02-02: WO application PCT/KR2017/001103, patent WO2017164510A2 (fr), status: active, Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007156286A (ja) * | 2005-12-08 | 2007-06-21 | Hitachi Ltd | Information recognition device and information recognition program |
KR20090062371A (ko) * | 2007-12-13 | 2009-06-17 | 주식회사 그래텍 | System and method for providing additional information |
KR20130060226A (ko) * | 2010-04-30 | 2013-06-07 | 나우 테크놀로지스 (아이피) 리미티드 | Content management apparatus |
KR101356006B1 (ko) * | 2012-02-06 | 2014-02-12 | 한국과학기술원 | Method and apparatus for voice-based multimedia content tagging with section setting |
KR20130141094A (ko) * | 2012-06-15 | 2013-12-26 | 휴텍 주식회사 | Method for managing web content search using voice tags, and computer-readable recording medium storing a web content search management program therefor |
Also Published As
Publication number | Publication date |
---|---|
KR101832050B1 (ko) | 2018-02-23 |
KR20170111161A (ko) | 2017-10-12 |
WO2017164510A2 (fr) | 2017-09-28 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN104268166B (zh) | Input method, apparatus, and electronic device | |
US10332506B2 (en) | Computerized system and method for formatted transcription of multimedia content | |
US10366327B2 (en) | Generating vector representations of documents | |
WO2017164510A3 (fr) | Method for tagging multimedia content based on voice data, and system using same | |
US11966432B2 (en) | Media consumption context for personalized instant query suggest | |
WO2017019732A8 (fr) | Systems and methods for tracking data using user-provided data tags | |
MX2019003096A (es) | Presentation of video keyframes on online social networks. | |
MX2015007303A (es) | Method and device for establishing a tag library, and method and device for searching for a user. | |
KR102257910B1 (ko) | Apparatus and method for speech recognition, and apparatus and method for generating a noise-speech recognition model | |
WO2006083662A3 (fr) | System and method for generating travel product information on an interactive display with neighborhood categories | |
WO2013192218A3 (fr) | Dynamic language model | |
GB2523496A (en) | Systems and methods for computer assisted dispatch, incident report-based video search and tagging | |
CN104078044A (zh) | Mobile terminal and method and apparatus for searching its recordings | |
AU2017408800A1 (en) | Method and system of mining information, electronic device and readable storable medium | |
KR20190113712A (ko) | Question answering using environmental context | |
US20170115853A1 (en) | Determining Image Captions | |
SG11202000081XA (en) | Image retrieval methods and apparatuses, devices, and readable storage media | |
CN107943914A (zh) | Voice information processing method and apparatus | |
CN104599692A (zh) | Recording method and apparatus, and recorded content search method and apparatus | |
PH12018550213A1 (en) | System and method for learning-based group tagging | |
US9477664B2 (en) | Method and apparatus for querying media based on media characteristics | |
BG111708A (bg) | Method and system for searching and creating adapted content | |
CN104216896A (zh) | Method and apparatus for finding contact information | |
CN103971679B (zh) | Contact voice search method, apparatus, and mobile terminal | |
KR102536944B1 (ko) | Method and apparatus for processing a speech signal | |
Legal Events
Code | Title | Description |
---|---|---|
NENP | Non-entry into the national phase | Ref country code: DE |
121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 17770481; Country of ref document: EP; Kind code of ref document: A2 |
32PN | Ep: public notification in the ep bulletin as address of the addressee cannot be established | Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 21.01.2019) |
122 | Ep: pct application non-entry in european phase | Ref document number: 17770481; Country of ref document: EP; Kind code of ref document: A2 |