WO2017164510A3 - Voice data-based multimedia content tagging method, and system using same

Info

Publication number
WO2017164510A3
WO2017164510A3 PCT/KR2017/001103 KR2017001103W WO2017164510A3 WO 2017164510 A3 WO2017164510 A3 WO 2017164510A3 KR 2017001103 W KR2017001103 W KR 2017001103W WO 2017164510 A3 WO2017164510 A3 WO 2017164510A3
Authority
WO
WIPO (PCT)
Prior art keywords
multimedia content
voice
voice data
tag
search
Prior art date
Application number
PCT/KR2017/001103
Other languages
English (en)
Korean (ko)
Other versions
WO2017164510A2 (fr)
Inventor
김준모
Original Assignee
김준모
Priority date
2016-03-25
Filing date
2017-02-02
Publication date
2018-08-02
Application filed by 김준모
Publication of WO2017164510A2
Publication of WO2017164510A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43 Querying
    • G06F16/432 Query formulation
    • G06F16/433 Query formulation using audio data
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Mathematical Physics (AREA)
  • Library & Information Science (AREA)
  • Telephonic Communication Services (AREA)

Abstract

Disclosed are a voice data-based multimedia content tagging method, which generates a voice tag from the voice data of multimedia content and tags the content with the generated voice tag, and a system using the method. The method comprises the steps of: having a server generate a voice tag on the basis of extracted voice keyword information; and having the server tag the multimedia content with the generated voice tag. As a result, a mobile terminal user can be offered a search service for finding desired multimedia content. In addition, a search on a specific search word can return a reliable result by looking up, among the voice tags generated from voice data, those associated with that search word.
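
The two server-side steps and the tag-based search can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes the voice data has already been transcribed to text, and it picks keywords by simple word frequency, a stand-in technique the abstract does not specify. All names here (MultimediaContent, TagServer, extract_voice_keywords, STOP_WORDS) are hypothetical.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass, field
from typing import Dict, List, Set

# Hypothetical stop-word list; the abstract does not say how keywords are chosen.
STOP_WORDS = {"a", "an", "and", "at", "the", "of", "to", "in", "is", "on", "for", "how"}


@dataclass
class MultimediaContent:
    content_id: str
    transcript: str  # assumed output of a speech recognizer run on the voice data
    voice_tags: List[str] = field(default_factory=list)


def extract_voice_keywords(transcript: str, top_n: int = 3) -> List[str]:
    """Return the top_n most frequent words that are not stop words."""
    words = [w.strip(".,!?:;").lower() for w in transcript.split()]
    counts = Counter(w for w in words if w and w not in STOP_WORDS)
    return [word for word, _ in counts.most_common(top_n)]


class TagServer:
    """Stand-in for the server that generates voice tags and attaches them to content."""

    def __init__(self) -> None:
        # Inverted index: voice tag -> ids of the content tagged with it.
        self.index: Dict[str, Set[str]] = defaultdict(set)

    def tag(self, content: MultimediaContent) -> None:
        # Step 1: generate voice tags from the extracted voice keywords.
        content.voice_tags = extract_voice_keywords(content.transcript)
        # Step 2: tag the content and record it in the search index.
        for voice_tag in content.voice_tags:
            self.index[voice_tag].add(content.content_id)

    def search(self, search_word: str) -> Set[str]:
        """Return ids of content whose voice tags match the search word."""
        return set(self.index.get(search_word.lower(), set()))
```

A caller would build a TagServer, pass each MultimediaContent to tag(), and then call search() with the user's search word. The inverted index keeps lookups independent of how many items have been tagged, which is one plausible way to serve the search described above.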
PCT/KR2017/001103 2016-03-25 2017-02-02 Voice data-based multimedia content tagging method, and system using same WO2017164510A2 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020160036059A KR101832050B1 (ko) 2016-03-25 2016-03-25 Voice data-based multimedia content tagging method and system using same
KR10-2016-0036059 2016-03-25

Publications (2)

Publication Number Publication Date
WO2017164510A2 (fr) 2017-09-28
WO2017164510A3 (fr) 2018-08-02

Family

ID=59900594

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2017/001103 WO2017164510A2 (fr) 2016-03-25 2017-02-02 Voice data-based multimedia content tagging method, and system using same

Country Status (2)

Country Link
KR (1) KR101832050B1 (fr)
WO (1) WO2017164510A2 (fr)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102523135B1 (ko) * 2018-01-09 2023-04-21 삼성전자주식회사 Electronic device and method for displaying subtitles by the electronic device
CN109215657A (zh) * 2018-11-23 2019-01-15 四川工大创兴大数据有限公司 Voice robot for grain depot monitoring and application thereof
KR20220138512A (ko) 2021-04-05 2022-10-13 이피엘코딩 주식회사 Image learning and recognition method using voice tagging on a mobile device
WO2023233421A1 (fr) * 2022-05-31 2023-12-07 Humanify Technologies Pvt Ltd System and method for tagging multimedia content

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
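JP2007156286A (ja) * 2005-12-08 2007-06-21 Hitachi Ltd Information recognition device and information recognition program
KR20090062371A (ko) * 2007-12-13 2009-06-17 주식회사 그래텍 System and method for providing additional information
KR20130060226A (ko) * 2010-04-30 2013-06-07 나우 테크놀로지스 (아이피) 리미티드 Content management apparatus
KR20130141094A (ko) * 2012-06-15 2013-12-26 휴텍 주식회사 Method for managing web content search using voice tags, and computer-readable recording medium storing a web content search management program therefor
KR101356006B1 (ko) * 2012-02-06 2014-02-12 한국과학기술원 Voice-based multimedia content tagging method and apparatus allowing section setting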

Also Published As

Publication number Publication date
KR101832050B1 (ko) 2018-02-23
KR20170111161A (ko) 2017-10-12
WO2017164510A2 (fr) 2017-09-28

Similar Documents

Publication Publication Date Title
CN104268166B (zh) Input method, device and electronic equipment
US10332506B2 (en) Computerized system and method for formatted transcription of multimedia content
US10366327B2 (en) Generating vector representations of documents
WO2017164510A3 (fr) Voice data-based multimedia content tagging method, and system using same
US11966432B2 (en) Media consumption context for personalized instant query suggest
WO2017019732A8 (fr) Systems and methods for tracking data using user-provided data tags
MX2019003096A (es) Presentation of video keyframes on online social networks.
MX2015007303A (es) Method and device for establishing a tag library, and method and device for searching for a user.
KR102257910B1 (ko) Speech recognition apparatus and method, and noise-speech recognition model generation apparatus and method
WO2006083662A3 (fr) System and method for generating travel product information on an interactive display with neighborhood categories
WO2013192218A3 (fr) Dynamic language model
GB2523496A (en) Systems and methods for computer assisted dispatch, incident report-based video search and tagging
CN104078044A (zh) Mobile terminal and method and device for searching its recordings
AU2017408800A1 (en) Method and system of mining information, electronic device and readable storable medium
KR20190113712A (ko) Question answering using environmental context
US20170115853A1 (en) Determining Image Captions
SG11202000081XA (en) Image retrieval methods and apparatuses, devices, and readable storage media
CN107943914A (zh) Voice information processing method and device
CN104599692A (zh) Recording method and device, and recorded content search method and device
PH12018550213A1 (en) System and method for learning-based group tagging
US9477664B2 (en) Method and apparatus for querying media based on media characteristics
BG111708A (bg) Method and system for searching and creating adapted content
CN104216896A (zh) Method and device for finding contact information
CN103971679B (zh) Contact voice search method, device and mobile terminal
KR102536944B1 (ko) Method and apparatus for processing speech signals

Legal Events

Date Code Title Description
NENP Non-entry into the national phase
    Ref country code: DE
121 EP: the EPO has been informed by WIPO that EP was designated in this application
    Ref document number: 17770481; Country of ref document: EP; Kind code of ref document: A2
32PN EP: public notification in the EP bulletin as address of the addressee cannot be established
    Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 21.01.2019)
122 EP: PCT application non-entry in European phase
    Ref document number: 17770481; Country of ref document: EP; Kind code of ref document: A2