KR102446300B1 - 음성 기록을 위한 음성 인식률을 향상시키는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체 - Google Patents

음성 기록을 위한 음성 인식률을 향상시키는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체 Download PDF

Info

Publication number
KR102446300B1
KR102446300B1 KR1020200137324A KR20200137324A KR102446300B1 KR 102446300 B1 KR102446300 B1 KR 102446300B1 KR 1020200137324 A KR1020200137324 A KR 1020200137324A KR 20200137324 A KR20200137324 A KR 20200137324A KR 102446300 B1 KR102446300 B1 KR 102446300B1
Authority
KR
South Korea
Prior art keywords
voice
processor
voice record
computer device
extracting
Prior art date
Application number
KR1020200137324A
Other languages
English (en)
Korean (ko)
Other versions
KR20220053182A (ko
Inventor
이수미
신지은
정예림
황길환
장정훈
정남규
임대현
Original Assignee
네이버 주식회사
라인 가부시키가이샤
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 네이버 주식회사, 라인 가부시키가이샤 filed Critical 네이버 주식회사
Priority to KR1020200137324A priority Critical patent/KR102446300B1/ko
Priority to JP2021014195A priority patent/JP7166370B2/ja
Priority to TW110135178A priority patent/TWI807428B/zh
Priority to US17/448,616 priority patent/US20220093103A1/en
Publication of KR20220053182A publication Critical patent/KR20220053182A/ko
Application granted granted Critical
Publication of KR102446300B1 publication Critical patent/KR102446300B1/ko

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/22Interactive procedures; Man-machine interfaces
    • G10L17/24Interactive procedures; Man-machine interfaces the user being prompted to utter a password or a predefined phrase
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
KR1020200137324A 2020-09-23 2020-10-22 음성 기록을 위한 음성 인식률을 향상시키는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체 KR102446300B1 (ko)

Priority Applications (4)

Application Number Priority Date Filing Date Title
KR1020200137324A KR102446300B1 (ko) 2020-10-22 2020-10-22 음성 기록을 위한 음성 인식률을 향상시키는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체
JP2021014195A JP7166370B2 (ja) 2020-10-22 2021-02-01 音声記録のための音声認識率を向上させる方法、システム、およびコンピュータ読み取り可能な記録媒体
TW110135178A TWI807428B (zh) 2020-09-23 2021-09-22 一同管理與語音檔有關的文本轉換記錄和備忘錄的方法、系統及電腦可讀記錄介質
US17/448,616 US20220093103A1 (en) 2020-09-23 2021-09-23 Method, system, and computer-readable recording medium for managing text transcript and memo for audio file

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020200137324A KR102446300B1 (ko) 2020-10-22 2020-10-22 음성 기록을 위한 음성 인식률을 향상시키는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체

Publications (2)

Publication Number Publication Date
KR20220053182A KR20220053182A (ko) 2022-04-29
KR102446300B1 true KR102446300B1 (ko) 2022-09-22

Family

ID=81428729

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020200137324A KR102446300B1 (ko) 2020-09-23 2020-10-22 음성 기록을 위한 음성 인식률을 향상시키는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체

Country Status (2)

Country Link
JP (1) JP7166370B2 (ja)
KR (1) KR102446300B1 (ja)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006178087A (ja) * 2004-12-21 2006-07-06 Internatl Business Mach Corp <Ibm> 字幕生成装置、検索装置、文書処理と音声処理とを融合する方法、及びプログラム

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4764203B2 (ja) 2006-02-27 2011-08-31 日本放送協会 音声認識装置及び音声認識プログラム
JP5054711B2 (ja) 2009-01-29 2012-10-24 日本放送協会 音声認識装置および音声認識プログラム
JP5362651B2 (ja) 2010-06-07 2013-12-11 日本電信電話株式会社 重要語句抽出装置及び方法及びプログラム
JP7052328B2 (ja) 2017-12-13 2022-04-12 大日本印刷株式会社 表示制御装置、プログラム、表示システム及び表示制御方法
US20200403818A1 (en) 2019-06-24 2020-12-24 Dropbox, Inc. Generating improved digital transcripts utilizing digital transcription models that analyze dynamic meeting contexts

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006178087A (ja) * 2004-12-21 2006-07-06 Internatl Business Mach Corp <Ibm> 字幕生成装置、検索装置、文書処理と音声処理とを融合する方法、及びプログラム

Also Published As

Publication number Publication date
JP7166370B2 (ja) 2022-11-07
JP2022068817A (ja) 2022-05-10
KR20220053182A (ko) 2022-04-29

Similar Documents

Publication Publication Date Title
US11398236B2 (en) Intent-specific automatic speech recognition result generation
CN108228132B (zh) 语音启用装置及其中执行的方法
Schalkwyk et al. “Your word is my command”: Google search by voice: A case study
US11262970B2 (en) Platform for producing and delivering media content
US8725492B2 (en) Recognizing multiple semantic items from single utterance
US11527233B2 (en) Method, apparatus, device and computer storage medium for generating speech packet
US20120016671A1 (en) Tool and method for enhanced human machine collaboration for rapid and accurate transcriptions
JP6280312B2 (ja) 議事録記録装置、議事録記録方法及びプログラム
US9922650B1 (en) Intent-specific automatic speech recognition result generation
TWI807428B (zh) 一同管理與語音檔有關的文本轉換記錄和備忘錄的方法、系統及電腦可讀記錄介質
CN115082602B (zh) 生成数字人的方法、模型的训练方法、装置、设备和介质
KR20060100646A (ko) 영상물의 특정 위치를 검색하는 방법 및 영상 검색 시스템
KR102446300B1 (ko) 음성 기록을 위한 음성 인식률을 향상시키는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체
KR102437752B1 (ko) 인공지능 디바이스와 연동하여 음성 기록을 관리하는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체
KR102530669B1 (ko) 앱과 웹의 연동을 통해 음성 파일에 대한 메모를 작성하는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체
US20060149545A1 (en) Method and apparatus of speech template selection for speech recognition
CN108255917A (zh) 图像管理方法、设备及电子设备
KR102503586B1 (ko) 음성을 텍스트로 변환한 음성 기록에서 유사 발음의 단어를 포함하여 검색하는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체
KR102427213B1 (ko) 음성 파일에 대한 텍스트 변환 기록과 메모를 함께 관리하는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체
JP7128222B2 (ja) 映像コンテンツに対する合成音のリアルタイム生成を基盤としたコンテンツ編集支援方法およびシステム
KR102353797B1 (ko) 영상 컨텐츠에 대한 합성음 실시간 생성에 기반한 컨텐츠 편집 지원 방법 및 시스템

Legal Events

Date Code Title Description
E701 Decision to grant or registration of patent right