WO2020145667A3 - Method and apparatus for controlling audio sound quality in terminal using network - Google Patents

Method and apparatus for controlling audio sound quality in terminal using network

Info

Publication number
WO2020145667A3
Authority
WO
WIPO (PCT)
Prior art keywords
data
post
processing
video
terminal
Prior art date
Application number
PCT/KR2020/000348
Other languages
English (en)
French (fr)
Other versions
WO2020145667A2 (ko)
Inventor
홍승범
이용훈
고성환
박영현
박의순
조준영
최윤구
문한길
황호철
Original Assignee
삼성전자 주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 삼성전자 주식회사
Priority to US17/420,841 (US20220095009A1)
Publication of WO2020145667A2
Publication of WO2020145667A3

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4394 Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233 Processing of audio elementary streams
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236 Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2368 Multiplexing of audio and video streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/27 Server based end-user applications
    • H04N21/274 Storing end-user multimedia data in response to end-user request, e.g. network recorder
    • H04N21/2743 Video hosting of uploaded data from client
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/422 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/422 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/4223 Cameras
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434 Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4341 Demultiplexing of audio and video streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/475 End-user interface for inputting end-user data, e.g. personal identification number [PIN], preference data
    • H04N21/4756 End-user interface for inputting end-user data, e.g. personal identification number [PIN], preference data for rating content, e.g. scoring a recommended movie
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N21/65 Transmission of management data between client and server
    • H04N21/658 Transmission by the client directed to the server
    • H04N21/6582 Data stored in the client, e.g. viewing habits, hardware capabilities, credit card number
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Studio Devices (AREA)
  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

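The present invention provides a method and apparatus that use a network connection to perform audio post-processing optimized for a situation determined from image information. A method of a terminal according to the present invention comprises: obtaining video data whose audio data is to be processed; transmitting data related to the obtained video to a server; receiving, from the server, data including post-processed audio data; and storing the data including the post-processed audio data, wherein the post-processing is performed based on image data included in the video-related data.
PCT/KR2020/000348 2019-01-09 2020-01-08 Method and apparatus for controlling audio sound quality in terminal using network WO2020145667A2 (ko)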
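
The four terminal-side steps in the abstract (obtain, transmit, receive, store) can be illustrated with a minimal in-process sketch. This is not the patented implementation. Every name here (`VideoData`, `SCENE_GAIN`, `classify_scene`, `server_post_process`, `terminal_flow`) is hypothetical, the "server" is an ordinary function rather than a network service, per-frame scene labels stand in for real image analysis, and a simple gain change stands in for whatever post-processing the server actually applies.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class VideoData:
    # Hypothetical stand-in for image data: one scene label per frame.
    frames: List[str]
    # Mono audio samples, assumed to lie in [-1.0, 1.0].
    audio: List[float]


# Assumed scene -> gain table; the values are illustrative only.
SCENE_GAIN: Dict[str, float] = {"concert": 0.5, "lecture": 1.5}


def classify_scene(frames: List[str]) -> str:
    """Pick the most frequent frame label as the scene (placeholder for image analysis)."""
    if not frames:
        return "unknown"
    return Counter(frames).most_common(1)[0][0]


def server_post_process(video: VideoData) -> VideoData:
    """Server side: choose post-processing from the image data, then apply it to the audio."""
    gain = SCENE_GAIN.get(classify_scene(video.frames), 1.0)
    # Apply the gain and clip back into the valid sample range.
    processed = [max(-1.0, min(1.0, s * gain)) for s in video.audio]
    return VideoData(frames=video.frames, audio=processed)


def terminal_flow(
    video: VideoData,
    send_to_server: Callable[[VideoData], VideoData],
    storage: Dict[str, VideoData],
) -> VideoData:
    """Terminal side: transmit the obtained video data, receive the result, store it."""
    result = send_to_server(video)  # transmit and receive
    storage["latest"] = result      # store the post-processed data
    return result


if __name__ == "__main__":
    clip = VideoData(frames=["lecture", "lecture", "concert"], audio=[0.2, -0.4, 0.8])
    saved: Dict[str, VideoData] = {}
    print(terminal_flow(clip, server_post_process, saved).audio)
```

Passing the server as a callable keeps the terminal logic independent of the transport, so a real network client could be swapped in without changing `terminal_flow`.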

Priority Applications (1)

Application Number Priority Date Filing Date Title
US17/420,841 US20220095009A1 (en) 2019-01-09 2020-01-08 Method and apparatus for controlling audio sound quality in terminal using network

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2019-0002951 2019-01-09
KR1020190002951A KR20200086569A (ko) 2019-01-09 2019-01-09 Method and apparatus for controlling audio sound quality in terminal using network

Publications (2)

Publication Number Publication Date
WO2020145667A2 WO2020145667A2 (ko) 2020-07-16
WO2020145667A3 WO2020145667A3 (ko) 2020-09-17

Family

ID=71522308

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2020/000348 WO2020145667A2 (ko) 2019-01-09 2020-01-08 Method and apparatus for controlling audio sound quality in terminal using network

Country Status (3)

Country Link
US (1) US20220095009A1 (ko)
KR (1) KR20200086569A (ko)
WO (1) WO2020145667A2 (ko)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112532856B (zh) * 2019-09-17 2023-10-17 中兴通讯股份有限公司 Photographing method, apparatus, and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20040075447A (ko) * 2003-02-21 2004-08-30 대한민국(전남대학교총장) Mobile communication-based speech recognition system and method
US20140168344A1 (en) * 2012-12-14 2014-06-19 Biscotti Inc. Video Mail Capture, Processing and Distribution
KR20140126556A (ko) * 2013-04-23 2014-10-31 주식회사 엘지유플러스 Apparatus, server, terminal, method, and recording medium for emotion-based multimedia playback
KR20170013960A (ko) * 2015-07-06 2017-02-07 한국과학기술원 Method and system for providing image-based video content
KR20180110971A (ko) * 2017-03-30 2018-10-11 엘지전자 주식회사 Home appliance and voice recognition module

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7801910B2 (en) * 2005-11-09 2010-09-21 Ramp Holdings, Inc. Method and apparatus for timed tagging of media content
US8774598B2 (en) * 2011-03-29 2014-07-08 Sony Corporation Method, apparatus and system for generating media content
US8531602B1 (en) * 2011-10-19 2013-09-10 Google Inc. Audio enhancements for media
US9819429B2 (en) * 2014-10-21 2017-11-14 Qualcomm Innovation Center, Inc. Efficient load sharing and accelerating of audio post-processing
CN111201567A (zh) * 2017-08-10 2020-05-26 费赛特实验室有限责任公司 Spoken, facial, and gestural communication devices and computing architecture for interacting with digital media content

Also Published As

Publication number Publication date
WO2020145667A2 (ko) 2020-07-16
KR20200086569A (ko) 2020-07-17
US20220095009A1 (en) 2022-03-24

Similar Documents

Publication Publication Date Title
EP2146340A1 (en) A system and method for controlling an image collecting device to carry out a target location
JP4431836B2 (ja) Voice acquisition device, noise removal system, and program
US9876944B2 (en) Apparatus, systems and methods for user controlled synchronization of presented video and audio streams
CN105763832B (zh) Video interaction and control method and apparatus
US11710488B2 (en) Transcription of communications using multiple speech recognition systems
EP3188180B1 (en) Enhancing an audio recording
KR20220077132A (ko) Method and system for generating binaural immersive audio for audiovisual content
US10283114B2 (en) Sound conditioning
CN109361527B (zh) Voice conference recording method and system
EP3860133A4 (en) AUDIO AND VIDEO QUALITY IMPROVEMENT METHOD AND SYSTEM WITH SCENE DETECTION AND DISPLAY DEVICE
CN105407361A (zh) Method and apparatus for processing live audio/video streaming data
CN1714554A (zh) Audiovisual media encoding system
JP2005515706A5 (ko)
WO2020145667A3 (ko) Method and apparatus for controlling audio sound quality in terminal using network
EP4093022A4 (en) METHOD, APPARATUS AND SYSTEM FOR MANAGING IMAGE CAPTURED BY A DRONE
JP5450279B2 (ja) Objective video quality evaluation apparatus, method, and program
EP3796647A4 (en) VIDEOCONFERENCE SERVER FOR CONDUCTING A VIDEOCONFERENCE BY MEANS OF A PLURALITY OF VIDEOCONFERENCE TERMINALS, AND ASSOCIATED AUDIO ECHO CANCELLATION METHOD
EP3923271A3 (en) Voice control method, vehicle, server and storage medium
MX2021010049A (es) Method and apparatus for making a video connection, and non-transitory computer-readable storage medium.
CN108401209A (zh) Method and apparatus for implementing voice broadcast correction, and readable storage medium
US20190019522A1 (en) Method and apparatus for multilingual film and audio dubbing
CN114928806A (zh) Method, apparatus, and device for real-time audio monitoring and replacement in a microphone system
US9521365B2 (en) Image-based techniques for audio content
TWI548278B (zh) Audio/video synchronization control apparatus and method
JP2021107873A5 (ko)

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20738565

Country of ref document: EP

Kind code of ref document: A2