US20170270965A1 - Method and device for accelerated playback, transmission and storage of media files

Publication number
US20170270965A1
Authority
US
United States
Prior art keywords
content
media file
audio
key
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US15/459,518
Other languages
English (en)
Inventor
Fei BAO
Xianliang WANG
Xuan Zhu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Samsung Electronics Co Ltd
Original Assignee
Samsung Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2016-03-15
Filing date
2017-03-15
Publication date
2017-09-21
Application filed by Samsung Electronics Co Ltd
Assigned to SAMSUNG ELECTRONICS CO., LTD. Assignment of assignors' interest (see document for details). Assignors: BAO, Fei; WANG, Xianliang; ZHU, Xuan
Publication of US20170270965A1
Current legal status: Abandoned

Classifications

    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00 Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/005 Reproducing at a different information rate from the information rate of recording
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34 Browsing; Visualisation therefor
    • G06F16/345 Summarisation for human users
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/74 Browsing; Visualisation therefor
    • G06F16/745 Browsing; Visualisation therefor the internal structure of a single video sequence
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7834 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using audio features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G06K9/00744
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G06V20/46 Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00 Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10 Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19 Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00 Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10 Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34 Indicating arrangements
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/57 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for processing of video signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Library & Information Science (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • General Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Signal Processing (AREA)
US15/459,518 2016-03-15 2017-03-15 Method and device for accelerated playback, transmission and storage of media files Abandoned US20170270965A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610147563.2A CN107193841B (en) 2016-03-15 2016-03-15 Method and device for accelerated playback, transmission and storage of media files
CN201610147563.2 2016-03-15

Publications (1)

Publication Number Publication Date
US20170270965A1 (en) 2017-09-21

Family

ID=59851324

Family Applications (1)

Application Number Title Priority Date Filing Date
US15/459,518 Abandoned US20170270965A1 (en) 2016-03-15 2017-03-15 Method and device for accelerated playback, transmission and storage of media files

Country Status (4)

Country Link
US (1) US20170270965A1 (fr)
EP (1) EP3403415A4 (fr)
CN (1) CN107193841B (fr)
WO (1) WO2017160073A1 (fr)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
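CN107846625B (en) * 2017-10-30 2019-09-24 Oppo广东移动通信有限公司 Video image quality adjustment method, apparatus, terminal device and storage medium
CN107770626B (en) * 2017-11-06 2020-03-17 腾讯科技(深圳)有限公司 Video material processing method, video synthesis method, apparatus and storage medium
CN110771175A (en) * 2018-05-30 2020-02-07 深圳市大疆创新科技有限公司 Video playback speed control method and apparatus, and action camera
CN108882024B (en) * 2018-08-01 2021-08-20 北京奇艺世纪科技有限公司 Video playback method, apparatus and electronic device
CN109977239B (en) * 2019-03-31 2023-08-18 联想(北京)有限公司 Information processing method and electronic device
CN110113666A (en) * 2019-05-10 2019-08-09 腾讯科技(深圳)有限公司 Multimedia file playback method, apparatus, device and storage medium
CN110177298B (en) * 2019-05-27 2021-03-26 湖南快乐阳光互动娱乐传媒有限公司 Voice-based multiple-speed video playback method and system
CN110519619B (en) * 2019-09-19 2022-03-25 湖南快乐阳光互动娱乐传媒有限公司 Variable-speed playback method and system based on multiple-speed playback
CN111327958B (en) * 2020-02-28 2022-03-25 北京百度网讯科技有限公司 Video playback method, apparatus, electronic device and storage medium
CN111356010A (en) * 2020-04-01 2020-06-30 上海依图信息技术有限公司 Method and system for obtaining the optimal playback speed of audio
CN112349299A (en) * 2020-10-28 2021-02-09 维沃移动通信有限公司 Voice playback method, apparatus and electronic device
CN112423019B (en) * 2020-11-17 2022-11-22 北京达佳互联信息技术有限公司 Method, apparatus, electronic device and storage medium for adjusting audio playback speed
CN113434231A (en) * 2021-06-24 2021-09-24 维沃移动通信有限公司 Text information broadcasting method and apparatus
CN114979798B (en) * 2022-04-21 2024-03-22 维沃移动通信有限公司 Playback speed control method and electronic device
CN115022705A (en) * 2022-05-24 2022-09-06 咪咕文化科技有限公司 Video playback method, apparatus and device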

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020133339A1 (en) * 2001-03-13 2002-09-19 Gudorf Gregory D. Method and apparatus for automatic collection and summarization of meeting information
US20040152055A1 (en) * 2003-01-30 2004-08-05 Gliessner Michael J.G. Video based language learning system
US7136571B1 (en) * 2000-10-11 2006-11-14 Koninklijke Philips Electronics N.V. System and method for fast playback of video with selected audio
US20070112837A1 (en) * 2005-11-09 2007-05-17 Bbnt Solutions Llc Method and apparatus for timed tagging of media content
US20080250080A1 (en) * 2007-04-05 2008-10-09 Nokia Corporation Annotating the dramatic content of segments in media work
US20100106498A1 (en) * 2008-10-24 2010-04-29 At&T Intellectual Property I, L.P. System and method for targeted advertising
US20130266193A1 (en) * 2012-04-09 2013-10-10 Accenture Global Services Limited Biometric matching technology
US9087508B1 (en) * 2012-10-18 2015-07-21 Audible, Inc. Presenting representative content portions during content navigation
US20150222944A1 (en) * 1998-07-27 2015-08-06 Microsoft Technology Licensing, Llc Selection compression
US20170024614A1 (en) * 2015-03-16 2017-01-26 Rohan Sanil System for Automatically Editing Video
US9847096B2 (en) * 2014-02-20 2017-12-19 Harman International Industries, Incorporated Environment sensing intelligent apparatus

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5664227A (en) * 1994-10-14 1997-09-02 Carnegie Mellon University System and method for skimming digital audio/video data
IL144818A (en) * 2001-08-09 2006-08-20 Voicesense Ltd Method and apparatus for speech analysis
US6625387B1 (en) * 2002-03-01 2003-09-23 Thomson Licensing S.A. Gated silence removal during video trick modes
TWI270052B (en) * 2005-08-09 2007-01-01 Delta Electronics Inc System for selecting audio content by using speech recognition and method therefor
US7673238B2 (en) * 2006-01-05 2010-03-02 Apple Inc. Portable media device with video acceleration capabilities
US20080300872A1 (en) * 2007-05-31 2008-12-04 Microsoft Corporation Scalable summaries of audio or visual content
KR101349797B1 (en) * 2007-06-26 2014-01-13 삼성전자주식회사 Method and apparatus for playing back a voice file in an electronic device
US9953651B2 (en) * 2008-07-28 2018-04-24 International Business Machines Corporation Speed podcasting
JP5168105B2 (en) * 2008-11-26 2013-03-21 パナソニック株式会社 Audio playback device and audio playback method
CN102143384B (en) * 2010-12-31 2013-01-16 华为技术有限公司 Media file generation method, apparatus and system
US20120323897A1 (en) * 2011-06-14 2012-12-20 Microsoft Corporation Query-dependent audio/video clip search result previews
CN102271280A (en) * 2011-07-20 2011-12-07 宝利微电子系统控股公司 Method and apparatus for variable-speed playback of digital audio and video
JP5854208B2 (en) * 2011-11-28 2016-02-09 日本電気株式会社 Video content generation method for multi-stage high-speed playback
CN102867042A (en) * 2012-09-03 2013-01-09 北京奇虎科技有限公司 Multimedia file search method and apparatus
CN103813215A (en) * 2012-11-13 2014-05-21 联想(北京)有限公司 Information acquisition method and electronic device
US9569167B2 (en) * 2013-03-12 2017-02-14 Tivo Inc. Automatic rate control for improved audio time scaling
CN103686411A (en) * 2013-12-11 2014-03-26 深圳Tcl新技术有限公司 Video playback method and multimedia device
CN105205083A (en) * 2014-06-27 2015-12-30 国际商业机器公司 Method and device for browsing content using key points in a progress bar

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190073183A1 (en) * 2016-04-27 2019-03-07 Sony Corporation Information processing apparatus, information processing method, and program
US11074034B2 (en) * 2016-04-27 2021-07-27 Sony Corporation Information processing apparatus, information processing method, and program
US11232808B2 (en) * 2017-08-15 2022-01-25 Amazon Technologies, Inc. Adjusting speed of human speech playback
WO2021119102A1 (en) * 2019-12-09 2021-06-17 Dolby Laboratories Licensing Corporation Adjusting audio and non-audio features based on noise metrics and speech intelligibility metrics
WO2021119094A1 (en) * 2019-12-09 2021-06-17 Dolby Laboratories Licensing Corporation Adjusting audio and non-audio features based on noise metrics and speech intelligibility metrics
CN114902688A (en) * 2019-12-09 2022-08-12 杜比实验室特许公司 Adjusting audio and non-audio features based on noise metrics and speech intelligibility metrics
CN111916053A (en) * 2020-08-17 2020-11-10 北京字节跳动网络技术有限公司 Speech generation method, apparatus, device and computer-readable medium
CN112398912A (en) * 2020-10-26 2021-02-23 北京佳讯飞鸿电气股份有限公司 Speech signal acceleration method, apparatus, computer device and storage medium
WO2022253053A1 (en) * 2021-05-31 2022-12-08 华为技术有限公司 Video playback method and apparatus
CN114564165A (en) * 2022-02-23 2022-05-31 成都智元汇信息技术股份有限公司 Public-transport-based text and audio adaptation method, display terminal and system
CN114257858A (en) * 2022-03-02 2022-03-29 浙江宇视科技有限公司 Content synchronization method and apparatus based on affective computing
US11676385B1 (en) * 2022-04-07 2023-06-13 Lemon Inc. Processing method and apparatus, terminal device and medium
WO2023238650A1 (en) * 2022-06-06 2023-12-14 ソニーグループ株式会社 Conversion device and conversion method
CN114845089A (en) * 2022-07-04 2022-08-02 浙江大华技术股份有限公司 Method and apparatus for transmitting video pictures

Also Published As

Publication number Publication date
EP3403415A4 (fr) 2019-04-17
EP3403415A1 (fr) 2018-11-21
CN107193841A (zh) 2017-09-22
CN107193841B (zh) 2022-07-26
WO2017160073A1 (fr) 2017-09-21

Similar Documents

Publication Publication Date Title
US20170270965A1 (en) Method and device for accelerated playback, transmission and storage of media files
CN110446115B (en) Live-streaming interaction method, apparatus, electronic device and storage medium
US20210280185A1 (en) Interactive voice controlled entertainment
WO2022121601A1 (en) Live streaming interaction method and apparatus, and device and medium
KR102038809B1 (en) Intelligent automated assistant for media search and playback
KR102277920B1 (en) Intelligent automated assistant in a media environment
CN108391149B (en) Display apparatus, method of controlling display apparatus, server, and method of controlling server
US20220239882A1 (en) Interactive information processing method, device and medium
CN108847214B (en) Speech processing method, client, apparatus, terminal, server and storage medium
US10652592B2 (en) Named entity disambiguation for providing TV content enrichment
CN112040263A (en) Video processing method, video playback method, apparatus, storage medium and device
CN108292314B (en) Information processing apparatus, information processing method, and program
US9563704B1 (en) Methods, systems, and media for presenting suggestions of related media content
US20200044999A1 (en) Voice forwarding in automated chatting
WO2019047850A1 (en) Identifier display method and device, and request response method and device
CN108885869A (en) Controlling playback of audio data containing speech
CN113035199B (en) Audio processing method, apparatus, device and readable storage medium
JP7274210B2 (en) Dialogue system and program
CN112653902A (en) Speaker recognition method, apparatus and electronic device
US20240121451A1 (en) Video processing method and apparatus, storage medium, and device
CN113761268A (en) Playback control method, apparatus, device and storage medium for audio program content
US11729476B2 (en) Reproduction control of scene description
US20140129221A1 (en) Sound recognition device, non-transitory computer readable storage medium stored threreof sound recognition program, and sound recognition method
CN111490929B (en) Video clip push method, apparatus, electronic device and storage medium
JPWO2018079294A1 (en) Information processing apparatus and information processing method

Legal Events

Date Code Title Description
AS Assignment

Owner name: SAMSUNG ELECTRONICS CO., LTD., KOREA, REPUBLIC OF

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BAO, FEI;WANG, XIANLIANG;ZHU, XUAN;REEL/FRAME:041609/0010

Effective date: 20170215

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: ADVISORY ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STCV Information on status: appeal procedure

Free format text: NOTICE OF APPEAL FILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION