IN2014MN01588A

IN2014MN01588A -

Info

Publication number: IN2014MN01588A
Authority: IN
Inventors: Venkatraman Srinivasa Atti; Ethan Robert Duni
Original assignee: Qualcomm Inc
Priority date: 2012-01-13
Filing date: 2012-12-21
Publication date: 2015-05-08
Also published as: BR112014017001B1; EP2803068A1; WO2013106192A1; DK2803068T3; CN104040626B; HUE027037T2; SI2803068T1; JP5964455B2; EP2803068B1; JP2015507222A; ES2576232T3; US9111531B2; KR20170005514A; CN104040626A; US20130185063A1; KR20140116487A; BR112014017001A8; BR112014017001A2

Abstract

Improved audio classification is provided for encoding applications. An initial classification is performed followed by a finer classification to produce speech classifications and music classifications with higher accuracy and less complexity than previously available. Audio is classified as speech or music on a frame by frame basis. If the frame is classified as music by the initial classification that frame undergoes a second finer classification to confirm that the frame is music and not speech (e.g. speech that is tonal and/or structured that may not have been classified as speech by the initial classification). Depending on the implementation one or more parameters may be used in the finer classification. Example parameters include voicing modified correlation signal activity and long term pitch gain.