IN2013MU03833A - Google Patents

Info

Publication number
IN2013MU03833A
Authority
IN
India
Prior art keywords
features
noise data
classification
noise
respect
Application number
IN3833MU2013
Inventor
Ramu Reddy Vempada
Aniruddha Sinha
Guruprasad Seshadri
Original Assignee
Tata Consultancy Services Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2013-12-06
Filing date
2014-12-03
Publication date
2015-07-31
Application filed by Tata Consultancy Services Ltd
Priority to PCT/IB2014/066538 (WO2015083091A2)
Priority to EP14867853.5A (EP3078026B1)
Priority to IN3833MU2013 (IN2013MU03833A)
Priority to US15/101,817 (US10134423B2)
Publication of IN2013MU03833A

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/09: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00: Pattern recognition
    • G06F18/20: Analysing
    • G06F18/21: Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213: Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • G06F18/2132: Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on discrimination criteria, e.g. discriminant analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

System(s) and method(s) for classifying noise data of a human crowd are disclosed. Noise data is captured from one or more sources, and features are extracted from it by using computation techniques. The features comprise spectral domain features and time domain features. Classification models are developed by using each of the spectral domain features and the time domain features, and discriminative information with respect to the noise data is extracted by using these classification models. A performance matrix is computed for each classification model. Each performance matrix comprises classified noise elements with respect to the noise data, and each classified noise element is associated with a classification performance score with respect to a spectral domain feature, a time domain feature, and a fusion of features and scores. The classified noise elements provide the classification of the noise data. [To be published with figure 3]
Application IN3833MU2013, priority date 2013-12-06, filing date 2014-12-03, published as IN2013MU03833A (en)
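
The pipeline summarised in the abstract can be illustrated with a minimal sketch. The abstract does not say which features, classifiers, or fusion rule are used, so everything below is an assumption chosen for illustration: zero-crossing rate stands in for a time domain feature, spectral centroid (computed with a naive DFT) for a spectral domain feature, a weighted sum for score fusion, and a confusion matrix for one plausible reading of the "performance matrix". All function names and the class labels `crowded` and `quiet` are hypothetical.

```python
import cmath
import math


def zero_crossing_rate(frame):
    """Time domain feature (assumed): fraction of adjacent sample pairs that change sign."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def spectral_centroid(frame, sample_rate):
    """Spectral domain feature (assumed): magnitude-weighted mean frequency via a naive DFT."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        mags.append(abs(coeff))
    total = sum(mags)
    if total == 0:
        return 0.0
    return sum(k * sample_rate / n * m for k, m in enumerate(mags)) / total


def fuse_scores(spectral_scores, time_scores, weight=0.5):
    """Score-level fusion (assumed rule): weighted sum of per-class scores from two models."""
    return {c: weight * spectral_scores[c] + (1 - weight) * time_scores[c]
            for c in spectral_scores}


def confusion_matrix(true_labels, predicted_labels, classes):
    """Rows are true classes, columns are predicted classes."""
    index = {c: i for i, c in enumerate(classes)}
    matrix = [[0] * len(classes) for _ in classes]
    for t, p in zip(true_labels, predicted_labels):
        matrix[index[t]][index[p]] += 1
    return matrix


if __name__ == "__main__":
    # Synthetic 1 kHz tone sampled at 8 kHz, standing in for a captured noise frame.
    frame = [math.sin(2 * math.pi * 1000 * t / 8000 + 0.3) for t in range(64)]
    print("zero-crossing rate:", zero_crossing_rate(frame))
    print("spectral centroid (Hz):", spectral_centroid(frame, 8000))

    # Hypothetical per-class scores from a spectral model and a time domain model.
    fused = fuse_scores({"crowded": 0.8, "quiet": 0.2}, {"crowded": 0.4, "quiet": 0.6})
    print("fused decision:", max(fused, key=fused.get))
```

In this sketch, one confusion matrix would be built per model (spectral only, time domain only, fused), so their per-class accuracies can be compared side by side; whether that matches the performance matrix of the application is not confirmed by the abstract.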

Priority Applications (4)

Application Number | Publication | Priority Date | Filing Date | Title
PCT/IB2014/066538 | WO2015083091A2 (en) | 2013-12-06 | 2014-12-03 | System and method to provide classification of noise data of human crowd
EP14867853.5A | EP3078026B1 (en) | 2013-12-06 | 2014-12-03 | System and method to provide classification of noise data of human crowd
IN3833MU2013 | IN2013MU03833A (en) | 2013-12-06 | 2014-12-03 |
US15/101,817 | US10134423B2 (en) | 2013-12-06 | 2014-12-03 | System and method to provide classification of noise data of human crowd

Applications Claiming Priority (1)

Application Number | Publication | Priority Date | Filing Date | Title
IN3833MU2013 | IN2013MU03833A (en) | 2013-12-06 | 2014-12-03 |

Publications (1)

Publication Number | Publication Date
IN2013MU03833A (en) | 2015-07-31

Family

Family ID: 53274234

Family Applications (1)

Application Number | Publication | Title | Priority Date | Filing Date
IN3833MU2013 | IN2013MU03833A (en) | | 2013-12-06 | 2014-12-03

Country Status (4)

Country | Link
US (1) | US10134423B2 (en)
EP (1) | EP3078026B1 (en)
IN (1) | IN2013MU03833A (en)
WO (1) | WO2015083091A2 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN108028048B (en) * | 2015-06-30 | 2022-06-21 | 弗劳恩霍夫应用研究促进协会 | Method and apparatus for correlating noise and for analysis
US10754947B2 (en) * | 2015-11-30 | 2020-08-25 | International Business Machines Corporation | System, method and apparatus for usable code-level statistical analysis with applications in malware detection
CN109801638B (en) * | 2019-01-24 | 2023-10-13 | 平安科技(深圳)有限公司 | Voice verification method, device, computer equipment and storage medium
KR102300599B1 (en) * | 2020-01-31 | 2021-09-08 | 연세대학교 산학협력단 | Method and Apparatus for Determining Stress in Speech Signal Using Weight
CN111370025A (en) * | 2020-02-25 | 2020-07-03 | 广州酷狗计算机科技有限公司 | Audio recognition method and device and computer storage medium
CN112233694B (en) * | 2020-10-10 | 2024-03-05 | 中国电子科技集团公司第三研究所 | Target identification method and device, storage medium and electronic equipment
CN113011568A (en) * | 2021-03-31 | 2021-06-22 | 华为技术有限公司 | Model training method, data processing method and equipment
CN113257266B (en) * | 2021-05-21 | 2021-12-24 | 特斯联科技集团有限公司 | Complex environment access control method and device based on voiceprint multi-feature fusion
CN114724549B (en) * | 2022-06-09 | 2022-09-06 | 广州声博士声学技术有限公司 | Intelligent identification method, device, equipment and storage medium for environmental noise

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US6633842B1 (en) * | 1999-10-22 | 2003-10-14 | Texas Instruments Incorporated | Speech recognition front-end feature extraction for noisy speech
US5970447A (en) | 1998-01-20 | 1999-10-19 | Advanced Micro Devices, Inc. | Detection of tonal signals
US7177808B2 (en) * | 2000-11-29 | 2007-02-13 | The United States Of America As Represented By The Secretary Of The Air Force | Method for improving speaker identification by determining usable speech
US7505902B2 (en) | 2004-07-28 | 2009-03-17 | University Of Maryland | Discrimination of components of audio signals based on multiscale spectro-temporal modulations
US7668790B2 (en) * | 2006-07-27 | 2010-02-23 | The United States Of America As Represented By The Secretary Of The Navy | System and method for fusing data from different information sources with shared-sampling distribution based boosting
US8457768B2 (en) * | 2007-06-04 | 2013-06-04 | International Business Machines Corporation | Crowd noise analysis
US8140331B2 (en) * | 2007-07-06 | 2012-03-20 | Xia Lou | Feature extraction for identification and classification of audio signals
US8515257B2 (en) * | 2007-10-17 | 2013-08-20 | International Business Machines Corporation | Automatic announcer voice attenuation in a presentation of a televised sporting event
WO2010032405A1 (en) * | 2008-09-16 | 2010-03-25 | パナソニック株式会社 | Speech analyzing apparatus, speech analyzing/synthesizing apparatus, correction rule information generating apparatus, speech analyzing system, speech analyzing method, correction rule information generating method, and program
EP2356763A1 (en) * | 2008-12-08 | 2011-08-17 | BAE Systems Information and Electronic Systems Integration Inc. | Method for collaborative discrimation between authentic and spurious signals in a wireless cognitive network
WO2010105089A1 (en) * | 2009-03-11 | 2010-09-16 | Google Inc. | Audio classification for information retrieval using sparse features
FR2943875A1 (en) * | 2009-03-31 | 2010-10-01 | France Telecom | METHOD AND DEVICE FOR CLASSIFYING BACKGROUND NOISE CONTAINED IN AN AUDIO SIGNAL.
US8428933B1 (en) | 2009-12-17 | 2013-04-23 | Shopzilla, Inc. | Usage based query response
US8762144B2 (en) * | 2010-07-21 | 2014-06-24 | Samsung Electronics Co., Ltd. | Method and apparatus for voice activity detection
US8812310B2 (en) * | 2010-08-22 | 2014-08-19 | King Saud University | Environment recognition of audio input
EP2523149B1 (en) * | 2011-05-11 | 2023-01-11 | Tata Consultancy Services Ltd. | A method and system for association and decision fusion of multimodal inputs
US8239196B1 (en) * | 2011-07-28 | 2012-08-07 | Google Inc. | System and method for multi-channel multi-feature speech/noise classification for noise suppression
WO2013105108A1 (en) * | 2011-11-09 | 2013-07-18 | Tata Consultancy Services Limited | A system and method for enhancing human counting by fusing results of human detection modalities
KR101892733B1 (en) * | 2011-11-24 | 2018-08-29 | 한국전자통신연구원 | Voice recognition apparatus based on cepstrum feature vector and method thereof
US9218728B2 (en) * | 2012-02-02 | 2015-12-22 | Raytheon Company | Methods and apparatus for acoustic event detection
US8880444B2 (en) * | 2012-08-22 | 2014-11-04 | Kodak Alaris Inc. | Audio based control of equipment and systems
US9183849B2 (en) * | 2012-12-21 | 2015-11-10 | The Nielsen Company (Us), Llc | Audio matching with semantic audio recognition and report generation
US9117104B2 (en) * | 2013-07-10 | 2015-08-25 | Cherif Algreatly | Object recognition for 3D models and 2D drawings
US9892745B2 (en) * | 2013-08-23 | 2018-02-13 | At&T Intellectual Property I, L.P. | Augmented multi-tier classifier for multi-modal voice activity detection
US20150110277A1 (en) * | 2013-10-22 | 2015-04-23 | Charles Pidgeon | Wearable/Portable Device and Application Software for Alerting People When the Human Sound Reaches the Preset Threshold

Also Published As

Publication number | Publication date
EP3078026A2 (en) | 2016-10-12
EP3078026B1 (en) | 2022-11-16
WO2015083091A3 (en) | 2015-09-24
EP3078026A4 (en) | 2017-05-17
US10134423B2 (en) | 2018-11-20
US20160307582A1 (en) | 2016-10-20
WO2015083091A2 (en) | 2015-06-11

Similar Documents

Publication | Publication Date | Title
IN2013MU03833A (en) | |
GB2527009A (en) | | Simulation of production systems
WO2015006343A3 (en) | | Note recognition and management using color classification
WO2018081607A3 (en) | | Methods of systems of generating virtual multi-dimensional models using image analysis
WO2015195275A3 (en) | | Method for optimizing asset value based on driver acceleration and braking behavior
WO2014085832A3 (en) | | Event investigation within an online research system
WO2012148950A3 (en) | | Representing information from documents
NZ629509A (en) | | Family networks
WO2014120699A3 (en) | | Scaling statistical language understanding systems across domains and intents
WO2012115958A3 (en) | | Automatic data cleaning for machine learning classifiers
WO2014172640A3 (en) | | Method and system of construction project management
EP2650780A3 (en) | | Component discovery from source code
WO2007136560A3 (en) | | Method and system for information extraction and modeling
EA201490172A1 (en) | | SYSTEM AND METHOD FOR FORMING A GEOSTATISTICAL MODEL OF AN INTERESTING GEOLOGICAL VOLUME LIMITED TO A PROCESS ORIENTED MODEL OF AN INTERESTING GEOLOGICAL VOLUME
BR112012017182A2 (en) | | Method and System for Using Multipoint Statistical Simulation to Model Reservoir Property Trends
SG11201900440TA (en) | | System and method for estimating market price of real estate by using big data
WO2014085776A3 (en) | | Web search ranking
EP2860672A3 (en) | | Scalable cross domain recommendation system
WO2012167073A8 (en) | | Methods, apparatuses, and computer program products for database record recovery
GB201217271D0 (en) | | Traffic sensor management
WO2014022172A3 (en) | | Information classification based on product recognition
EP2838037A3 (en) | | Information processing system, information processing method, and information processing program
WO2014209879A3 (en) | | Characterizing porosity distribution from a borehole image
BR112017013140A2 (en) | | computer-implemented method and mesoscale modeling system
WO2014008357A3 (en) | | Methods and systems for identifying and securing educational services