EP4083999A4 - Voice recognition method and related product - Google Patents

Info

Publication number
EP4083999A4
EP4083999A4 EP20905795.9A EP20905795A EP4083999A4 EP 4083999 A4 EP4083999 A4 EP 4083999A4 EP 20905795 A EP20905795 A EP 20905795A EP 4083999 A4 EP4083999 A4 EP 4083999A4
Authority
EP
European Patent Office
Prior art keywords
voice recognition
recognition method
related product
product
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP20905795.9A
Other languages
German (de)
French (fr)
Other versions
EP4083999A1 (en)
Inventor
Genshun Wan
Jianqing Gao
Zhiguo Wang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
iFlytek Co Ltd
Original Assignee
iFlytek Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2019-12-28
Filing date
2020-12-14
Publication date
2024-01-17
Application filed by iFlytek Co Ltd
Publication of EP4083999A1
Publication of EP4083999A4

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/237 Lexical tools
    • G06F40/247 Thesauruses; Synonyms
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/289 Phrasal analysis, e.g. finite state techniques or chunking
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/04 Segmentation; Word boundary detection
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G10L2015/0631 Creating reference templates; Clustering

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
EP20905795.9A 2019-12-28 2020-12-14 Voice recognition method and related product Pending EP4083999A4 (en)

Applications Claiming Priority (2)

Application Number Publication Priority Date Filing Date Title
CN201911389673.XA CN111161739B (en) 2019-12-28 2019-12-28 Speech recognition method and related product
PCT/CN2020/136126 WO2021129439A1 (en) 2019-12-28 2020-12-14 Voice recognition method and related product

Publications (2)

Publication Number Publication Date
EP4083999A1 (en) 2022-11-02
EP4083999A4 (en) 2024-01-17

Family

ID=70559183

Family Applications (1)

Application Number Status Publication Priority Date Filing Date Title
EP20905795.9A Pending EP4083999A4 (en) 2019-12-28 2020-12-14 Voice recognition method and related product

Country Status (6)

Country Link
US (1) US20230035947A1 (en)
EP (1) EP4083999A4 (en)
JP (1) JP7413533B2 (en)
KR (1) KR20220054587A (en)
CN (1) CN111161739B (en)
WO (1) WO2021129439A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111161739B (en) * 2019-12-28 2023-01-17 科大讯飞股份有限公司 Speech recognition method and related product
CN111930949B (en) * 2020-09-11 2021-01-15 腾讯科技(深圳)有限公司 Search string processing method and device, computer readable medium and electronic equipment
CN112489651B (en) * 2020-11-30 2023-02-17 科大讯飞股份有限公司 Voice recognition method, electronic device and storage device
CN112562659B (en) * 2020-12-11 2024-04-09 科大讯飞(上海)科技有限公司 Speech recognition method, device, electronic equipment and storage medium
CN112954235B (en) * 2021-02-04 2021-10-29 读书郎教育科技有限公司 Early education panel interaction method based on family interaction
CN114143281B (en) * 2021-11-10 2023-03-14 聚好看科技股份有限公司 Document generation method, server and display device
CN114464182B (en) * 2022-03-03 2022-10-21 慧言科技(天津)有限公司 Voice recognition fast self-adaption method assisted by audio scene classification
CN115374793B (en) * 2022-10-25 2023-01-20 深圳市人马互动科技有限公司 Voice data processing method based on service scene recognition and related device
CN117198289B (en) * 2023-09-28 2024-05-10 阿波罗智联(北京)科技有限公司 Voice interaction method, device, equipment, medium and product

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8447608B1 (en) * 2008-12-10 2013-05-21 Adobe Systems Incorporated Custom language models for audio content
US20150278191A1 (en) * 2014-03-27 2015-10-01 Microsoft Corporation Flexible Schema for Language Model Customization

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004233541A (en) * 2003-01-29 2004-08-19 Riyuukoku Univ Highlight scene detection system
US9031839B2 (en) * 2010-12-01 2015-05-12 Cisco Technology, Inc. Conference transcription based on conference data
JP5124012B2 (en) * 2010-12-10 2013-01-23 日本放送協会 Speech recognition apparatus and speech recognition program
JP5723711B2 (en) * 2011-07-28 2015-05-27 日本放送協会 Speech recognition apparatus and speech recognition program
CN103838756A (en) * 2012-11-23 2014-06-04 阿里巴巴集团控股有限公司 Method and device for determining pushed information
CN105448292B (en) * 2014-08-19 2019-03-12 北京羽扇智信息科技有限公司 A kind of time Speech Recognition System and method based on scene
CN104464733B (en) * 2014-10-28 2019-09-20 百度在线网络技术(北京)有限公司 A kind of more scene management method and devices of voice dialogue
CN105045778B (en) * 2015-06-24 2017-10-17 江苏科技大学 A kind of Chinese homonym mistake auto-collation
CN105654945B (en) * 2015-10-29 2020-03-06 乐融致新电子科技(天津)有限公司 Language model training method, device and equipment
CN105719649B (en) * 2016-01-19 2019-07-05 百度在线网络技术(北京)有限公司 Audio recognition method and device
CN106328147B (en) * 2016-08-31 2022-02-01 中国科学技术大学 Speech recognition method and device
CN107644641B (en) * 2017-07-28 2021-04-13 深圳前海微众银行股份有限公司 Dialog scene recognition method, terminal and computer-readable storage medium
EP3752958A4 (en) * 2018-02-15 2021-11-10 DMAI, Inc. System and method for visual scene construction based on user communication
CN108984529B (en) * 2018-07-16 2022-06-03 北京华宇信息技术有限公司 Real-time court trial voice recognition automatic error correction method, storage medium and computing device
CN109272995A (en) * 2018-09-26 2019-01-25 出门问问信息科技有限公司 Audio recognition method, device and electronic equipment
CN110534094B (en) * 2019-07-31 2022-05-31 大众问问(北京)信息科技有限公司 Voice interaction method, device and equipment
CN110415705B (en) * 2019-08-01 2022-03-01 苏州奇梦者网络科技有限公司 Hot word recognition method, system, device and storage medium
WO2021040092A1 (en) * 2019-08-29 2021-03-04 엘지전자 주식회사 Speech recognition service provision method and apparatus
CN110544477A (en) * 2019-09-29 2019-12-06 北京声智科技有限公司 Voice recognition method, device, equipment and medium
CN111161739B (en) * 2019-12-28 2023-01-17 科大讯飞股份有限公司 Speech recognition method and related product
CN112037792B (en) * 2020-08-20 2022-06-17 北京字节跳动网络技术有限公司 Voice recognition method and device, electronic equipment and storage medium
CN112562659B (en) * 2020-12-11 2024-04-09 科大讯飞(上海)科技有限公司 Speech recognition method, device, electronic equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of WO2021129439A1 *

Also Published As

Publication number Publication date
CN111161739B (en) 2023-01-17
JP7413533B2 (en) 2024-01-15
WO2021129439A1 (en) 2021-07-01
EP4083999A1 (en) 2022-11-02
KR20220054587A (en) 2022-05-03
US20230035947A1 (en) 2023-02-02
CN111161739A (en) 2020-05-15
JP2023504796A (en) 2023-02-07

Similar Documents

Publication Publication Date Title
EP4083999A4 (en) Voice recognition method and related product
EP3501023A4 (en) Speech recognition method and apparatus
EP4047598A4 (en) Voice matching method and related device
EP3751569A4 (en) Multi-person voice separation method and apparatus
EP3504703A4 (en) A speech recognition method and apparatus
EP3767619A4 (en) Speech recognition and speech recognition model training method and apparatus
EP3479376A4 (en) Speech recognition method and apparatus based on speaker recognition
EP3373293A4 (en) Speech recognition method and apparatus
EP3859731A4 (en) Speech synthesis method and device
EP3933693A4 (en) Object recognition method and device
EP3872691A4 (en) Fingerprint recognition method and related product
EP3533052A4 (en) Speech recognition method and apparatus
EP3701521A4 (en) Voice recognition apparatus and operation method thereof cross-reference to related application
SG11202107826QA (en) Facial recognition method and apparatus
EP4064123A4 (en) Text recognition method and apparatus
EP3757874A4 (en) Action recognition method and apparatus
EP3850622A4 (en) Method and device for speech recognition
EP3869509A4 (en) Voice recognition device and method
EP3819810A4 (en) Face recognition method and apparatus
KR102351008B9 (en) Apparatus and method for recognizing emotions
GB2588496B (en) Recognition apparatus and method
EP3686882A4 (en) Method for training filter model and speech recognition method
EP4026121A4 (en) Speech recognition systems and methods
EP3584700A4 (en) Fingerprint recognition method and related product
EP3820162A4 (en) Speech data processing method and related product

Legal Events

Date Code Title Description
STAA Information on the status of an EP patent application or granted EP patent
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under Article 153(3) EPC to a published international application that has entered the European phase
Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an EP patent application or granted EP patent
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed
Effective date: 20220527

AK Designated contracting states
Kind code of ref document: A1
Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

DAV Request for validation of the European patent (deleted)

DAX Request for extension of the European patent (deleted)

REG Reference to a national code
Ref country code: DE
Ref legal event code: R079
Free format text: PREVIOUS MAIN CLASS: G10L0015260000
Ipc: G10L0015180000

A4 Supplementary search report drawn up and despatched
Effective date: 20231218

RIC1 Information provided on IPC code assigned before grant
Ipc: G10L 15/06 20130101ALI20231212BHEP
Ipc: G10L 15/26 20060101ALI20231212BHEP
Ipc: G10L 15/18 20130101AFI20231212BHEP