CN111341324A - 一种基于fasttest模型的识别纠错及训练方法 - Google Patents
一种基于fasttest模型的识别纠错及训练方法 Download PDFInfo
- Publication number
- CN111341324A CN111341324A CN202010416525.9A CN202010416525A CN111341324A CN 111341324 A CN111341324 A CN 111341324A CN 202010416525 A CN202010416525 A CN 202010416525A CN 111341324 A CN111341324 A CN 111341324A
- Authority
- CN
- China
- Prior art keywords
- model
- label
- fasttest
- voice
- voice recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 31
- 238000007781 pre-processing Methods 0.000 claims description 6
- 230000011218 segmentation Effects 0.000 claims description 3
- 238000007689 inspection Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000008451 emotion Effects 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/353—Clustering; Classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
- G10L17/14—Use of phonemic categorisation or speech recognition prior to speaker recognition or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/225—Feedback of the input speech
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- Game Theory and Decision Science (AREA)
- Evolutionary Computation (AREA)
- Computational Linguistics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Databases & Information Systems (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010416525.9A CN111341324B (zh) | 2020-05-18 | 2020-05-18 | 一种基于fasttext模型的识别纠错及训练方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010416525.9A CN111341324B (zh) | 2020-05-18 | 2020-05-18 | 一种基于fasttext模型的识别纠错及训练方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111341324A true CN111341324A (zh) | 2020-06-26 |
CN111341324B CN111341324B (zh) | 2020-08-25 |
Family
ID=71184909
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010416525.9A Active CN111341324B (zh) | 2020-05-18 | 2020-05-18 | 一种基于fasttext模型的识别纠错及训练方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111341324B (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113807973A (zh) * | 2021-09-16 | 2021-12-17 | 平安科技(深圳)有限公司 | 文本纠错方法、装置、电子设备及计算机可读存储介质 |
WO2022178933A1 (zh) * | 2021-02-26 | 2022-09-01 | 平安科技(深圳)有限公司 | 基于上下文的语音情感检测方法、装置、设备及存储介质 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1547191A (zh) * | 2003-12-12 | 2004-11-17 | 北京大学 | 结合语义和声纹信息的说话人身份确认系统 |
JP2005321530A (ja) * | 2004-05-07 | 2005-11-17 | Sony Corp | 発話識別装置および発話識別方法 |
CN102024455A (zh) * | 2009-09-10 | 2011-04-20 | 索尼株式会社 | 说话人识别系统及其方法 |
CN108074574A (zh) * | 2017-11-29 | 2018-05-25 | 维沃移动通信有限公司 | 音频处理方法、装置及移动终端 |
CN109448728A (zh) * | 2018-10-29 | 2019-03-08 | 苏州工业职业技术学院 | 融合情感识别的多方会话可视化方法和系统 |
CN110309216A (zh) * | 2019-05-10 | 2019-10-08 | 焦点科技股份有限公司 | 一种基于文本分类的客服语音质检方法 |
JP2019532354A (ja) * | 2016-09-12 | 2019-11-07 | ピンドロップ セキュリティー、インコーポレイテッド | ディープニューラルネットワークを使用する端末間話者認識 |
-
2020
- 2020-05-18 CN CN202010416525.9A patent/CN111341324B/zh active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1547191A (zh) * | 2003-12-12 | 2004-11-17 | 北京大学 | 结合语义和声纹信息的说话人身份确认系统 |
JP2005321530A (ja) * | 2004-05-07 | 2005-11-17 | Sony Corp | 発話識別装置および発話識別方法 |
CN102024455A (zh) * | 2009-09-10 | 2011-04-20 | 索尼株式会社 | 说话人识别系统及其方法 |
JP2019532354A (ja) * | 2016-09-12 | 2019-11-07 | ピンドロップ セキュリティー、インコーポレイテッド | ディープニューラルネットワークを使用する端末間話者認識 |
CN108074574A (zh) * | 2017-11-29 | 2018-05-25 | 维沃移动通信有限公司 | 音频处理方法、装置及移动终端 |
CN109448728A (zh) * | 2018-10-29 | 2019-03-08 | 苏州工业职业技术学院 | 融合情感识别的多方会话可视化方法和系统 |
CN110309216A (zh) * | 2019-05-10 | 2019-10-08 | 焦点科技股份有限公司 | 一种基于文本分类的客服语音质检方法 |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2022178933A1 (zh) * | 2021-02-26 | 2022-09-01 | 平安科技(深圳)有限公司 | 基于上下文的语音情感检测方法、装置、设备及存储介质 |
CN113807973A (zh) * | 2021-09-16 | 2021-12-17 | 平安科技(深圳)有限公司 | 文本纠错方法、装置、电子设备及计算机可读存储介质 |
CN113807973B (zh) * | 2021-09-16 | 2023-07-25 | 平安科技(深圳)有限公司 | 文本纠错方法、装置、电子设备及计算机可读存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN111341324B (zh) | 2020-08-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10950241B2 (en) | Diarization using linguistic labeling with segmented and clustered diarized textual transcripts | |
US10109280B2 (en) | Blind diarization of recorded calls with arbitrary number of speakers | |
CN111341324B (zh) | 一种基于fasttext模型的识别纠错及训练方法 | |
CN117219110A (zh) | 一种适用于录音工牌的话者分离方法 | |
US20230238002A1 (en) | Signal processing device, signal processing method and program | |
Burkhardt et al. | Advances in anger detection with real life data | |
CN111916112A (zh) | 一种基于语音和文字的情绪识别方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: A recognition, error correction and training method based on fasttext model Effective date of registration: 20211203 Granted publication date: 20200825 Pledgee: Hangzhou High-tech Financing Guarantee Co.,Ltd. Pledgor: ZHEJIANG BYAI TECHNOLOGY Co.,Ltd. Registration number: Y2021980013964 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20220322 Granted publication date: 20200825 Pledgee: Hangzhou High-tech Financing Guarantee Co.,Ltd. Pledgor: ZHEJIANG BYAI TECHNOLOGY Co.,Ltd. Registration number: Y2021980013964 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: A recognition, error correction and training method based on fasttext model Effective date of registration: 20220322 Granted publication date: 20200825 Pledgee: Shanghai Guotai Junan Securities Asset Management Co.,Ltd. Pledgor: ZHEJIANG BYAI TECHNOLOGY Co.,Ltd. Registration number: Y2022990000161 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20230131 Granted publication date: 20200825 Pledgee: Shanghai Guotai Junan Securities Asset Management Co.,Ltd. Pledgor: ZHEJIANG BYAI TECHNOLOGY Co.,Ltd. Registration number: Y2022990000161 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right |