WO2023211369A3

WO2023211369A3 - Speech recognition model generation method and apparatus, speech recognition method and apparatus, medium, and device

Info

Publication number: WO2023211369A3
Application number: PCT/SG2023/050236
Authority: WO
Inventors: 马娆; 吴璟成; 马泽君
Original assignee: 脸萌有限公司
Priority date: 2022-04-25
Filing date: 2023-04-06
Publication date: 2024-03-21
Also published as: WO2023211369A2; CN114765025A

Abstract

The present disclosure relates to a speech recognition model generation method and apparatus, a speech recognition method and apparatus, a medium, and a device. The speech recognition model generation method comprises: obtaining a target named entity word list, the target named entity word list comprising a plurality of named entity words; performing screening on preset text data on the basis of the named entity words in the target named entity word list to obtain target text data containing the named entity words; performing speech synthesis processing on the target text data to determine target audio data; determining target training data on the basis of the target audio data; newly performing training on a pre-trained speech recognition model on the basis of initial training data and the target training data to obtain a target speech recognition model, the initial training data being audio data used for training to obtain the pre-trained speech recognition model. The target speech recognition model obtained by the speech recognition model generation method, the recognition accuracy of named entity words can be improved.