CN111243570A - Voice acquisition method and device and computer readable storage medium - Google Patents

Voice acquisition method and device and computer readable storage medium Download PDF

Info

Publication number
CN111243570A
CN111243570A CN202010060939.2A CN202010060939A CN111243570A CN 111243570 A CN111243570 A CN 111243570A CN 202010060939 A CN202010060939 A CN 202010060939A CN 111243570 A CN111243570 A CN 111243570A
Authority
CN
China
Prior art keywords
voice
text information
text
cache
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010060939.2A
Other languages
Chinese (zh)
Other versions
CN111243570B (en
Inventor
李永强
雷欣
李志飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Volkswagen China Investment Co Ltd
Mobvoi Innovation Technology Co Ltd
Original Assignee
Mobvoi Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mobvoi Information Technology Co Ltd filed Critical Mobvoi Information Technology Co Ltd
Priority to CN202010060939.2A priority Critical patent/CN111243570B/en
Publication of CN111243570A publication Critical patent/CN111243570A/en
Application granted granted Critical
Publication of CN111243570B publication Critical patent/CN111243570B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a voice acquisition method, a voice acquisition device and a computer readable storage medium, wherein the voice acquisition method comprises the following steps: acquiring text information; splitting the acquired text information to obtain a plurality of split texts; judging whether the obtained split texts exist in a first voice cache one by one; and if the split text is judged to exist in the first voice cache, extracting the voice information corresponding to the split text from the first voice cache. Therefore, after the text information is received, the corresponding voice information can be fed back quickly, the calculation cost of the voice synthesis system is reduced, and the voice acquisition efficiency is greatly improved.

Description

Voice acquisition method and device and computer readable storage medium
Technical Field
The present invention relates to the field of speech synthesis technologies, and in particular, to a speech acquisition method and apparatus, and a computer-readable storage medium.
Background
TTS systems (speech synthesis systems) convert text information into speech information, which takes a certain time overhead. If the TTS system adopts the most advanced neural network model (such as tacotron, wavenet, wavernn, lpcnet and the like), the synthesis speed is very slow, and the rapid synthesis cannot be achieved in many cases.
Disclosure of Invention
Embodiments of the present invention provide a method and an apparatus for obtaining speech, and a computer-readable storage medium, which can reduce the computation overhead of a speech synthesis system and improve the efficiency of obtaining speech.
One aspect of the present invention provides a method for acquiring a voice, including: acquiring text information; splitting the acquired text information to obtain a plurality of split texts; judging whether the obtained split texts exist in a first voice cache one by one; and if the split text is judged to exist in the first voice cache, extracting the voice information corresponding to the split text from the first voice cache.
In an embodiment, the method further comprises: and if the first voice cache does not have the split text, carrying out voice synthesis on the split text to obtain corresponding voice information.
In an embodiment, the method further comprises: and storing the text information and the corresponding voice information into a second voice cache.
In an implementation manner, before splitting the acquired text information to obtain a plurality of split texts, the method further includes: judging whether the text information exists in a second voice cache; if the text information is judged to be in the second voice cache, acquiring the voice information corresponding to the text information from the second voice cache; and splitting the acquired text information to obtain a plurality of split texts if the question information is judged not to exist in the second voice cache.
In an embodiment, before determining whether the text information exists in the second voice cache, the method further comprises: judging whether the text information exists in a voice database; if the text information is judged to be in the voice database, acquiring the voice information corresponding to the text information from the voice database; and if the text information is not judged to exist in the voice database, judging whether the text information exists in the second voice buffer.
In one embodiment, the voice database is a shared resource, and the first voice cache and the second voice cache are exclusive resources.
Another aspect of the present invention provides a speech acquisition apparatus, including: the text acquisition module is used for acquiring text information; the text splitting module is used for splitting the acquired text information to obtain a plurality of split texts; the split text judgment module is used for judging whether the obtained split texts exist in the first voice cache one by one; and the voice extraction module is used for extracting the voice information corresponding to the split text from the first voice cache if the split text is judged to exist in the first voice cache by the split text judgment module.
In an implementation manner, before the text splitting module splits the acquired text information to obtain a plurality of split texts, the apparatus further includes: the cache judging module is used for judging whether the text information exists in a second voice cache; if the text information is judged to be in the second voice cache, acquiring the voice information corresponding to the text information from the second voice cache; and splitting the acquired text information to obtain a plurality of split texts if the question information is judged not to exist in the second voice cache.
In an embodiment, before the cache determination module determines whether the text information exists in the second voice cache, the method further includes: the database judging module is used for judging whether the text information exists in a voice database; if the text information is judged to be in the voice database, acquiring the voice information corresponding to the text information from the voice database; and if the text information is not judged to exist in the voice database, judging whether the text information exists in the second voice buffer.
Another aspect of the invention provides a computer-readable storage medium comprising a set of computer-executable instructions that, when executed, perform any of the speech acquisition methods described above.
In the embodiment of the invention, some phrases and corresponding voice information are stored in the first voice cache in advance. Therefore, after the text information is received, the text is split to obtain a plurality of split texts, whether the split texts exist in the first voice cache or not is judged, if the split texts exist in the first voice cache, the corresponding voice information is directly extracted, and the split texts which do not exist in the first voice cache are subjected to subsequent processing.
Therefore, after the text information is received, the corresponding voice information can be fed back quickly, the calculation cost of the voice synthesis system is reduced, and the voice acquisition efficiency is greatly improved.
Drawings
The above and other objects, features and advantages of exemplary embodiments of the present invention will become readily apparent from the following detailed description read in conjunction with the accompanying drawings. Several embodiments of the invention are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings and in which:
in the drawings, the same or corresponding reference numerals indicate the same or corresponding parts.
Fig. 1 is a schematic flow chart illustrating an implementation of a speech acquisition method according to an embodiment of the present invention;
fig. 2 is a schematic flow chart of a specific implementation of a speech acquisition method according to an embodiment of the present invention;
fig. 3 is a schematic structural diagram of a speech acquisition apparatus according to an embodiment of the present invention.
Detailed Description
In order to make the objects, features and advantages of the present invention more obvious and understandable, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Fig. 1 is a schematic flow chart illustrating an implementation of a speech acquisition method according to an embodiment of the present invention.
As shown in fig. 1, an aspect of the present invention provides a method for acquiring a voice, where the method includes:
step 101, acquiring text information;
step 102, splitting the acquired text information to obtain a plurality of split texts;
step 103, judging whether the obtained split texts exist in a first voice cache one by one;
and 104, if the split text is judged to exist in the first voice cache, extracting the voice information corresponding to the split text from the first voice cache.
In this embodiment, first, text information is acquired, where the text information may be provided by a user.
And then splitting the acquired text information to obtain a plurality of split texts, wherein the splitting mode can be split according to punctuations in the text information, and the existing word segmentation tools can be used for carrying out word segmentation processing on the split texts.
Then judging whether the obtained split texts exist in a first voice cache one by one; wherein, the first voice cache is used for storing some common phrases and corresponding voice messages, for example: "hello, i is xxx" and "hello, ask xxx", and "hello" in the two sentences is a frequently appearing phrase, the same part in the two sentences is extracted in advance, corresponding voice information is generated by the same part, and the text information and the corresponding voice information are stored in a first voice cache.
Then judging whether the split texts exist in a first voice cache one by one;
and if the split text is judged to exist in the first voice cache, extracting the voice information corresponding to the split text.
Therefore, after the text information is received, the corresponding voice information can be fed back quickly, the calculation cost of the voice synthesis system is reduced, and the voice acquisition efficiency is greatly improved.
In an embodiment, the method further comprises:
and if the first voice cache is judged not to have the split text, carrying out voice synthesis on the first voice cache to obtain corresponding voice information.
In this embodiment, if one or more of the split texts does not exist in the first voice cache, the split texts are converted by using a voice synthesis technology to obtain corresponding voice information, and the obtained voice information is fed back to the user.
In an embodiment, the method further comprises:
and storing the text information and the corresponding voice information into a second voice cache.
In this embodiment, after the above steps, the entire text information and the corresponding voice information are stored in the second voice cache. The second voice cache is mainly used for storing text information and voice information which are requested to be synthesized by a recent user.
In an implementation manner, before splitting the acquired text information to obtain a plurality of split texts, the method further includes:
judging whether the text information exists in a second voice cache;
if the text information is judged to be in the second voice cache, acquiring the voice information corresponding to the text information from the second voice cache;
and splitting the acquired text information to obtain a plurality of split texts if the question information is judged not to exist in the second voice cache.
In this embodiment, before splitting the acquired text information to obtain a plurality of split texts, it is determined whether the text information exists in the second voice cache, if the text information exists in the second voice cache, the voice information corresponding to the text information is directly extracted, and if it is determined that the text information does not exist in the second voice cache, the text information is split to obtain a plurality of split texts, and then the subsequent steps are continued.
Therefore, when the text information exists in the second voice cache, the corresponding voice information can be directly extracted and fed back to the user, and the voice acquisition efficiency is improved.
In an embodiment, before determining whether the text information exists in the second voice cache, the method further comprises:
judging whether the text information exists in a voice database;
if the text information is judged to be in the voice database, acquiring the voice information corresponding to the text information from the voice database;
and if the text information is not in the voice database, judging whether the text information exists in the second voice buffer.
In this embodiment, before determining whether the text information exists in the second voice cache, it is determined whether the text information exists in the voice database, where the voice database is used to store a large amount of text information with high use frequency and corresponding voice information. During storage, all historical text information is sorted from high to low according to frequency, tens of thousands of pieces of text information with high frequency and corresponding voice information are selected and stored into a voice database.
After a user inputs text information, whether the text information exists in the voice database is judged firstly, if the text information exists in the voice database, corresponding voice information is extracted from the voice database, and if the text information does not exist in the voice database, whether the text information exists in the second voice cache is judged continuously.
Therefore, when the text information exists in the voice database, the corresponding voice information can be directly extracted and fed back to the user, and the voice acquisition efficiency is improved.
In one embodiment, the voice database is a shared resource, and the first voice cache and the second voice cache are exclusive resources.
In this embodiment, the voice database is a shared resource for the distributed server to access; the first voice cache and the second voice cache are independent resources and are located in each server.
Fig. 2 is a schematic flow chart of a specific implementation of a speech acquisition method according to an embodiment of the present invention.
As shown in fig. 2, after the user inputs text information at the local end, the local end transmits the input text information to a server, and the server first transmits the text information to a Remote dictionary server (Remote dictionary server) which is the above-mentioned voice database, and the storage system stores key-value pair information, where the key information is text information and the value information is corresponding voice information. Searching whether the text information exists in the storage system, if the text information exists in the storage system, extracting the voice information corresponding to the text information, feeding the voice information back to the server, and feeding the voice information back to the local terminal by the server.
If the text information does not exist in the storage system, the storage system feeds back an instruction which is not found by the server, the server stores the text information in an LRU Cache (least recently used algorithm), and the LRU Cache is the second voice Cache mentioned above and is used for storing the recently accessed text information and the corresponding voice information. The server judges whether the text information exists in the LRU Cache, if the text information exists in the LRU Cache, the voice information corresponding to the text information is extracted, and the voice information is fed back to the local terminal.
If the text information does not exist in the LRU Cache, the text information is split to obtain a plurality of split texts, and then it is determined whether a Prefix Cache (Prefix Cache) in the server exists in the plurality of split texts, where the Prefix Cache is the first voice Cache mentioned above and is mainly used for storing phrases and corresponding voice information. And if the split texts do not exist in Prefix Cache, converting the split texts into corresponding voice information by using a voice synthesis system, and feeding back the obtained voice information to the local terminal.
By setting Redis, LRU Cache and Prefix Cache, corresponding voice information can be quickly searched, the calculation overhead of a voice synthesis system is reduced, the voice acquisition efficiency is greatly improved, and a second-come feeling is provided for a user.
Fig. 3 is a schematic structural diagram of a speech acquisition apparatus according to an embodiment of the present invention.
As shown in fig. 3, another aspect of the embodiment of the present invention provides a speech acquiring apparatus, including:
a text acquisition module 201, configured to acquire text information;
the text splitting module 202 is configured to split the acquired text information to obtain a plurality of split texts;
the split text judgment module 203 is configured to judge whether the obtained multiple split texts exist in the first voice cache one by one;
the voice extracting module 204 is configured to, if it is determined by the split text determining module 203 that the split text exists in the first voice cache, extract the voice information corresponding to the split text from the first voice cache.
In this embodiment, the text information is first acquired by the text acquisition module 201, where the text information may be provided by a user.
The obtained text information is then split by the text splitting module 202 to obtain a plurality of split texts, wherein the splitting mode can be split according to punctuations in the text information, and the existing word segmentation tools can also be used for performing word segmentation processing on the text information.
Then, the split text judgment module 203 judges whether the obtained split texts exist in the first voice cache one by one; wherein, the first voice cache is used for storing some common phrases and corresponding voice messages, for example: "hello, i is xxx" and "hello, ask xxx", and "hello" in the two sentences is a frequently appearing phrase, the same part in the two sentences is extracted in advance, corresponding voice information is generated by the same part, and the text information and the corresponding voice information are stored in a first voice cache. Then judging whether the split texts exist in a first voice cache one by one;
if the split text determination module 203 determines that the split text exists in the first voice cache, the voice extraction module 204 extracts the voice information corresponding to the split text.
Therefore, after the text information is received, the corresponding voice information can be fed back quickly, the calculation cost of the voice synthesis system is reduced, and the voice acquisition efficiency is greatly improved.
In an implementation manner, before the text splitting module 202 splits the acquired text information to obtain a plurality of split texts, the apparatus further includes:
a buffer judgment module 2012, configured to judge whether the text information exists in the second voice buffer;
if the text information is judged to be in the second voice cache, acquiring the voice information corresponding to the text information from the second voice cache;
and splitting the acquired text information to obtain a plurality of split texts if the question information is judged not to exist in the second voice cache.
In this embodiment, before the text splitting module 202 splits the acquired text information to obtain a plurality of split texts, the cache determination module 2012 determines whether the text information exists in the second voice cache, if the text information exists in the second voice cache, the voice information corresponding to the text information is directly extracted, and if the text information does not exist in the second voice cache, the text information is split by the text splitting module 202 to obtain a plurality of split texts, and then the subsequent steps are continued.
Therefore, when the text information exists in the second voice cache, the corresponding voice information can be directly extracted and fed back to the user, and the voice acquisition efficiency is improved.
In an embodiment, before the buffer determination module 2012 determines whether the text information exists in the second voice buffer, the method further includes:
a database determination module 2011, configured to determine whether the text information exists in the voice database;
if the text information is judged to be in the voice database, acquiring the voice information corresponding to the text information from the voice database;
and if the text information is not in the voice database, judging whether the text information exists in the second voice buffer.
In this embodiment, before the cache determination module 2012 determines whether the text information exists in the second voice cache, the database determination module 2011 determines whether the text information exists in a voice database, where the voice database is used for storing a large amount of text information with a high use frequency and corresponding voice information. During storage, all historical text information is sorted from high to low according to frequency, tens of thousands of pieces of text information with high frequency and corresponding voice information are selected and stored into a voice database.
After the user inputs the text message, the database determination module 2011 first determines whether the text message exists in the voice database, if it is determined that the text message exists in the voice database, extracts the corresponding voice message from the voice database, and if it is determined that the text message does not exist in the voice database, it continues to determine whether the text message exists in the second voice cache through the cache determination module 2012.
Therefore, when the text information exists in the voice database, the corresponding voice information can be directly extracted and fed back to the user, and the voice acquisition efficiency is improved.
Another aspect of the invention provides a computer-readable storage medium comprising a set of computer-executable instructions that, when executed, perform any of the speech acquisition methods described above.
In an embodiment of the present invention, a computer-readable storage medium includes a set of computer-executable instructions that, when executed, operate to obtain textual information; splitting the acquired text information to obtain a plurality of split texts; judging whether the obtained split texts exist in a first voice cache one by one; and if the split text is judged to exist in the first voice cache, extracting the voice information corresponding to the split text from the first voice cache.
Therefore, after the text information is received, the corresponding voice information can be fed back quickly, the calculation cost of the voice synthesis system is reduced, and the voice acquisition efficiency is greatly improved.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.
Furthermore, the terms "first", "second" and "first" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present invention, "a plurality" means two or more unless specifically defined otherwise.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. A method for speech acquisition, the method comprising:
acquiring text information;
splitting the acquired text information to obtain a plurality of split texts;
judging whether the obtained split texts exist in a first voice cache one by one;
and if the split text is judged to exist in the first voice cache, extracting the voice information corresponding to the split text from the first voice cache.
2. The method of claim 1, further comprising:
and if the first voice cache does not have the split text, carrying out voice synthesis on the split text to obtain corresponding voice information.
3. The method of claim 2, further comprising:
and storing the text information and the corresponding voice information into a second voice cache.
4. The method according to claim 1 or 3, wherein before splitting the acquired text information into a plurality of split texts, the method further comprises:
judging whether the text information exists in a second voice cache;
if the text information is judged to be in the second voice cache, acquiring the voice information corresponding to the text information from the second voice cache;
and splitting the acquired text information to obtain a plurality of split texts if the question information is judged not to exist in the second voice cache.
5. The method of claim 4, wherein prior to determining whether the text information is present in the second speech buffer, the method further comprises:
judging whether the text information exists in a voice database;
if the text information is judged to be in the voice database, acquiring the voice information corresponding to the text information from the voice database;
and if the text information is not judged to exist in the voice database, judging whether the text information exists in the second voice buffer.
6. The method of claim 5, wherein the voice database is a shared resource and the first voice cache and the second voice cache are exclusive resources.
7. A speech acquisition apparatus, characterized in that the apparatus comprises:
the text acquisition module is used for acquiring text information;
the text splitting module is used for splitting the acquired text information to obtain a plurality of split texts;
the split text judgment module is used for judging whether the obtained split texts exist in the first voice cache one by one;
and the voice extraction module is used for extracting the voice information corresponding to the split text from the first voice cache if the split text is judged to exist in the first voice cache by the split text judgment module.
8. The apparatus according to claim 7, wherein before the text splitting module splits the acquired text information to obtain a plurality of split texts, the apparatus further comprises:
the cache judging module is used for judging whether the text information exists in a second voice cache;
if the text information is judged to be in the second voice cache, acquiring the voice information corresponding to the text information from the second voice cache;
and splitting the acquired text information to obtain a plurality of split texts if the question information is judged not to exist in the second voice cache.
9. The apparatus of claim 8, wherein before the buffer determination module determines whether the text message exists in a second speech buffer, the method further comprises:
the database judging module is used for judging whether the text information exists in a voice database;
if the text information is judged to be in the voice database, acquiring the voice information corresponding to the text information from the voice database;
and if the text information is not judged to exist in the voice database, judging whether the text information exists in the second voice buffer.
10. A computer-readable storage medium comprising a set of computer-executable instructions that, when executed, perform a speech acquisition method according to any one of claims 1-6.
CN202010060939.2A 2020-01-19 2020-01-19 Voice acquisition method and device and computer readable storage medium Active CN111243570B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010060939.2A CN111243570B (en) 2020-01-19 2020-01-19 Voice acquisition method and device and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010060939.2A CN111243570B (en) 2020-01-19 2020-01-19 Voice acquisition method and device and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN111243570A true CN111243570A (en) 2020-06-05
CN111243570B CN111243570B (en) 2023-01-31

Family

ID=70864237

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010060939.2A Active CN111243570B (en) 2020-01-19 2020-01-19 Voice acquisition method and device and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN111243570B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111953853A (en) * 2020-07-31 2020-11-17 中国工商银行股份有限公司 Voice reading processing method and device
CN112102807A (en) * 2020-08-17 2020-12-18 招联消费金融有限公司 Speech synthesis method, apparatus, computer device and storage medium
CN113421542A (en) * 2021-06-22 2021-09-21 广州小鹏汽车科技有限公司 Voice interaction method, server, voice interaction system and storage medium
CN113421564A (en) * 2021-06-22 2021-09-21 广州小鹏汽车科技有限公司 Voice interaction method, voice interaction system, server and storage medium
EP3958582A3 (en) * 2021-06-22 2022-05-11 Guangzhou Xiaopeng Motors Technology Co., Ltd. Voice interaction method, voice interaction system and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103093753A (en) * 2012-12-14 2013-05-08 沈阳美行科技有限公司 Navigation system user voice custom method
US9646601B1 (en) * 2013-07-26 2017-05-09 Amazon Technologies, Inc. Reduced latency text-to-speech system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103093753A (en) * 2012-12-14 2013-05-08 沈阳美行科技有限公司 Navigation system user voice custom method
US9646601B1 (en) * 2013-07-26 2017-05-09 Amazon Technologies, Inc. Reduced latency text-to-speech system

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111953853A (en) * 2020-07-31 2020-11-17 中国工商银行股份有限公司 Voice reading processing method and device
CN112102807A (en) * 2020-08-17 2020-12-18 招联消费金融有限公司 Speech synthesis method, apparatus, computer device and storage medium
CN113421542A (en) * 2021-06-22 2021-09-21 广州小鹏汽车科技有限公司 Voice interaction method, server, voice interaction system and storage medium
CN113421564A (en) * 2021-06-22 2021-09-21 广州小鹏汽车科技有限公司 Voice interaction method, voice interaction system, server and storage medium
EP3958582A3 (en) * 2021-06-22 2022-05-11 Guangzhou Xiaopeng Motors Technology Co., Ltd. Voice interaction method, voice interaction system and storage medium
EP3929915A3 (en) * 2021-06-22 2022-05-18 Guangzhou Xiaopeng Motors Technology Co., Ltd. Voice interaction method, server, voice interaction system and storage medium

Also Published As

Publication number Publication date
CN111243570B (en) 2023-01-31

Similar Documents

Publication Publication Date Title
CN111243570B (en) Voice acquisition method and device and computer readable storage medium
CN107301170B (en) Method and device for segmenting sentences based on artificial intelligence
CN110853625B (en) Speech recognition model word segmentation training method and system, mobile terminal and storage medium
CN107357777B (en) Method and device for extracting label information
CN102236639A (en) System and method for updating language model
CN112700769A (en) Semantic understanding method, device, equipment and computer readable storage medium
CN111488468A (en) Geographic information knowledge point extraction method and device, storage medium and computer equipment
CN111079410A (en) Text recognition method and device, electronic equipment and storage medium
CN111737464A (en) Text classification method and device and electronic equipment
CN110825840B (en) Word bank expansion method, device, equipment and storage medium
CN111079428B (en) Word segmentation and industry dictionary construction method and device and readable storage medium
CN111859950A (en) Method for automatically generating lecture notes
CN109062891B (en) Media processing method, device, terminal and medium
CN111414471A (en) Method and apparatus for outputting information
CN110674243A (en) Corpus index construction method based on dynamic K-means algorithm
CN112466287A (en) Voice segmentation method and device and computer readable storage medium
CN110413770B (en) Method and device for classifying group messages into group topics
CN116644159A (en) Keyword extraction method, keyword extraction device, keyword extraction equipment and computer readable storage medium
CN111783433A (en) Text retrieval error correction method and device
CN111737424A (en) Question matching method, device, equipment and storage medium
CN112259092B (en) Voice broadcasting method and device and voice interaction equipment
CN115691503A (en) Voice recognition method and device, electronic equipment and storage medium
CN110895538A (en) Data retrieval method, device, storage medium and processor
CN114330280A (en) Sensitive data identification method and device
CN114969001A (en) Database metadata field matching method, device, equipment and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20211115

Address after: 210000 floor 8, building D11, Hongfeng Science Park, Nanjing Economic and Technological Development Zone, Jiangsu Province

Applicant after: New Technology Co.,Ltd.

Applicant after: VOLKSWAGEN (CHINA) INVESTMENT Co.,Ltd.

Address before: 1001, floor 10, office building a, No. 19, Zhongguancun Street, Haidian District, Beijing 100044

Applicant before: MOBVOI INFORMATION TECHNOLOGY Co.,Ltd.

GR01 Patent grant
GR01 Patent grant