CN105391873A

CN105391873A - Method for realizing local voice recognition in mobile device

Info

Publication number: CN105391873A
Application number: CN201510834406.4A
Authority: CN
Inventors: 景蔚亮; 陈邦明
Original assignee: Shanghai Xinchu Integrated Circuit Co Ltd
Current assignee: Shanghai Xinchu Integrated Circuit Co Ltd
Priority date: 2015-11-25
Filing date: 2015-11-25
Publication date: 2016-03-09

Abstract

The invention discloses a method for realizing local voice intelligent recognition in a mobile handheld device, and utilizes a 3D nonvolatile memory to locally store voice database information of each device user and an artificial neural network learning database. The characteristic of a 3D nonvolatile memory technology is not realized through chip stacking or 3D packaging, but by adoption of a 3D technology by a memory cell, and thus high storage density can be achieved. According to the method provided by the invention, the voice database information of each device user and the artificial neural network learning database are locally stored in the 3D nonvolatile memory, the step of data transmission of the mobile handheld device through a network and a cloud data center is avoided, thereby greatly improving the response speed of voice recognition, and guaranteeing security of use. To further reduce power consumption, a baseband processor responds to a user voice command, and an application process and a memory which are serious in electric leakage are in a dormant state, and thus power consumption is further reduced.

Description

One realizes local voice in a mobile device and knows method for distinguishing

Technical field

The present invention relates to field of speech recognition, particularly relate to one and realize local voice knowledge method for distinguishing in a mobile device.

Background technology

Along with deepening continuously of studying artificial neural network (ANN), modern science and technology technology has made great progress in artificial intelligence field.Such as, pattern recognition, intelligent robot, the automatically field such as control and speech recognition technology, all show good intelligent characteristic.Wherein, speech recognition technology will substitute key-press input, become the next developing direction of mobile hand-held device.

Because speech recognition technology needs the speech database that can store vast capacity data, and artificial neural network study also needs the support of mass storage, and ability of data processing requires also very high, therefore the realization of intelligent sound recognition technology generally realizes in data center beyond the clouds.Current mobile hand-held device (such as mobile phone, panel computer) storage capacity and data-handling capacity are all quite limited, are therefore difficult to realize intelligent sound identification.

The speech database of high in the clouds data center is for general population widely, can not formulate individual speech database specific to the accent of someone, intonation, term custom and word speed etc., therefore the accuracy of the speech database at cloud device center is not identical concerning different individual.

Mobile hand-held device to the basic procedure of speech processes, as shown in Figure 1.Mobile hand-held device receives the speech data of user, by network, the speech data received can be sent to high in the clouds data center, by high in the clouds data center to speech data after treatment, command operation after resolving is sent it back mobile hand-held device by network again, and mobile hand-held device makes response according to this command operation.This shows, real-time network transfer speeds affects the latency that can mobile hand-held device respond fast.Common mobile hand-held device is normally performed by mobile hand-held device internal applications processor (Applicationprocessor) speech recognition and process and processes in internal memory.Therefore application processor and internal memory must remain that opening can voice responsive order in time, obvious power consumption can increase greatly, in order to ensure longer flying power, mobile hand-held device needs the battery of high power capacity as support, and this will increase the cost of mobile hand-held device undoubtedly.

Therefore, those skilled in the art is devoted to develop a kind of method realizing local voice Intelligent Recognition in mobile hand-held device, improves the accuracy of speech recognition, accelerates voice response speed, reduce power consumption.

Summary of the invention

Because the above-mentioned defect of prior art, technical problem to be solved by this invention how to realize speech recognition in this locality of mobile device, improves the accuracy and quickening response speed that identify.

For achieving the above object, the invention provides one and realize local voice knowledge method for distinguishing in a mobile device, based on 3D nonvolatile memory, in described mobile device this locality stores, set up speech database and artificial neural network learning database.

Further, described speech database carries out learning for the accent of each equipment user, intonation, term custom and word speed thus analyze and store.

Further, described mobile device is mobile phone or panel computer.

Further, base band processor module is configured to carry out user speech Intelligent Recognition, comprises response user voice command.

Further, base band processor module and 3D nonvolatile memory integrate, and described base band processor module is fabricated on 3D nonvolatile memory silicon substrate.

Further, described mobile device is configured to light or do not light screen and can carries out local voice Intelligent Recognition.

Further, described 3D nonvolatile memory refers to that memory cell array adopts 3D technique.

Further, the silicon substrate of described 3D nonvolatile memory is body silicon or silicon-on-insulator.

The present invention proposes a kind of method realizing local voice Intelligent Recognition in mobile hand-held device, utilizes 3D nonvolatile memory to store for the speech data library information of each equipment user and artificial neural network learning database in this locality.Described mobile hand-held device can be mobile phone, panel computer etc.The feature of 3D non-volatile memory technologies of the present invention is not realized by the stacking of chip or 3D encapsulation, but memory cell employing is 3D technique, thus can reach the storage density of superelevation.

As shown in Figure 2 be the structural representation of 3D nonvolatile memory of the present invention.Wherein, 1 is the storage array of 3D nonvolatile memory, in order to store speech data library information for each equipment user and artificial neural network learning database in this locality; 2 is silicon substrate, can make body silicon or silicon-on-insulator, in order to realize the peripheral logical circuit (such as, decoding circuit, read/write circuit, control circuit, output input circuit etc.) of 3D nonvolatile memory.In addition, the 3D nonvolatile memory (NVM) of this superelevation of the present invention storage density can also substitute the storage chip (being generally nand flash memory chip) in traditional mobile hand-held device.The present invention stores speech data library information for each equipment user and artificial neural network learning database by local in 3D nonvolatile memory, avoid mobile hand-held device transmits data step by network and high in the clouds data center, thus substantially increase the response speed of speech recognition, more ensure that the fail safe of use.Because these data are for specific user, can carry out learning for the accent of each different user, intonation, term custom and word speed etc. thus analyze and store, therefore can carry out speech recognition to individual subscriber more accurately.In order to reduce power consumption further, baseband processor can be allowed to respond user voice command, and allow the severe application processor of electric leakage and internal memory be in resting state, thus more reducing power consumption.In order to reduce power consumption further and improve response speed, 3D nonvolatile memory and baseband processor can also integrate by the present invention, as shown in Figure 3.Wherein, the 3 dimensional drawing that (1) realizes for 3D nonvolatile memory of the present invention and baseband processor, (2) are sectional view.Wherein, on silicon chip be the storage array of 3D nonvolatile memory; In substrate silicon except realize 3D nonvolatile memory peripheral logical circuit (such as, decoding circuit, read/write circuit, control circuit, output input circuit etc.) outside, also will realize baseband processor logical circuit.The present invention is integrated with 3D nonvolatile memory and baseband processor on a chips simultaneously, substantially increases silicon chip utilance, and reduces manufacturing cost; Meanwhile, the response speed of speech recognition can be further increased, also can save power consumption further.

Therefore, this method realizing local voice Intelligent Recognition in mobile hand-held device of the present invention, be stored in local jumbo 3D nonvolatile memory by for the speech data library information of each equipment user and artificial neural network learning database, improve the accuracy of speech recognition, accelerate voice response speed, reduce power consumption.Further, a chips is integrated with 3D nonvolatile memory and baseband processor simultaneously, silicon chip utilance can be substantially increased, and reduce manufacturing cost.

Be described further below with reference to the technique effect of accompanying drawing to design of the present invention, concrete structure and generation, to understand object of the present invention, characteristic sum effect fully.

Accompanying drawing explanation

Fig. 1 is that in prior art, mobile device relies on high in the clouds to realize the functional schematic of speech recognition;

Fig. 2 is the structural representation of the 3D nonvolatile memory of a preferred embodiment of the present invention;

Fig. 3 is the 3 dimensional drawing that realizes of the 3D nonvolatile memory of a preferred embodiment of the present invention and baseband processor and sectional view;

Fig. 4 is the voice operating schematic diagram of the mobile device response user of a preferred embodiment of the present invention.

Embodiment

Be further elaborated under lifting an instantiation below:

Mobile phone traditional at present only supports button operation, if user drives, user wants suddenly to check a mail, he just must pick up mobile phone and light screen by button, then mailbox position is found, then opened the mail wanting to check by button, and in startup procedure, do such thing be danger close.If adopt this method that can realize local voice Intelligent Recognition in mobile hand-held device of the present invention, just can be simply many.User can pick up mobile phone, only needs to carry out voice operating to mobile phone just passable.As shown in Figure 4, the voice operating of mobile phone response user, to be analyzed and coupling by the speech database of baseband processor in the 3D nonvolatile memory inside of this locality and artificial neural network learning data library lookup, mobile phone screen can light, then respond corresponding voice operating fast, user is wanted the e-mail messages searched feeds back to user by speech form.Visible, this method realizing speech recognition in this locality of the present invention, without the need to button operation, soon, more safer, more economize power consumption.This intelligent sound operation of the present invention is except for checking mail, and can also be used to make a phone call, check or answer short message, speech cipher inputs, and plays music, reading articles etc., is applicable to widely among people's life.

More than describe preferred embodiment of the present invention in detail.Should be appreciated that the ordinary skill of this area just design according to the present invention can make many modifications and variations without the need to creative work.Therefore, all technical staff in the art, all should by the determined protection range of claims under this invention's idea on the basis of existing technology by the available technical scheme of logical analysis, reasoning, or a limited experiment.

Claims

1. realize local voice in a mobile device and know a method for distinguishing, it is characterized in that, based on 3D nonvolatile memory, in described mobile device this locality stores, set up speech database and artificial neural network learning database.

2. realize as claimed in claim 1 local voice in a mobile device and know method for distinguishing, it is characterized in that, described speech database carries out learning for the accent of each equipment user, intonation, term custom and word speed thus analyze and store.

3. realize local voice as claimed in claim 1 in a mobile device and know method for distinguishing, it is characterized in that, described mobile device is mobile phone or panel computer.

4. realize local voice as claimed in claim 3 in a mobile device and know method for distinguishing, it is characterized in that, base band processor module is configured to carry out user speech Intelligent Recognition, comprises response user voice command.

5. realize local voice as claimed in claim 3 in a mobile device and know method for distinguishing, it is characterized in that, base band processor module and 3D nonvolatile memory integrate, and described base band processor module is fabricated on 3D nonvolatile memory silicon substrate.

6., as the local voice that realizes in a mobile device in claim 3 ~ 5 as described in any knows method for distinguishing, it is characterized in that, described mobile device is configured to light or do not light screen can carry out local voice Intelligent Recognition.

7., as the local voice that realizes in a mobile device in Claims 1 to 5 as described in any knows method for distinguishing, it is characterized in that, described 3D nonvolatile memory refers to that memory cell array adopts 3D technique.

8., as the local voice that realizes in a mobile device in Claims 1 to 5 as described in any knows method for distinguishing, it is characterized in that, the silicon substrate of described 3D nonvolatile memory is body silicon or silicon-on-insulator.