CN107657031A - Method for managing smart speaker voice skills based on the Android system

Info

Publication number: CN107657031A
Authority: CN (China)
Prior art keywords: skill, voice, voice skill, speech engine, service
Legal status: Pending (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Application number: CN201710896939.4A
Other languages: Chinese (zh)
Inventor: 董天旭
Current Assignee: Sichuan Changhong Electric Co Ltd (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Original Assignee: Sichuan Changhong Electric Co Ltd
Priority date: 2017-09-28 (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date: 2017-09-28
Publication date: 2018-02-02
Application filed by: Sichuan Changhong Electric Co Ltd

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention belongs to the field of software design and discloses a method for managing smart speaker voice skills based on the Android system. It addresses the problems of conventional designs, in which voice skills cannot be updated independently or added and removed dynamically, the code base is very large, and the code is hard to manage. The invention designs the speech engine and each voice skill as independent applications; defines a parent class service for voice skills, which in turn defines a communication interface; has each voice skill inherit the parent class service and implement the communication interface; has the speech engine parse the application configuration file of each voice skill to identify the business that skill is responsible for; and, after speech recognition, has the speech engine dispatch the voice instruction to the designated voice skill through the parent class service's communication interface according to the specific business type.

Description

Method for managing smart speaker voice skills based on the Android system
Technical field
The invention belongs to the field of software design, and in particular relates to a method for managing smart speaker voice skills based on the Android system.
Background
Smart speakers saw explosive growth in 2017 and have become a hotly contested entry point for artificial intelligence. Smart speakers currently run mainly on two systems, Linux and Android. Android, as a smart operating system, has already been deployed successfully on devices such as mobile phones, tablets, and TVs. To preserve continuity of technology and products, and to apply the technology and products of other smart devices directly to smart speakers, running Android on a smart speaker is a very good choice.
In an Android-based smart speaker, the speech engine and the voice skills are usually provided by different solution vendors, and the corresponding voice skill must be executed after speech recognition. The common practice today is to integrate the speech engine and the voice skills into a single application. With this approach, voice skills cannot be updated independently or added and removed dynamically. The application code also becomes very large, and because the speech engine and the voice skills usually come from different developers, the code is hard to manage.
Summary of the invention
The technical problem to be solved by the invention is to propose a method for managing smart speaker voice skills based on the Android system, which solves the problems of conventional designs in which voice skills cannot be updated independently or added and removed dynamically, the code base is very large, and the code is hard to manage.
The technical solution adopted by the invention to solve this technical problem is as follows.
A method for managing smart speaker voice skills based on the Android system, including:
A. The speech engine and the voice skills are designed as independent applications;
B. A parent class service is defined for voice skills, and the parent class service defines a communication interface;
C. Each voice skill inherits the parent class service and implements the communication interface;
D. The speech engine parses the application configuration file of each voice skill to identify the business the voice skill is responsible for;
E. After speech recognition, the speech engine dispatches the voice instruction to the designated voice skill through the parent class service's communication interface, according to the specific business type.
As a further refinement, in step b, defining a parent class service for voice skills, with the parent class service defining a communication interface, specifically includes: the cross-process communication interface of the parent class service is designed using Android's AIDL language. After binding a voice skill, the speech engine registers a cross-process callback interface, and the voice skill can then send messages to the speech engine by calling the methods of that callback interface. After speech recognition, the speech engine sends the voice instruction to the designated voice skill through the communication interface, and the voice skill executes the corresponding business logic once it receives the instruction.
As a further refinement, in step c, the voice skill inheriting the voice skill parent class service and implementing the communication interface specifically includes:
A voice skill must be declared as a service in its AndroidManifest.xml. The declaration mainly includes: an Action used to bind the voice skill, whose value is the same for all voice skills and which is also the parameter the speech engine uses to find voice skills; and a Meta-data entry used to distinguish the voice skill type.
As a further refinement, in step d, the speech engine parsing the application configuration file of each voice skill to identify the business it is responsible for specifically includes:
When the smart speaker starts, and after voice skills are updated, added, or removed, the speech engine obtains the voice skills and their types. Voice skills are obtained through the Android PackageManager: any service declared in an AndroidManifest.xml with the Action set to the agreed value is a voice skill. The voice skill type is determined by parsing the registration information of the voice skill: the value of the Meta-data entry that represents the skill type is the type of the voice skill.
As a further refinement, in step e, the speech engine dispatching the voice instruction to the designated voice skill through the parent class service's communication interface after speech recognition, according to the specific business type, specifically includes:
The speech recognition result obtained by the speech engine contains the recognized voice skill type, that is, the business type. The voice skill that can handle this business is matched from the voice skill list; at this point the voice skill is represented by its package name and class name. Once the package name and class name are known, the speech engine binds the voice skill using them together with the Action. After a successful bind, the speech engine can send voice instructions to the voice skill through the communication interface, and the voice skill can send feedback messages to the speech engine through the communication interface.
As a further refinement, step a also includes: configuring all voice skills to run in the same process.
As a further refinement, the method of configuring all voice skills in the same process includes:
Setting sharedUserId and process for each voice skill, so that voice skills with the same sharedUserId and process run in one process.
The beneficial effects of the invention are as follows. Because the speech engine and the voice skills are decoupled, the speech engine can be updated independently, and voice skills can be updated independently and added or removed dynamically. In addition, although the voice skills become independent applications after decoupling, configuring all of them in the same process reduces memory usage. From the developer's point of view, code management is also decoupled, which reduces management difficulty.
Brief description of the drawings
Fig. 1 is a schematic diagram of how the present invention manages smart speaker voice skills based on the Android system.
Detailed description
The present invention aims to provide a method for managing smart speaker voice skills based on the Android system, which solves the problems of conventional designs in which voice skills cannot be updated independently or added and removed dynamically, the code base is very large, and the code is hard to manage.
As shown in Fig. 1, in the present invention the speech engine and the voice skills are designed as independent applications. A parent class service is defined for voice skills, and the parent class service defines a communication interface. Each voice skill inherits the parent class service and implements the communication interface. The speech engine parses the application configuration file of each voice skill to identify the business the voice skill is responsible for. After speech recognition, the speech engine dispatches the voice instruction to the designated voice skill through the parent class service's communication interface, according to the specific business type.
As an independent application, the speech engine is responsible for the following work:
1. Obtaining the list of voice skills and their types;
2. Matching the corresponding voice skill after speech recognition;
3. Binding the voice skill and establishing communication;
4. Sending the voice instruction (business) to the corresponding voice skill;
5. Listening for feedback from the voice skill and responding to it;
6. Unbinding and closing the voice skills when the program exits.
Each voice skill is responsible for the following work:
1. Inheriting the voice skill parent class service;
2. Implementing the communication interface;
3. Implementing the specific business logic.
In a specific implementation, the present invention includes the following design means:
1) Design of the voice skill parent class service:
The cross-process communication interface of the parent class service is designed using Android's AIDL language. After binding a voice skill, the speech engine registers a cross-process callback interface, and the voice skill can then send messages to the speech engine by calling the methods of that callback interface. After speech recognition, the speech engine sends the voice instruction to the designated voice skill through the communication interface, and the voice skill executes the corresponding business logic once it receives the instruction.
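The two-way contract described above can be sketched in plain Java. In an actual Android build both interfaces would be declared in AIDL so that calls can cross process boundaries; here they are ordinary Java interfaces so the sketch runs anywhere. The names `SkillCallback`, `SkillInterface`, `BaseSkill`, `registerCallback`, `onVoiceInstruction`, and `EchoSkill` are illustrative assumptions and do not come from the patent.

```java
// Callback the speech engine registers with a skill (hypothetical name).
interface SkillCallback {
    void onSkillMessage(String message);
}

// Communication interface defined by the parent class service (hypothetical
// name). On Android this would be an AIDL interface exposed by a bound Service.
interface SkillInterface {
    void registerCallback(SkillCallback callback);
    void onVoiceInstruction(String instruction);
}

// Parent class that every voice skill extends. It stores the engine's
// callback and leaves the business logic to subclasses.
abstract class BaseSkill implements SkillInterface {
    private SkillCallback callback;

    @Override
    public void registerCallback(SkillCallback callback) {
        this.callback = callback;
    }

    // Lets a skill report back to the engine through the registered callback.
    // Messages are dropped silently if the engine has not registered yet.
    protected void sendToEngine(String message) {
        if (callback != null) {
            callback.onSkillMessage(message);
        }
    }
}

// Example skill: runs its "business logic" and reports the result.
class EchoSkill extends BaseSkill {
    @Override
    public void onVoiceInstruction(String instruction) {
        sendToEngine("done: " + instruction);
    }
}
```

Keeping the callback handling in the parent class means a concrete skill only has to implement `onVoiceInstruction`, which matches the division of work listed for voice skills.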
2) Implementation of a voice skill:
A voice skill inherits the voice skill parent class service and implements the methods of the communication interface to handle its specific voice business logic.
A voice skill must be declared as a service in its AndroidManifest.xml. The declaration mainly includes: an Action used to bind the voice skill, whose value is the same for all voice skills and which is also the parameter the speech engine uses to find voice skills; and a Meta-data entry used to distinguish the voice skill type.
3) How the speech engine obtains the voice skills:
When the smart speaker starts, and after voice skills are updated, added, or removed, the speech engine obtains the voice skills and their types. Voice skills are obtained through the Android PackageManager: any service declared in an AndroidManifest.xml with the Action set to the agreed value is a voice skill. The voice skill type is determined by parsing the registration information of the voice skill: the value of the Meta-data entry that represents the skill type is the type of the voice skill.
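The discovery rule above amounts to a filter over declared services, followed by a lookup of one meta-data value. The following plain-Java sketch models that logic with an in-memory list standing in for the manifest entries. On a device, the list would likely come from `PackageManager.queryIntentServices` called with an `Intent` carrying the shared action and the `PackageManager.GET_META_DATA` flag. The action string `com.example.voice.action.SKILL`, the meta-data key `skill_type`, and the class names are hypothetical.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Stand-in for one <service> entry read from an AndroidManifest.xml.
class ServiceDeclaration {
    final String packageName;
    final String className;
    final String action;                 // intent-filter action, may be null
    final Map<String, String> metaData;  // <meta-data> name/value pairs

    ServiceDeclaration(String packageName, String className,
                       String action, Map<String, String> metaData) {
        this.packageName = packageName;
        this.className = className;
        this.action = action;
        this.metaData = metaData;
    }
}

class SkillScanner {
    // Hypothetical values agreed between the engine and all skills.
    static final String SKILL_ACTION = "com.example.voice.action.SKILL";
    static final String META_SKILL_TYPE = "skill_type";

    // Returns skill type -> declaration for every service that declares the
    // agreed action and carries a skill-type meta-data value.
    static Map<String, ServiceDeclaration> scan(List<ServiceDeclaration> installed) {
        Map<String, ServiceDeclaration> skills = new HashMap<String, ServiceDeclaration>();
        for (ServiceDeclaration service : installed) {
            if (!SKILL_ACTION.equals(service.action)) {
                continue; // not a voice skill
            }
            String type = service.metaData.get(META_SKILL_TYPE);
            if (type != null) {
                skills.put(type, service);
            }
        }
        return skills;
    }
}
```

Re-running `scan` whenever a package is installed, updated, or removed is one way to keep the skill list current, which is the dynamic add and remove behaviour the text describes.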
4) How the speech engine dispatches instructions to a voice skill:
The speech recognition result obtained by the speech engine contains the recognized voice skill type (business type). The voice skill that can handle this business is matched from the voice skill list; at this point the voice skill is represented by its package name and class name. Once the package name and class name are known, the speech engine binds the voice skill using them together with the Action. After a successful bind, the speech engine can send voice instructions to the voice skill through the communication interface, and the voice skill can send feedback messages to the speech engine through the communication interface.
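The match, bind, and send sequence can be sketched as a small dispatcher. This is a plain-Java model, not Android code: `SkillBinder` abstracts the platform bind call (on Android, `Context.bindService` with an explicit component plays this role), and all type and method names are hypothetical. Caching the channel after the first bind is a design choice of this sketch; the patent does not say whether bindings are reused.

```java
import java.util.HashMap;
import java.util.Map;

// A skill is addressed by package name and class name, as in the text.
class SkillComponent {
    final String packageName;
    final String className;

    SkillComponent(String packageName, String className) {
        this.packageName = packageName;
        this.className = className;
    }
}

// Channel obtained after a successful bind (hypothetical name).
interface SkillChannel {
    void send(String instruction);
}

// Abstracts the platform bind call; returns null when the bind fails.
interface SkillBinder {
    SkillChannel bind(String action, SkillComponent component);
}

class Dispatcher {
    private final String action;
    private final SkillBinder binder;
    private final Map<String, SkillComponent> skillsByType;
    private final Map<String, SkillChannel> bound = new HashMap<String, SkillChannel>();

    Dispatcher(String action, SkillBinder binder, Map<String, SkillComponent> skillsByType) {
        this.action = action;
        this.binder = binder;
        this.skillsByType = skillsByType;
    }

    // Routes one recognized instruction. Returns false when no skill handles
    // the type or the bind fails.
    boolean dispatch(String skillType, String instruction) {
        SkillChannel channel = bound.get(skillType);
        if (channel == null) {
            SkillComponent component = skillsByType.get(skillType);
            if (component == null) {
                return false;
            }
            channel = binder.bind(action, component);
            if (channel == null) {
                return false;
            }
            bound.put(skillType, channel); // reuse the binding for later instructions
        }
        channel.send(instruction);
        return true;
    }
}
```

Because the dispatcher only depends on the skill type reported by recognition, adding a new skill requires no change to the engine beyond refreshing `skillsByType`.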
5) How to reduce the memory used by voice skills:
Once voice skills are independent applications, the system memory they occupy grows as the number of voice skills increases. The Android system allocates at least 8 MB of memory to each application, but some voice skills need far less than that. Therefore, all voice skills can be configured to run in the same process, which reduces system memory usage.
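The grouping rule behind this configuration can be illustrated with a short plain-Java sketch: skills that declare the same `sharedUserId` and the same `process` value end up in one process, and a skill that leaves either value unset keeps a process of its own. The classes `SkillApp` and `ProcessPlanner` are hypothetical and only model the rule. Android additionally requires apps that share a `sharedUserId` to be signed with the same certificate, which this sketch does not model.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Minimal description of one skill application's manifest settings.
class SkillApp {
    final String packageName;
    final String sharedUserId; // android:sharedUserId, null if unset
    final String process;      // android:process, null if unset

    SkillApp(String packageName, String sharedUserId, String process) {
        this.packageName = packageName;
        this.sharedUserId = sharedUserId;
        this.process = process;
    }
}

class ProcessPlanner {
    // Groups skills by (sharedUserId, process). Each map entry stands for one
    // process and lists the packages that would run inside it.
    static Map<String, List<String>> group(List<SkillApp> apps) {
        Map<String, List<String>> groups = new LinkedHashMap<String, List<String>>();
        for (SkillApp app : apps) {
            String key;
            if (app.sharedUserId == null || app.process == null) {
                key = "own:" + app.packageName; // no sharing configured
            } else {
                key = app.sharedUserId + "|" + app.process;
            }
            List<String> members = groups.get(key);
            if (members == null) {
                members = new ArrayList<String>();
                groups.put(key, members);
            }
            members.add(app.packageName);
        }
        return groups;
    }
}
```

Fewer groups means fewer processes, and therefore fewer copies of the fixed per-process memory cost that the paragraph above describes.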

Claims (7)

1. A method for managing smart speaker voice skills based on the Android system, characterized in that it includes:
A. The speech engine and the voice skills are designed as independent applications;
B. A parent class service is defined for voice skills, and the parent class service defines a communication interface;
C. Each voice skill inherits the parent class service and implements the communication interface;
D. The speech engine parses the application configuration file of each voice skill to identify the business the voice skill is responsible for;
E. After speech recognition, the speech engine dispatches the voice instruction to the designated voice skill through the parent class service's communication interface, according to the specific business type.
2. The method for managing smart speaker voice skills based on the Android system as claimed in claim 1, characterized in that, in step b, defining a parent class service for voice skills, with the parent class service defining a communication interface, specifically includes: the cross-process communication interface of the parent class service is designed using Android's AIDL language; after binding a voice skill, the speech engine registers a cross-process callback interface, and the voice skill can then send messages to the speech engine by calling the methods of that callback interface; after speech recognition, the speech engine sends the voice instruction to the designated voice skill through the communication interface, and the voice skill executes the corresponding business logic once it receives the instruction.
3. The method for managing smart speaker voice skills based on the Android system as claimed in claim 1, characterized in that, in step c, the voice skill inheriting the voice skill parent class service and implementing the communication interface specifically includes:
A voice skill must be declared as a service in its AndroidManifest.xml; the declaration mainly includes: an Action used to bind the voice skill, whose value is the same for all voice skills and which is also the parameter the speech engine uses to find voice skills; and a Meta-data entry used to distinguish the voice skill type.
4. The method for managing smart speaker voice skills based on the Android system as claimed in claim 1, characterized in that, in step d, the speech engine parsing the application configuration file of each voice skill to identify the business the voice skill is responsible for specifically includes:
When the smart speaker starts, and after voice skills are updated, added, or removed, the speech engine obtains the voice skills and their types; voice skills are obtained through the Android PackageManager, where any service declared in an AndroidManifest.xml with the Action set to the agreed value is a voice skill; the voice skill type is determined by parsing the registration information of the voice skill, where the value of the Meta-data entry that represents the skill type is the type of the voice skill.
5. The method for managing smart speaker voice skills based on the Android system as claimed in claim 1, characterized in that, in step e, the speech engine dispatching the voice instruction to the designated voice skill through the parent class service's communication interface after speech recognition, according to the specific business type, specifically includes:
The speech recognition result obtained by the speech engine contains the recognized voice skill type, that is, the business type; the voice skill that can handle this business is matched from the voice skill list, at which point the voice skill is represented by its package name and class name; once the package name and class name are known, the speech engine binds the voice skill using them together with the Action; after a successful bind, the speech engine can send voice instructions to the voice skill through the communication interface, and the voice skill can send feedback messages to the speech engine through the communication interface.
6. The method for managing smart speaker voice skills based on the Android system as claimed in any one of claims 1 to 5, characterized in that step a also includes: configuring all voice skills to run in the same process.
7. The method for managing smart speaker voice skills based on the Android system as claimed in claim 6, characterized in that the method of configuring all voice skills in the same process includes:
Setting sharedUserId and process for each voice skill, so that voice skills with the same sharedUserId and process run in one process.
Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710896939.4A CN107657031A (en) 2017-09-28 2017-09-28 Method for managing smart speaker voice skills based on the Android system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710896939.4A CN107657031A (en) 2017-09-28 2017-09-28 Method for managing smart speaker voice skills based on the Android system

Publications (1)

Publication Number Publication Date
CN107657031A (en) 2018-02-02

Family

ID=61116912

Family Applications (1)

Application Number Status Publication Priority Date Filing Date Title
CN201710896939.4A Pending CN107657031A (en) 2017-09-28 2017-09-28 Method for managing smart speaker voice skills based on the Android system

Country Status (1)

Country Link
CN (1) CN107657031A (en)

Legal Events

Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication. Application publication date: 2018-02-02