CN112706177A - Voice-triggered robot expression system

Info

Publication number: CN112706177A
Application number: CN202011581680.2A
Authority: CN
Inventors: 孙文捷
Original assignee: Zhejiang Hozon New Energy Automobile Co Ltd
Current assignee: Zhejiang Hozon New Energy Automobile Co Ltd
Priority date: 2020-12-28
Filing date: 2020-12-28
Publication date: 2021-04-27

Abstract

The invention relates to the field of automotive artificial intelligence, and in particular to a voice-triggered robot expression system. The system comprises a car machine system, a robot system and a cloud platform. The car machine system receives and parses a voice input instruction, sends an expression action instruction to the robot system according to the parsed instruction content, receives robot expression update data sent by the cloud platform, and updates a local database of robot expressions. The robot system is connected to the car machine system and executes the corresponding expression action according to the expression action instruction sent by the car machine system. The cloud platform is connected to the car machine system and sends the robot expression update data to it. The invention enables emotional interaction between the automobile and the user, can update expressions to match the service content, and meets users' personalized customization requirements.

Description

Voice-triggered robot expression system

Technical Field

The invention relates to the field of automotive artificial intelligence, and in particular to a voice-triggered robot expression system.

Background

With the application of emerging technologies such as voice interaction, image recognition, smart devices, etc. in automobiles, automobiles are undergoing a transition from "travel tools" to "travel partners", providing more and more intelligent emotional services to users.

Users' expectations for the in-car interactive experience keep rising, and making vehicles more intelligent has become a research and development goal for every manufacturer. At this stage, the most mature solution for freeing the driver's hands and reducing distraction while driving is voice control.

At present, the human-machine interaction system of an automobile can only provide service content in response to the user's speech; it cannot trigger a suitable expression based on that speech. The robot's expressions are therefore monotonous, not intelligent enough, and convey no emotion.

Moreover, in current human-machine interaction systems the robot's expression data is stored locally in fixed form. It is therefore difficult to update the expression data in step with the automobile's service content, the system feels technologically dated, and users' personalized customization requirements are hard to meet.

Disclosure of Invention

The invention aims to provide a voice-triggered robot expression system that solves the problems of the prior art, in which robot expressions are monotonous, convey no emotion, and are difficult to update synchronously.

To achieve the above object, the present invention provides a voice-triggered robot expression system, which includes a car machine system, a robot system, and a cloud platform:

the car machine system receives and parses the voice input instruction, sends an expression action instruction to the robot system according to the parsed voice input instruction content, receives the robot expression update data sent by the cloud platform, and updates a local database of robot expressions;

the robot system is connected to the car machine system and executes the corresponding expression action according to the expression action instruction sent by the car machine system;

the cloud platform is connected to the car machine system and sends the robot expression update data to the car machine system.

In an embodiment, the car machine system includes a voice module and an expression calling module:

the voice module receives and parses a voice input instruction, and sends an expression calling instruction to the expression calling module according to a robot expression linkage rule, wherein the robot expression linkage rule is the mapping between voice input instruction content and robot expressions;

the expression calling module retrieves the image, audio and action information of the corresponding robot expression according to the expression calling instruction and sends it to the robot system.

In one embodiment, the robot expression update data sent by the cloud platform comprises update data for the image, audio and action information of robot expressions;

the voice module receives the update data for the image, audio and action information sent by the cloud platform and forwards it to the expression calling module;

the expression calling module updates the local database of image, audio and action information of robot expressions.

In one embodiment, the robot expression update data sent by the cloud platform comprises update data for the robot expression linkage rule;

the voice module receives the linkage rule update data sent by the cloud platform and updates the local database of robot expression linkage rules.

In one embodiment, the robot expressions include general expressions and content-type expressions:

for a general expression, the executed expression action is independent of the content of the voice input instruction;

for a content-type expression, the executed expression action corresponds to the content of the voice input instruction.

In an embodiment, the general expressions include non-awakening expressions:

a non-awakening expression is a basic expression action that the robot system plays in a loop while there is no voice input instruction.

In an embodiment, the general expressions include awakening expressions:

an awakening expression is an expression action that the robot system plays in a loop after a voice input instruction is received and before the content-type expression is executed.

In an embodiment, the cloud platform adds robot expression update data for the corresponding content-type expressions according to the updated service content.

In one embodiment, the content-type expressions include:

audio expression;

navigation expression;

weather expression;

communication expression;

photographing expression;

air conditioning control expression;

search list expression;

video expression;

news expression;

financial expression;

restriction expression;

welcome expression;

and cheering expression.

The voice-triggered robot expression system provided by the invention solves the problems that robot expressions are monotonous, convey no emotion, and merely accompany content. It enables emotional interaction between the automobile and the user, can update expressions to match the service content, meets users' personalized customization requirements, gives the automobile brand a uniform expression style, and fosters a sense of identification with the brand's values and appearance.

Drawings

The above and other features, properties and advantages of the present invention will become more apparent from the following description of the embodiments with reference to the accompanying drawings in which like reference numerals denote like features throughout the several views, wherein:

FIG. 1 is a functional block diagram of a voice-triggered robot expression system according to an embodiment of the present invention;

FIG. 2 is an update flow chart of the voice-triggered robot expression system according to an embodiment of the present invention.

The meanings of the reference symbols in the figures are as follows:

100 car machine system;

101 voice module;

102 expression calling module;

200 robot system;

300 cloud platform.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

FIG. 1 is a functional block diagram of a voice-triggered robot expression system according to an embodiment of the present invention. The voice-triggered robot expression system shown in FIG. 1 includes a car machine system 100, a robot system 200, and a cloud platform 300:

the car machine system 100 receives and analyzes the voice input instruction, sends an expression action instruction to the robot system 200 according to the analyzed voice input instruction content, receives the update data of the robot expression sent by the cloud platform 300, and updates the local database of the robot expression;

the robot system 200 is connected with the car machine system 100, and executes corresponding expression actions according to the expression action instructions sent by the car machine system 100;

the cloud platform 300 is connected to the car machine system 100, and sends the update data of the robot expression to the car machine system 100.

While driving the vehicle, the user triggers the robot system to execute expression actions through voice input.
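The division of responsibilities among the three components can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation described by the patent: the class names, the `handle_voice`, `pull_update` and `send_update` methods, the keyword-matching "analysis", and the dictionary layout of the local database are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class RobotSystem:
    """Executes the expression actions it is told to perform (hypothetical model)."""
    executed: list = field(default_factory=list)

    def execute(self, expression: str) -> None:
        self.executed.append(expression)


@dataclass
class CloudPlatform:
    """Holds expression update data for the car machine system (hypothetical model)."""
    pending_update: dict = field(default_factory=dict)

    def send_update(self) -> dict:
        # Hand over the pending update and clear it on the cloud side.
        update, self.pending_update = self.pending_update, {}
        return update


@dataclass
class CarMachineSystem:
    """Parses voice input, drives the robot, and keeps a local expression database."""
    robot: RobotSystem
    cloud: CloudPlatform
    # Assumed layout: keyword in the voice instruction -> expression name.
    local_db: dict = field(default_factory=dict)

    def handle_voice(self, text: str) -> None:
        # "Analysis" is reduced to keyword matching for this sketch.
        for keyword, expression in self.local_db.items():
            if keyword in text.lower():
                self.robot.execute(expression)
                return

    def pull_update(self) -> None:
        # Merge the cloud's update data into the local database.
        self.local_db.update(self.cloud.send_update())


robot = RobotSystem()
cloud = CloudPlatform(pending_update={"weather": "weather_expression"})
car = CarMachineSystem(robot, cloud, local_db={"navigate": "navigation_expression"})

car.handle_voice("Navigate to the office")
car.pull_update()
car.handle_voice("What is the weather today?")
print(robot.executed)  # ['navigation_expression', 'weather_expression']
```

The weather request is only recognized after `pull_update` has merged the cloud data, which mirrors the idea that new expressions reach the vehicle through the cloud platform rather than being fixed locally.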

Furthermore, the car machine system 100 includes a voice module 101 and an expression calling module 102:

the voice module 101 receives and parses a voice input instruction to determine what the user said, and sends an expression calling instruction to the expression calling module 102 according to a robot expression linkage rule, wherein the robot expression linkage rule is the mapping between voice input instruction content and robot expressions;

the expression calling module 102 retrieves the image, audio and action information of the corresponding robot expression according to the expression calling instruction, sends it to the robot system 200, and notifies the robot system 200 to play the expression that matches the content.
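The interaction between the two modules can be sketched as a rule lookup followed by an asset lookup. The sketch below is illustrative only: `ExpressionAsset`, its three fields, the method names, and the substring-based `parse` step are assumptions introduced here, and real speech analysis would be far more involved.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class ExpressionAsset:
    """Image, audio and action information for one robot expression (assumed fields)."""
    image: str
    audio: str
    action: str


class ExpressionCallingModule:
    def __init__(self, assets: dict[str, ExpressionAsset],
                 send_to_robot: Callable[[ExpressionAsset], None]) -> None:
        self._assets = assets              # local database of expression assets
        self._send_to_robot = send_to_robot

    def call(self, expression_id: str) -> None:
        # Look up the asset for the requested expression and forward it to the robot.
        self._send_to_robot(self._assets[expression_id])


class VoiceModule:
    def __init__(self, linkage_rules: dict[str, str],
                 caller: ExpressionCallingModule) -> None:
        # Linkage rule: instruction content (a keyword here) -> expression id.
        self._rules = linkage_rules
        self._caller = caller

    def parse(self, utterance: str) -> Optional[str]:
        # Placeholder analysis: the first rule key found in the utterance wins.
        lowered = utterance.lower()
        return next((key for key in self._rules if key in lowered), None)

    def on_voice_input(self, utterance: str) -> bool:
        key = self.parse(utterance)
        if key is None:
            return False                   # no rule matched; nothing is sent
        self._caller.call(self._rules[key])
        return True


played: list[ExpressionAsset] = []
caller = ExpressionCallingModule(
    {"news_expression": ExpressionAsset("news.png", "news.wav", "nod")},
    played.append,                         # stands in for the robot system
)
voice = VoiceModule({"news": "news_expression"}, caller)

voice.on_voice_input("Play the latest news")
print(played[0].action)  # nod
```

Keeping the linkage rules in the voice module and the assets in the expression calling module means either table can be replaced without touching the other.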

The expression actions performed by the robot system 200 include general expressions and content-type expressions.

For a general expression, the executed expression action is independent of the content of the voice input instruction.

General expressions include non-awakening expressions and awakening expressions.

A non-awakening expression is a basic expression action that the robot system plays in a loop while there is no voice input instruction.

An awakening expression is a transitional expression action that the robot system plays in a loop after a voice input instruction is received and before the content-type expression is executed.

For a content-type expression, the executed expression action corresponds to the content of the voice input instruction.
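The order in which these expression kinds are shown can be read as a small state machine. The following sketch is one possible reading, not the patent's design: the class and method names and the asset names `idle_loop` and `wake_loop` are hypothetical, and the return to the non-awakening expression after a content-type expression finishes is an assumption the text does not state.

```python
from enum import Enum, auto


class ExpressionState(Enum):
    NON_AWAKENING = auto()   # general expression, looped while idle
    AWAKENING = auto()       # general expression, looped while the instruction is handled
    CONTENT = auto()         # expression tied to the instruction content


class ExpressionPlayer:
    """Tracks which kind of expression the robot should currently show."""

    def __init__(self) -> None:
        self.state = ExpressionState.NON_AWAKENING
        self.current = "idle_loop"        # hypothetical asset name

    def on_voice_received(self) -> None:
        # A voice instruction arrived: show the transitional expression.
        self.state = ExpressionState.AWAKENING
        self.current = "wake_loop"        # hypothetical asset name

    def on_content_resolved(self, expression: str) -> None:
        # Content-type expressions are only entered from the awakening state.
        if self.state is not ExpressionState.AWAKENING:
            raise RuntimeError("no voice instruction is being processed")
        self.state = ExpressionState.CONTENT
        self.current = expression

    def on_content_finished(self) -> None:
        # Assumption of this sketch: fall back to the idle loop afterwards.
        self.state = ExpressionState.NON_AWAKENING
        self.current = "idle_loop"


player = ExpressionPlayer()
history = [player.current]
player.on_voice_received()
history.append(player.current)
player.on_content_resolved("weather_expression")
history.append(player.current)
player.on_content_finished()
history.append(player.current)
print(history)  # ['idle_loop', 'wake_loop', 'weather_expression', 'idle_loop']
```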

Content-type expressions include, but are not limited to:

- audio expression;

- navigation expression;

- weather expression;

- communication expression;

- photographing expression;

- air conditioning control expression;

- search list expression;

- video expression;

- news expression;

- financial expression;

- restriction expression;

- welcome expression;

- cheering expression.

The set of content-type expressions can be extended as the service content is updated.

When new service content is added, the cloud platform 300 adds robot expression update data for the corresponding content-type expression and sends the update data to the car machine system 100 to update the local database.

The robot expression update data that the cloud platform 300 sends to the car machine system 100 includes update data for the image, audio and action information of robot expressions and update data for the robot expression linkage rule.

The image, audio and action information of robot expressions can be updated in real time from the cloud service of the vehicle manufacturer:

the voice module 101 receives the update data for the image, audio and action information sent by the cloud platform 300 and forwards it to the expression calling module 102;

the expression calling module 102 updates the local database of image, audio and action information of robot expressions.

The robot expression linkage rule is the mapping between voice input instruction content and robot expressions, and can likewise be updated in real time through the cloud service of the vehicle manufacturer:

the voice module 101 receives the linkage rule update data sent by the cloud platform 300 and updates the local database of robot expression linkage rules.
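The two update paths described above can be sketched as a single dispatch in the voice module. This is an illustrative sketch only: the payload keys `assets` and `linkage_rules`, the method names, and the dictionary layout of both local databases are assumptions, since the text does not define the update data format.

```python
from dataclasses import dataclass, field


@dataclass
class ExpressionCallingModule:
    # Assumed layout: expression id -> image/audio/action information.
    assets: dict = field(default_factory=dict)

    def update_assets(self, new_assets: dict) -> None:
        self.assets.update(new_assets)


@dataclass
class VoiceModule:
    caller: ExpressionCallingModule
    # Assumed layout: instruction content (keyword) -> expression id.
    linkage_rules: dict = field(default_factory=dict)

    def on_cloud_update(self, update: dict) -> None:
        # Asset updates are forwarded; the expression calling module owns that database.
        if "assets" in update:
            self.caller.update_assets(update["assets"])
        # Linkage rule updates are applied by the voice module itself.
        if "linkage_rules" in update:
            self.linkage_rules.update(update["linkage_rules"])


caller = ExpressionCallingModule(assets={"news_expression": {"image": "news_v1.png"}})
voice = VoiceModule(caller, linkage_rules={"news": "news_expression"})

# Hypothetical payload adding a content-type expression for a new service.
voice.on_cloud_update({
    "assets": {"finance_expression": {"image": "finance_v1.png"}},
    "linkage_rules": {"stock": "finance_expression"},
})
print(sorted(caller.assets))         # ['finance_expression', 'news_expression']
print(voice.linkage_rules["stock"])  # finance_expression
```

Shipping the asset and its linkage rule in the same payload keeps the two databases consistent: a rule never points at an expression whose assets are missing.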

FIG. 2 is an update flow chart of the voice-triggered robot expression system according to an embodiment of the present invention. The robot expression update flow shown in FIG. 2 includes the following steps:

S1, the car machine system 100 starts;

S2, the upgrade module in the car machine system 100 automatically sends an update request to the cloud platform 300;

S3, the cloud platform 300 detects that new robot expression update data exists, where the robot expression update data includes update data for the image, audio and action information of robot expressions and update data for the robot expression linkage rule;

S4, the car machine system 100 downloads the robot expression update data from the cloud platform 300;

S5, the car machine system 100 applies the robot expression update data to its local database.
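Steps S1 to S5 can be sketched as a start-up routine. The sketch is a hedged illustration: the integer version number used to decide whether new data exists, the method names, and the log strings are assumptions, because the text does not say how the cloud platform detects new update data.

```python
from dataclasses import dataclass, field


@dataclass
class CloudPlatform:
    version: int      # assumed versioning scheme for the expression data
    payload: dict

    def has_update(self, client_version: int) -> bool:
        # S3: the cloud decides whether newer expression data exists.
        return self.version > client_version

    def download(self) -> tuple:
        return self.version, dict(self.payload)


@dataclass
class CarMachineSystem:
    cloud: CloudPlatform
    version: int = 0
    local_db: dict = field(default_factory=dict)
    log: list = field(default_factory=list)

    def start(self) -> None:
        self.log.append("S1 start")
        self.log.append("S2 request update")   # sent automatically at start-up
        if not self.cloud.has_update(self.version):
            self.log.append("S3 no new data")
            return
        self.log.append("S3 new data detected")
        new_version, payload = self.cloud.download()
        self.log.append("S4 downloaded")
        self.local_db.update(payload)
        self.version = new_version
        self.log.append("S5 local database updated")


cloud = CloudPlatform(version=2, payload={"stock": "finance_expression"})
car = CarMachineSystem(cloud)
car.start()   # first start: the update is downloaded and applied
car.start()   # second start: already up to date, nothing is downloaded
print(car.version, car.local_db)  # 2 {'stock': 'finance_expression'}
print(car.log[-1])                # S3 no new data
```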

Those of skill in the art would understand that information, signals, and data may be represented using any of a variety of different technologies and techniques. For example, data, instructions, commands, information, signals, bits (bits), symbols, and chips that may be referenced throughout the above description may be represented by voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields or particles, or any combination thereof.

Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.

The various illustrative logical blocks, modules, and circuits described in connection with the embodiments disclosed herein may be implemented or performed with a general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.

The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such that the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a user terminal. In the alternative, the processor and the storage medium may reside as discrete components in a user terminal.

In one or more exemplary embodiments, the functions described may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software as a computer program product, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer. Any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a web site, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk (disk) and disc (disc), as used herein, includes Compact Disc (CD), laser disc, optical disc, Digital Versatile Disc (DVD), floppy disk and blu-ray disc where disks (disks) usually reproduce data magnetically, while discs (discs) reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.

While, for purposes of simplicity of explanation, the methodologies are shown and described as a series of acts, it is to be understood and appreciated that the methodologies are not limited by the order of acts, as some acts may, in accordance with one or more embodiments, occur in different orders and/or concurrently with other acts from that shown and described herein or not shown and described herein, as would be understood by one skilled in the art.

As used in this application and the appended claims, the terms "a," "an," and/or "the" do not refer exclusively to the singular and may also include the plural, unless the context clearly dictates otherwise. In general, the terms "comprises" and "comprising" merely indicate that the explicitly identified steps and elements are included; these steps and elements do not form an exclusive list, and a method or apparatus may include other steps or elements.

The embodiments described above are provided to enable persons skilled in the art to make or use the invention. Persons skilled in the art may modify or vary these embodiments without departing from the inventive concept of the present invention. The scope of protection of the present invention is therefore not limited by the embodiments described above but should be accorded the widest scope consistent with the innovative features set forth in the claims.

Claims

1. A voice-triggered robot expression system, characterized by comprising a car machine system, a robot system and a cloud platform:

2. The voice-triggered robot expression system of claim 1, wherein the car machine system comprises a voice module and an expression calling module:

the voice module receives and parses a voice input instruction, and sends an expression calling instruction to the expression calling module according to a robot expression linkage rule, wherein the robot expression linkage rule is the mapping between voice input instruction content and robot expressions;

3. The voice-triggered robot expression system of claim 2, wherein:

the robot expression update data sent by the cloud platform comprises update data for the image, audio and action information of robot expressions;

4. The voice-triggered robot expression system of claim 2, wherein:

the robot expression update data sent by the cloud platform comprises update data for the robot expression linkage rule;

5. The voice-triggered robot expression system of claim 2, wherein the robot expressions comprise general expressions and content-type expressions:

6. The voice-triggered robot expression system of claim 5, wherein the general expressions comprise non-awakening expressions:

7. The voice-triggered robot expression system of claim 5, wherein the general expressions comprise awakening expressions:

8. The voice-triggered robot expression system of claim 5, wherein the cloud platform adds robot expression update data for the corresponding content-type expressions according to updated service content.

9. The voice-triggered robot expression system of claim 8, wherein the content-type expressions comprise: