CN112706177A - Voice-triggered robot expression system - Google Patents

Voice-triggered robot expression system Download PDF

Info

Publication number
CN112706177A
CN112706177A CN202011581680.2A CN202011581680A CN112706177A CN 112706177 A CN112706177 A CN 112706177A CN 202011581680 A CN202011581680 A CN 202011581680A CN 112706177 A CN112706177 A CN 112706177A
Authority
CN
China
Prior art keywords
expression
robot
voice
expressions
content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011581680.2A
Other languages
Chinese (zh)
Inventor
孙文捷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Hozon New Energy Automobile Co Ltd
Original Assignee
Zhejiang Hozon New Energy Automobile Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Hozon New Energy Automobile Co Ltd filed Critical Zhejiang Hozon New Energy Automobile Co Ltd
Priority to CN202011581680.2A priority Critical patent/CN112706177A/en
Publication of CN112706177A publication Critical patent/CN112706177A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J11/00Manipulators not otherwise provided for
    • B25J11/0005Manipulators having means for high-level communication with users, e.g. speech generator, face recognition means
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • General Health & Medical Sciences (AREA)
  • Robotics (AREA)
  • Mechanical Engineering (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention relates to the field of automobile artificial intelligence, in particular to a robot expression system triggered by voice. The voice-triggered robot expression system provided by the invention comprises a vehicle machine system, a robot system and a cloud platform: the car machine system receives and analyzes the voice input instruction, sends an expression action instruction to the robot system according to the analyzed voice input instruction content, receives the update data of the robot expression sent by the cloud platform, and updates a local database of the robot expression; the robot system is connected with the vehicle machine system and executes corresponding expression actions according to the expression action instructions sent by the vehicle machine system; the cloud platform is connected with the vehicle-mounted machine system and sends the updated data of the robot expression to the vehicle-mounted machine system. The invention realizes the emotional interaction between the automobile and the user, can update the corresponding expression according to the service content, and meets the personalized customization requirement of the user.

Description

Voice-triggered robot expression system
Technical Field
The invention relates to the field of automobile artificial intelligence, in particular to a robot expression system triggered by voice.
Background
With the application of emerging technologies such as voice interaction, image recognition, smart devices, etc. in automobiles, automobiles are undergoing a transition from "travel tools" to "travel partners", providing more and more intelligent emotional services to users.
The interactive experience of the user on the automobile is higher and higher, and the intelligent requirement of the automobile becomes the research and development target of each enterprise. At this stage, the most sophisticated solution to free hands from manipulation and reduce distraction during driving is voice control.
At present, a man-machine interaction system of an automobile can only provide service content according to voice communication of a user, but cannot trigger reasonable expressions according to the voice communication of the user, and the robot has single expression, is not intelligent enough and does not have feelings.
Meanwhile, in the man-machine interaction system at the present stage, the expression data of the robot is locally and fixedly stored, so that the expression data is difficult to be synchronously updated with the automobile service content, the technological sense is not strong, and the personalized customization requirements of users are difficult to meet.
Disclosure of Invention
The invention aims to provide a voice-triggered robot expression system, which solves the problems that the robot expression in the prior art is too single, does not have emotion and is difficult to update synchronously.
In order to achieve the above object, the present invention provides a speech-triggered robot expression system, which includes a car machine system, a robot system, and a cloud platform:
the car machine system receives and analyzes the voice input instruction, sends an expression action instruction to the robot system according to the analyzed voice input instruction content, receives the update data of the robot expression sent by the cloud platform, and updates a local database of the robot expression;
the robot system is connected with the vehicle machine system and executes corresponding expression actions according to the expression action instructions sent by the vehicle machine system;
the cloud platform is connected with the vehicle-mounted machine system and sends the updated data of the robot expression to the vehicle-mounted machine system.
In an embodiment, the car machine system includes a voice module and an expression calling module:
the voice module receives and analyzes a voice input instruction, and sends an expression calling instruction to the expression calling module according to a robot expression linkage rule, wherein the robot expression linkage rule refers to a linkage corresponding relation between voice input instruction content and robot expressions;
and the expression calling module calls image audio action information of corresponding robot expressions according to the expression calling instruction and sends the image audio action information to the robot system.
In one embodiment, the update data of the robot expression sent by the cloud platform comprises update data of image audio action information of the robot expression;
the voice module receives the update data of the image audio action of the robot expression sent by the cloud platform and then sends the update data to the expression calling module;
and the expression calling module is used for updating a local database of the image audio action information of the robot expression.
In one embodiment, the update data of the robot expression sent by the cloud platform comprises update data of a robot expression linkage rule;
the voice module receives the updating data of the robot expression linkage rules sent by the cloud platform and updates the local database of the robot expression linkage rules.
In one embodiment, the robotic expressions include generic and content-like expressions:
the general expression, the executed expression action is irrelevant to the content of the voice input instruction;
and the content type expression, the executed expression action and the voice input instruction content correspond to each other.
In an embodiment, the generic expressions include non-awakening expressions:
the non-awakening expression refers to a basic expression action executed by the robot system in a circulating playing mode when no voice input instruction exists.
In an embodiment, the generic expressions include wake expressions:
the awakening expression is an expression action executed by the robot system in a circulating playing mode after the voice input instruction is received and before the content expression is executed.
In an embodiment, the cloud platform increases the update data of the robot expression corresponding to the content-class expression according to the updated service content.
In one embodiment, the content-like expression includes:
audio expression;
navigating the expression;
weather expressions;
communication expression;
photographing an expression;
controlling the expression by an air conditioner;
searching for a list expression;
video expression;
a news expression;
financial expressions;
restricting expression;
welcome expressions;
and (5) cheering the expression.
The voice-triggered robot expression system provided by the invention solves the problems that the robot expression is single, does not have emotion and only provides content, realizes emotional interaction between an automobile and a user, can update corresponding expression according to service content, meets the personalized customization requirement of the user, realizes the uniform expression style of automobile brands, and generates the sense of identity of value and appearance.
Drawings
The above and other features, properties and advantages of the present invention will become more apparent from the following description of the embodiments with reference to the accompanying drawings in which like reference numerals denote like features throughout the several views, wherein:
FIG. 1 discloses a functional block diagram of a speech triggered robot expression system according to an embodiment of the present invention;
fig. 2 discloses an updating flow chart of the speech-triggered robot expression system according to an embodiment of the invention.
The meanings of the reference symbols in the figures are as follows:
100 a vehicle machine system;
101 a voice module;
102 expression calling module;
200 a robotic system;
300 cloud platform.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Fig. 1 discloses a schematic block diagram of a speech-triggered robot expression system according to an embodiment of the present invention, and the speech-triggered robot expression system shown in fig. 1 includes a car machine system 100, a robot system 200, and a cloud platform 300:
the car machine system 100 receives and analyzes the voice input instruction, sends an expression action instruction to the robot system 200 according to the analyzed voice input instruction content, receives the update data of the robot expression sent by the cloud platform 300, and updates the local database of the robot expression;
the robot system 200 is connected with the car machine system 100, and executes corresponding expression actions according to the expression action instructions sent by the car machine system 100;
the cloud platform 300 is connected to the car machine system 100, and sends the update data of the robot expression to the car machine system 100.
When the user drives the vehicle, the robot system is triggered to execute the expression action through voice input.
Furthermore, the car machine system 100 includes a voice module 101 and an expression calling module 102:
the voice module 101 receives and analyzes a voice input instruction, analyzes the speaking content of a user, and sends an expression calling instruction to the expression calling module 102 according to a robot expression linkage rule, wherein the robot expression linkage rule refers to a corresponding linkage relation between the voice input instruction content and a robot expression;
the expression calling module 102 calls image audio action information of corresponding robot expressions according to the expression calling instruction, sends the image audio action information to the robot system 200, and notifies the robot system 200 to play the expressions with different contents.
The expressive actions performed by the robotic system 200 include generic expressions and content-like expressions.
The general expression is similar to the expression, and the executed expression action is independent of the content of the voice input instruction.
And general expressions, including non-awakening expressions and awakening expressions.
The non-awakening expression refers to a basic expression action executed by the robot system in a circulating manner when no voice input instruction exists.
And the awakening expression is a transitional expression action which is executed by the robot system in a circulating way after the voice input instruction is received and before the content expression is executed.
And content type expressions, wherein the executed expression actions correspond to the contents of the voice input instructions.
Content-like expressions, including but not limited to:
-an audio expression;
-a navigational expression;
-weather expressions;
-a communication expression;
-a photographic expression;
-air conditioning control expressions;
-search list emotions;
-a video expression;
-a news emoticon;
-a financial expression;
-a restricted expression;
-a welcome expression;
-cheering an expression.
Obviously, the content expression can be increased according to the updated service content.
When new service content is added, the cloud platform 300 adds the update data of the robot expression corresponding to the content-like expression, and sends the update data to the in-vehicle system 100 to update the local database.
The cloud platform 300 sends the robot expression update data to the car machine system 100, including the robot expression image audio action information and the robot expression linkage rule update data.
The image audio frequency action information of robot expression can be updated from the cloud service of host computer factory in real time:
the voice module 101 receives the update data of the image audio action of the robot expression sent by the cloud platform 300 and sends the update data to the expression calling module 102;
and the expression calling module 102 is used for updating the local database of the image audio action information of the robot expression.
The robot expression linkage rule is a corresponding linkage relation between voice input instruction content and robot expressions, and can be updated in real time through cloud service of a host factory:
the voice module 101 receives the update data of the robot expression linkage rule sent by the cloud platform 300, and updates the local database of the robot expression linkage rule.
Fig. 2 discloses an update flow chart of a speech-triggered robot expression system according to an embodiment of the present invention, where the robot expression update flow shown in fig. 2 further includes the following steps:
s1, starting the car machine system 100;
s2, the upgrade module in the in-vehicle system 100 automatically sends an update request to the cloud platform 300;
s3, the cloud platform 300 detects that the new robot expression update data exists, where the robot expression update data includes robot expression image audio motion information and robot expression linkage rule update data.
S4, the car machine system 100 downloads the updated data of the robot expression from the cloud platform 300;
and S5, the car machine system 100 updates the update data of the robot expression.
The voice-triggered robot expression system provided by the invention solves the problems that the robot expression is single, does not have emotion and only provides content, realizes emotional interaction between an automobile and a user, can update corresponding expression according to service content, meets the personalized customization requirement of the user, realizes the uniform expression style of automobile brands, and generates the sense of identity of value and appearance.
Those of skill in the art would understand that information, signals, and data may be represented using any of a variety of different technologies and techniques. For example, data, instructions, commands, information, signals, bits (bits), symbols, and chips that may be referenced throughout the above description may be represented by voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields or particles, or any combination thereof.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
The various illustrative logical modules, and circuits described in connection with the embodiments disclosed herein may be implemented or performed with a general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a user terminal. In the alternative, the processor and the storage medium may reside as discrete components in a user terminal.
In one or more exemplary embodiments, the functions described may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software as a computer program product, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer. Any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a web site, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk (disk) and disc (disc), as used herein, includes Compact Disc (CD), laser disc, optical disc, Digital Versatile Disc (DVD), floppy disk and blu-ray disc where disks (disks) usually reproduce data magnetically, while discs (discs) reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
While, for purposes of simplicity of explanation, the methodologies are shown and described as a series of acts, it is to be understood and appreciated that the methodologies are not limited by the order of acts, as some acts may, in accordance with one or more embodiments, occur in different orders and/or concurrently with other acts from that shown and described herein or not shown and described herein, as would be understood by one skilled in the art.
As used in this application and the appended claims, the terms "a," "an," "the," and/or "the" are not intended to be inclusive in the singular, but rather are intended to be inclusive in the plural unless the context clearly dictates otherwise. In general, the terms "comprises" and "comprising" merely indicate that steps and elements are included which are explicitly identified, that the steps and elements do not form an exclusive list, and that a method or apparatus may include other steps or elements.
The embodiments described above are provided to enable persons skilled in the art to make or use the invention and that modifications or variations can be made to the embodiments described above by persons skilled in the art without departing from the inventive concept of the present invention, so that the scope of protection of the present invention is not limited by the embodiments described above but should be accorded the widest scope consistent with the innovative features set forth in the claims.

Claims (9)

1. The utility model provides a pronunciation trigger robot expression system which characterized in that, includes car machine system, robot system and cloud platform:
the car machine system receives and analyzes the voice input instruction, sends an expression action instruction to the robot system according to the analyzed voice input instruction content, receives the update data of the robot expression sent by the cloud platform, and updates a local database of the robot expression;
the robot system is connected with the vehicle machine system and executes corresponding expression actions according to the expression action instructions sent by the vehicle machine system;
the cloud platform is connected with the vehicle-mounted machine system and sends the updated data of the robot expression to the vehicle-mounted machine system.
2. The voice-triggered robot expression system of claim 1, wherein the car machine system comprises a voice module and an expression calling module:
the voice module receives and analyzes a voice input instruction, and sends an expression calling instruction to the expression calling module according to a robot expression linkage rule, wherein the robot expression linkage rule refers to a corresponding linkage relation between voice input instruction content and robot expressions;
and the expression calling module calls image audio action information of corresponding robot expressions according to the expression calling instruction and sends the image audio action information to the robot system.
3. The voice-triggered robotic expression system of claim 2, wherein:
the updating data of the robot expression sent by the cloud platform comprise updating data of image audio action information of the robot expression;
the voice module receives the update data of the image audio action of the robot expression sent by the cloud platform and then sends the update data to the expression calling module;
and the expression calling module is used for updating a local database of the image audio action information of the robot expression.
4. The voice-triggered robotic expression system of claim 2, wherein:
the updating data of the robot expression sent by the cloud platform comprise updating data of a robot expression linkage rule;
the voice module receives the updating data of the robot expression linkage rules sent by the cloud platform and updates the local database of the robot expression linkage rules.
5. The voice-triggered robotic expression system of claim 2, wherein the robotic expressions comprise generic and content-like expressions:
the general expression, the executed expression action is irrelevant to the content of the voice input instruction;
and the content type expression, the executed expression action and the voice input instruction content correspond to each other.
6. The voice-triggered robotic expression system of claim 5, wherein the generic expressions comprise non-awakened expressions:
the non-awakening expression refers to a basic expression action executed by the robot system in a circulating playing mode when no voice input instruction exists.
7. The voice-triggered robotic expression system of claim 5, wherein the generic expressions comprise wake-like expressions:
the awakening expression is an expression action executed by the robot system in a circulating playing mode after the voice input instruction is received and before the content expression is executed.
8. The voice-triggered robot expression system of claim 5, wherein the cloud platform adds update data of robot expressions corresponding to content-like expressions according to updated service content.
9. The voice-triggered robotic expression system of claim 8, wherein the content-class expressions comprise:
audio expression;
navigating the expression;
weather expressions;
communication expression;
photographing an expression;
controlling the expression by an air conditioner;
searching for a list expression;
video expression;
a news expression;
financial expressions;
restricting expression;
welcome expressions;
and (5) cheering the expression.
CN202011581680.2A 2020-12-28 2020-12-28 Voice-triggered robot expression system Pending CN112706177A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011581680.2A CN112706177A (en) 2020-12-28 2020-12-28 Voice-triggered robot expression system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011581680.2A CN112706177A (en) 2020-12-28 2020-12-28 Voice-triggered robot expression system

Publications (1)

Publication Number Publication Date
CN112706177A true CN112706177A (en) 2021-04-27

Family

ID=75545905

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011581680.2A Pending CN112706177A (en) 2020-12-28 2020-12-28 Voice-triggered robot expression system

Country Status (1)

Country Link
CN (1) CN112706177A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180366121A1 (en) * 2017-06-14 2018-12-20 Toyota Jidosha Kabushiki Kaisha Communication device, communication robot and computer-readable storage medium
CN110834338A (en) * 2019-11-04 2020-02-25 深圳勇艺达机器人有限公司 Vehicle-mounted robot and control method thereof
CN110871446A (en) * 2018-08-31 2020-03-10 比亚迪股份有限公司 Vehicle-mounted robot, control method and system thereof, vehicle and storage medium
CN111124123A (en) * 2019-12-24 2020-05-08 苏州思必驰信息科技有限公司 Voice interaction method and device based on virtual robot image and intelligent control system of vehicle-mounted equipment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180366121A1 (en) * 2017-06-14 2018-12-20 Toyota Jidosha Kabushiki Kaisha Communication device, communication robot and computer-readable storage medium
CN110871446A (en) * 2018-08-31 2020-03-10 比亚迪股份有限公司 Vehicle-mounted robot, control method and system thereof, vehicle and storage medium
CN110834338A (en) * 2019-11-04 2020-02-25 深圳勇艺达机器人有限公司 Vehicle-mounted robot and control method thereof
CN111124123A (en) * 2019-12-24 2020-05-08 苏州思必驰信息科技有限公司 Voice interaction method and device based on virtual robot image and intelligent control system of vehicle-mounted equipment

Similar Documents

Publication Publication Date Title
CN108284840B (en) Autonomous vehicle control system and method incorporating occupant preferences
CN108520743B (en) Voice control method of intelligent device, intelligent device and computer readable medium
US20200042285A1 (en) Acoustic control system, apparatus and method
US9466314B2 (en) Method for controlling functional devices in a vehicle during voice command operation
CN108062212A (en) A kind of voice operating method and device based on scene
CN109671435B (en) Method and apparatus for waking up smart device
US20070239443A1 (en) Device Control Device and Device Control Method
CN111768759A (en) Method and apparatus for generating information
JP7239366B2 (en) AGENT DEVICE, CONTROL METHOD OF AGENT DEVICE, AND PROGRAM
CN111508478A (en) Speech recognition method and device
US20240070213A1 (en) Vehicle driving policy recommendation method and apparatus
CN106372203A (en) Information response method and device for smart terminal and smart terminal
CN112706177A (en) Voice-triggered robot expression system
US20150088489A1 (en) Systems and methods for providing man-machine communications with etiquette
CN110450789B (en) Information processing method and device
WO2021068247A1 (en) Neural network scheduling method and apparatus, computer device, and readable storage medium
EP4369185A1 (en) Execution instruction determination method and apparatus, device, and storage medium
CN111050105A (en) Video playing method and device, toy robot and readable storage medium
EP4369186A1 (en) Control method and apparatus, device, and storage medium
CN115509572A (en) Method for dynamically configuring business logic, cloud platform, vehicle and storage medium
CN113709954A (en) Atmosphere lamp control method and device, electronic equipment and storage medium
US7822614B2 (en) Device control, speech recognition device, agent device, control method
CN117565808A (en) Vehicle-mounted man-machine interaction method, system, medium and device based on singing
CN117995191A (en) Voice instruction recommendation method, system and computer readable medium
CN116353523A (en) Vehicle information processing method and device, storage medium and electronic device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20210427