CN115953996A

CN115953996A - A method and device for generating natural language based on in-vehicle user information

Info

Publication number: CN115953996A
Application number: CN202211543220.XA
Authority: CN
Inventors: 李龙飞; 刘杰; 张炜玮; 林孟超; 陈彩可
Original assignee: FAW Group Corp
Current assignee: FAW Group Corp
Priority date: 2022-12-02
Filing date: 2022-12-02
Publication date: 2023-04-11

Abstract

The present application discloses a method and device for generating natural language based on in-vehicle user information. The method for generating natural language based on in-vehicle user information includes: acquiring the voice information of the occupants in the vehicle; acquiring the basic information of the occupants in the vehicle; The voice information of the people in the vehicle obtains the template information to be played; the natural language information to be played is generated according to the template information to be played and the slot information to be played. The method for generating natural language based on in-vehicle user information provided by this application obtains the slot information to be played according to the basic information of the in-vehicle personnel, thereby generating different natural voice information to be played according to different basic information of the in-vehicle personnel, so that Voice interaction is more user-friendly.

Description

A method and device for generating natural language based on in-vehicle user information

技术领域technical field

本申请涉及车辆人机交互技术领域，尤其涉及一种基于车内用户信息生成自然语言的方法以及基于车内用户信息生成自然语言的装置。The present application relates to the technical field of vehicle human-computer interaction, and in particular to a method for generating natural language based on in-vehicle user information and a device for generating natural language based on in-vehicle user information.

背景技术Background technique

目前一般使用自然语言生成仅考虑了语音交互发起人来进行自然语言生成。但是在车辆的使用场景下，存在多个用户同时在一个座舱内使用的情况。现有技术无法实现语音交互的过程中考虑多个用户的具体情况，根据不同用户的具体情况进行自然语言生成的问题。At present, the general use of natural language generation only considers the voice interaction initiator for natural language generation. However, in the vehicle usage scenario, there are situations where multiple users use the same cockpit at the same time. The existing technology cannot realize the problem of considering the specific conditions of multiple users in the process of voice interaction, and performing natural language generation according to the specific conditions of different users.

因此，希望有一种技术方案来解决或至少减轻现有技术的上述不足。Therefore, it is desirable to have a technical solution to solve or at least alleviate the above-mentioned deficiencies of the prior art.

发明内容Contents of the invention

本发明的目的在于提供一种基于车内用户信息生成自然语言的方法来至少解决上述的一个技术问题。The purpose of the present invention is to provide a method for generating natural language based on in-vehicle user information to at least solve one of the above technical problems.

本发明提供了下述方案：The present invention provides following scheme:

根据本发明的一个方面，提供一种基于车内用户信息生成自然语言的方法，所述基于车内用户信息生成自然语言的方法包括：According to one aspect of the present invention, a method for generating natural language based on in-vehicle user information is provided, and the method for generating natural language based on in-vehicle user information includes:

获取车内人员语音信息；Obtain the voice information of the people in the car;

获取车内人员基本信息；Get the basic information of the people in the car;

根据车内人员语音信息获取待播放槽位信息；Obtain the slot information to be played according to the voice information of the personnel in the vehicle;

根据所述车内人员基本信息以及车内人员语音信息获取待播放模板信息；Obtain template information to be played according to the basic information of the personnel in the vehicle and the voice information of the personnel in the vehicle;

根据所述待播放模板信息与所述待播放槽位信息生成待播放自然语言信息。Generate natural language information to be played according to the to-be-played template information and the to-be-played slot information.

可选地，所述根据车内人员语音信息获取待播放槽位信息包括：Optionally, said obtaining the slot information to be played according to the voice information of the personnel in the vehicle includes:

解析所述车内人员语音信息，从而获取语义信息；Analyzing the voice information of the people in the vehicle to obtain semantic information;

根据语义解析信息判断是否生成待播放自然语言信息，若是，则Judging whether to generate natural language information to be played according to the semantic analysis information, if so, then

根据语义信息获取待播放槽位信息。Obtain slot information to be played based on semantic information.

可选地，所述获取车内人员基本信息包括：Optionally, the obtaining basic information of people in the vehicle includes:

获取车内的各个座椅上的压力传感器所传递的压力信息；Obtain the pressure information transmitted by the pressure sensor on each seat in the car;

根据压力信息获取车内人员数量。Obtain the number of people in the car according to the pressure information.

获取车内摄像装置所拍摄的车内图像信息；Obtain the in-vehicle image information captured by the in-vehicle camera device;

识别所述图像信息，从而获取车内人员基本信息。Identify the image information to obtain the basic information of the occupants in the vehicle.

可选地，所述车内人员基本信息包括人员数量信息、人员脸部图像信息以及人员年龄信息。Optionally, the basic information about people in the vehicle includes number information, face image information and age information of people.

可选地，所述根据所述车内人员基本信息以及车内人员语音信息获取待播放模板信息包括：Optionally, the acquiring the template information to be played according to the basic information of the personnel in the vehicle and the voice information of the personnel in the vehicle includes:

获取预设模板数据库，所述预设模板数据库包括至少两个预设模板以及预设人员条件，一个预设模板对应一个预设人员条件；Acquiring a preset template database, the preset template database including at least two preset templates and preset personnel conditions, one preset template corresponding to one preset personnel condition;

判断获取的所述人员数量信息、人员脸部图像信息以及人员年龄信息是否符合所述预设模板数据库中的一个预设人员条件，若是，则Judging whether the obtained personnel number information, personnel facial image information and personnel age information conform to a preset personnel condition in the preset template database, if so, then

获取符合的预设人员条件所对应的预设模板作为待播放模板信息。Obtain the preset template corresponding to the preset personnel condition that meets the preset conditions as the template information to be played.

可选地，基于车内用户信息生成自然语言的方法进一步包括：Optionally, the method for generating natural language based on in-vehicle user information further includes:

获取预设人脸数据库，所述预设人脸数据库包括至少一个预设人脸信息；Acquiring a preset face database, the preset face database including at least one preset face information;

为每个人员脸部图像信息进行如下操作：Perform the following operations for each person's face image information:

将获取的人员脸部图像信息分别与各个预设人脸信息进行相似度计算，从而获取相似度值；Carry out similarity calculation between the obtained person's face image information and each preset face information, so as to obtain the similarity value;

判断是否有一个相似度值大于预设阈值，若是，则Determine whether there is a similarity value greater than the preset threshold, if so, then

判断各个人脸脸部特征信息中，相似度值大于预设阈值的人脸脸部特征信息的数量是否超过一个，若否，则Judging whether there is more than one face and face feature information whose similarity value is greater than a preset threshold in each face feature information, if not, then

获取预设特殊语音库，所述预设特殊语音库包括至少一个预设特殊语音类型以及预设人脸信息，一个预设特殊语音类型对应一个预设人脸信息；Obtaining a preset special voice bank, the preset special voice bank includes at least one preset special voice type and preset face information, one preset special voice type corresponds to one preset face information;

获取相似度值大于预设阈值的预设人脸信息所对应的预设特殊语音类型；Obtaining a preset special voice type corresponding to preset face information whose similarity value is greater than a preset threshold;

通过所述预设特殊语音类型对所述待播放自然语言信息进行播报。The to-be-played natural language information is announced through the preset special voice type.

可选地，所述基于车内用户信息生成自然语言的方法进一步包括：Optionally, the method for generating natural language based on in-vehicle user information further includes:

判断各个人脸脸部特征信息中，相似度值大于预设阈值的人脸脸部特征信息的数量是否超过一个，若是，则Judging whether there is more than one face and face feature information with a similarity value greater than a preset threshold in each face feature information, if so, then

获取人员关系图谱，所述人员关系图谱包括至少两个人员名称信息、预设人脸信息，其中，一个人员名称信息与至少一个除自身以外的其他的人员名称信息之间具有优先级关系，一个人员名称信息与一个预设人脸信息对应；Obtaining a personnel relationship graph, the personnel relationship graph includes at least two personnel name information and preset face information, wherein one personnel name information has a priority relationship with at least one other personnel name information except itself, and one The person name information corresponds to a preset face information;

获取各个人脸脸部特征信息中相似度值大于预设阈值人脸脸部特征信息所分别对应的预设人脸信息；Acquiring the preset face information corresponding to the face feature information whose similarity value is greater than the preset threshold in each face feature information;

分别获取各个预设人脸信息所对应的人员名称信息；Respectively obtain the person name information corresponding to each preset face information;

判断所获取的各个人员名称信息之间是否具有优先级关系，若是，则Judging whether there is a priority relationship between the obtained personnel name information, if so, then

获取其中优先级关系高的人员名称信息所对应的预设人脸信息所对应的预设特殊语音类型；Obtain the preset special voice type corresponding to the preset face information corresponding to the name information of the person with a high priority relationship;

可选地，在所述通过所述预设特殊语音类型对所述待播放自然语言信息进行播报之前，所述基于车内用户信息生成自然语言的方法进一步包括：Optionally, before broadcasting the natural language information to be played through the preset special voice type, the method for generating natural language based on in-vehicle user information further includes:

获取睡眠识别分类器；Get sleep recognition classifier;

获取各个所述人员脸部图像信息；Obtaining facial image information of each of the persons;

提取各个人员脸部图像信息中的特征信息；Extract the feature information in the facial image information of each person;

将各个所述特征信息分别输入至所述睡眠识别分类器，从而获取分类标签，所述分类标签包括睡眠标签；Inputting each of the characteristic information into the sleep recognition classifier respectively, so as to obtain classification labels, the classification labels including sleep labels;

当有一个分类标签为睡眠标签时，获取当前系统播报语音的音量信息；When there is a classification label as a sleep label, get the volume information of the current system broadcast voice;

判断音量信息是否超过预设音量阈值，若是，则Determine whether the volume information exceeds the preset volume threshold, and if so, then

将所述音量信息调低至所述预设音量阈值以下并对所述待播放自然语言信息进行播报。Decreasing the volume information below the preset volume threshold and broadcasting the to-be-played natural language information.

本申请还提供了一种基于车内用户信息生成自然语言的装置，所述基于车内用户信息生成自然语言的装置包括：The present application also provides a device for generating natural language based on in-vehicle user information, the device for generating natural language based on in-vehicle user information includes:

车内人员语音信息获取模块，所述车内人员语音信息获取模块用于获取车内人员语音信息；The voice information acquisition module of the personnel in the vehicle, the voice information acquisition module of the personnel in the vehicle is used to obtain the voice information of the personnel in the vehicle;

车内人员基本信息获取模块，所述车内人员基本信息获取模块用于获取车内人员基本信息；The basic information acquisition module of the personnel in the vehicle, the basic information acquisition module of the personnel in the vehicle is used to obtain the basic information of the personnel in the vehicle;

待播放槽位信息获取模块，所述待播放槽位信息获取模块用于根据车内人员语音信息获取待播放槽位信息；The slot information acquisition module to be played, the slot information acquisition module to be played is used to acquire the slot information to be played according to the voice information of the personnel in the vehicle;

待播放模板信息获取模块，所述待播放模板信息获取模块用于根据所述车内人员基本信息以及车内人员语音信息获取待播放模板信息；A template information acquisition module to be played, the template information acquisition module to be played is used to obtain the template information to be played according to the basic information of the personnel in the vehicle and the voice information of the personnel in the vehicle;

待播放自然语言信息生成模块，所述待播放自然语言信息生成模块用于根据所述待播放模板信息与所述待播放槽位信息生成待播放自然语言信息。A to-be-played natural language information generating module, the to-be-played natural language information generating module is used to generate to-be-played natural language information according to the to-be-played template information and the to-be-played slot information.

本申请所提供的基于车内用户信息生成自然语言的方法根据车内人员基本信息来获取待播放槽位信息，从而根据不同的车内人员基本信息来生成不同的待播放自然语音信息，从而使得语音交互更为人性化。The method for generating natural language based on in-vehicle user information provided by this application obtains the slot information to be played according to the basic information of the in-vehicle personnel, thereby generating different natural voice information to be played according to different basic information of the in-vehicle personnel, so that Voice interaction is more user-friendly.

附图说明Description of drawings

图1是本发明一个或多个实施例提供的基于车内用户信息生成自然语言的方法的流程图。Fig. 1 is a flowchart of a method for generating natural language based on in-vehicle user information provided by one or more embodiments of the present invention.

图2是本发明一个或多个实施例提供的基于车内用户信息生成自然语言的方法的一种电子设备结构框图。Fig. 2 is a structural block diagram of an electronic device of a method for generating natural language based on in-vehicle user information provided by one or more embodiments of the present invention.

图3为图1所示的基于车内用户信息生成自然语言的方法中的待播放模板信息的示意图。FIG. 3 is a schematic diagram of template information to be played in the method for generating natural language based on in-vehicle user information shown in FIG. 1 .

具体实施方式Detailed ways

下面将结合附图对本发明的技术方案进行清楚、完整地描述，显然，所描述的实施例是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。The technical solutions of the present invention will be clearly and completely described below in conjunction with the accompanying drawings. Apparently, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

如图1所示的基于车内用户信息生成自然语言的方法包括：The method for generating natural language based on in-vehicle user information as shown in Figure 1 includes:

步骤1：获取车内人员语音信息；Step 1: Get the voice information of the people in the car;

步骤2：获取车内人员基本信息；Step 2: Get the basic information of the people in the car;

步骤3：根据车内人员语音信息获取待播放槽位信息；Step 3: Obtain the slot information to be played according to the voice information of the personnel in the vehicle;

步骤4：根据所述车内人员基本信息以及车内人员语音信息获取待播放模板信息；Step 4: Obtain the template information to be played according to the basic information of the personnel in the vehicle and the voice information of the personnel in the vehicle;

步骤5：根据所述待播放模板信息与所述待播放槽位信息生成待播放自然语言信息。Step 5: Generate natural language information to be played according to the template information to be played and the slot information to be played.

在本实施例中，根据车内人员语音信息获取待播放槽位信息包括：In this embodiment, obtaining the slot information to be played according to the voice information of the personnel in the vehicle includes:

解析车内人员语音信息，从而获取语义信息；Analyze the voice information of people in the car to obtain semantic information;

在一个实施例中，获取车内人员基本信息包括：In one embodiment, obtaining the basic information of people in the vehicle includes:

通过压力传感器可以了解到哪些座位上有人坐，在该实施例中，还可以设置车内摄像装置，车内摄像装置可以是多个，一个车内摄像装置用于拍摄一个座位前边的人的图像，采用这种方式，在获取到哪些座位上有人后，开启对应的摄像装置就可以获取到座位上的人的图像信息。It can be known by the pressure sensor which seats are occupied by people. In this embodiment, an in-vehicle camera can also be set. There can be multiple in-vehicle cameras. One in-vehicle camera is used to capture the image of a person in front of a seat. In this way, after obtaining which seats are occupied, the image information of the persons on the seats can be obtained by turning on the corresponding camera device.

在本实施例中，获取车内人员基本信息包括：In this embodiment, obtaining the basic information of people in the vehicle includes:

在本实施例中，不通过压力传感器去检测座椅的情况，而是直接开启各个摄像装置从而拍摄车内图像信息，通过图像识别的方式来获取车内人员基本信息。In this embodiment, the pressure sensor is not used to detect the condition of the seat, but each camera device is directly turned on to capture image information inside the vehicle, and the basic information of the occupants in the vehicle is obtained through image recognition.

在本实施例中，所述车内人员基本信息包括人员数量信息、人员脸部图像信息以及人员年龄信息。In this embodiment, the basic information about people in the vehicle includes information about the number of people, information about facial images of people, and information about the age of people.

在本实施例中，人员数量信息可以通过摄像装置所获取的各个图像进行识别获取，例如，可以通过人脸图像分类器判断各个图像中是否具有人脸，若具有人脸，则通过人脸的数量即可以知道人员数量信息。In this embodiment, the information on the number of people can be identified and acquired through each image captured by the camera device. For example, a face image classifier can be used to determine whether each image has a human face. Quantity can know the number of personnel information.

在获取到每个人的人员脸部图像信息后，还可以提取各个人员脸部图像信息的特征，从而输入至预设的经过训练的年龄分类器中，从而获取每个人员的人员年龄信息。After the facial image information of each person is obtained, the features of the facial image information of each person can be extracted, and then input into a preset trained age classifier, so as to obtain the age information of each person.

在本实施例中，根据所述车内人员基本信息以及车内人员语音信息获取待播放模板信息包括：In this embodiment, obtaining the template information to be played according to the basic information of the personnel in the vehicle and the voice information of the personnel in the vehicle includes:

在本实施例中，基于车内用户信息生成自然语言的方法进一步包括：In this embodiment, the method for generating natural language based on in-vehicle user information further includes:

采用这种方式，一方面考虑了各个车内人员基本信息的情况，另一方面，也考虑了一些特殊人物的情况，例如，车内有某些经常坐车的孩子，孩子喜欢某一种特殊语音类型，例如喜欢哆啦A梦的配音，此时，在进行待播放自然语言信息时，要通过预设特殊语音类型(例如哆啦A梦的配音的声音)来体现待播放自然语言信息。In this way, on the one hand, the basic information of each person in the car is considered, and on the other hand, the situation of some special characters is also considered. For example, there are some children who often ride in the car, and the children like a certain special voice. Type, for example like the dubbing of Doraemon, at this time, when performing the natural language information to be played, it is necessary to reflect the natural language information to be played by preset special voice type (such as the voice of Doraemon's dubbing).

在一些情况下，可能出现有多个特殊人员的情况，此时，根据各个特殊人员的关系来判定用哪个预设特殊语音类型，例如，一家三口在车内，一般会以孩子为主，因此，孩子的优先级比较高，可以理解的是，该优先级关系可以根据情况自行设定。In some cases, there may be multiple special persons. At this time, it is determined which preset special voice type to use according to the relationship between each special person. , the priority of the child is relatively high. It is understandable that the priority relationship can be set according to the situation.

在本实施例中，在所述通过所述预设特殊语音类型对所述待播放自然语言信息进行播报之前，所述基于车内用户信息生成自然语言的方法进一步包括：In this embodiment, before broadcasting the natural language information to be played through the preset special voice type, the method for generating natural language based on in-vehicle user information further includes:

获取睡眠识别分类器；Get sleep recognition classifier;

在一些情况下，可能播放的声音会吵醒正在熟睡的孩子，此时，通过这种方法，可以尽量以较轻的声音来进行播放。In some cases, the sound that may be played may wake up the sleeping child. At this time, by this method, the sound can be played with a softer sound as much as possible.

下面以举例的方式对本申请进行进一步详细阐述，可以理解的是，该举例并不构成对本申请的任何限制。The present application will be further described in detail below by way of examples, and it should be understood that the examples do not constitute any limitation to the present application.

在本举例中，以需要播放音乐为场景进行举例，可以理解的是，本申请还可以应用在其他交互场景上，例如导航等，在此不再赘述。In this example, the scenario where music needs to be played is used as an example. It can be understood that this application can also be applied to other interactive scenarios, such as navigation, etc., which will not be repeated here.

在该需要播放音乐场景中，基于车内用户信息生成自然语言的方法包括：In the scene where music needs to be played, methods for generating natural language based on user information in the car include:

步骤1：获取车内人员语音信息；在本实施例中，车内人员语音信息为：播放歌唱祖国这首歌。Step 1: Obtain the voice information of the people in the car; in this embodiment, the voice information of the people in the car is: play the song Singing the Motherland.

步骤2：获取车内人员基本信息，在本实施例中，车内人员基本信息为：车里共3个人，通过图像识别获取到车内人员基本信息为：驾驶员位置为男性，岁数为成年人(18到30岁)，副驾驶员位置为女性(18到30岁)，岁数为成年人，后排座椅位置为男性，岁数为孩童(6到10岁)。可以理解的是，上述岁数通过年龄分类器即可获得，在此不再赘述。Step 2: Get the basic information of the people in the car. In this embodiment, the basic information of the people in the car is: there are 3 people in the car. The basic information of the people in the car is obtained through image recognition: the driver is a male, and his age is an adult People (18 to 30 years old), the co-pilot position is female (18 to 30 years old), the age is adults, the rear seat is male, and the age is children (6 to 10 years old). It can be understood that the above-mentioned age can be obtained through an age classifier, and details will not be described here.

步骤3：根据车内人员语音信息获取待播放槽位信息，具体而言，解析所述车内人员语音信息，从而获取语义信息；Step 3: Obtain the slot information to be played according to the voice information of the people in the car, specifically, analyze the voice information of the people in the car to obtain semantic information;

根据语义信息获取待播放槽位信息，在本实施例中，语义信息为播放歌唱祖国，则待播放槽位信息为歌唱祖国。The slot information to be played is obtained according to the semantic information. In this embodiment, the semantic information is to play and sing about the motherland, and the slot information to be played is to sing about the motherland.

步骤4根据所述车内人员基本信息以及车内人员语音信息获取待播放模板信息，在本实施例中，根据车内人员基本信息以及车内人员语音信息获取待播放模板信息包括：Step 4 obtains the template information to be played according to the basic information of the personnel in the vehicle and the voice information of the personnel in the vehicle. In this embodiment, obtaining the template information to be played according to the basic information of the personnel in the vehicle and the voice information of the personnel in the vehicle includes:

参见图3，在本实施例中，假设预设人员条件为：有用户年龄特征为儿童，则生成的待播放模板信息为：小朋友，让我们来听。Referring to FIG. 3 , in this embodiment, assuming that the preset personnel condition is: a user whose age characteristic is a child, the generated template information to be played is: children, let's listen.

可以理解的是，预设人员条件可以设置很多种，如图3所示，如果是多个人的话，可以是图3中的模板4，如果是其他条件，也可以是其他对应模板，可以理解的是，当符合多个预设人员条件时，也可以通过优先级来进行选择，例如，以有小孩的模板为最优先模板，以多人模板为第二优先模板这种方式，设置各个模板的优先级，从而在获取模板时，获取优先级最高的模板。It is understandable that the preset personnel conditions can be set in many ways, as shown in Figure 3, if there are multiple people, it can be the template 4 in Figure 3, if it is other conditions, it can also be other corresponding templates, it is understandable Yes, when multiple preset personnel conditions are met, you can also select by priority. For example, the template with children is the highest priority template, and the multi-person template is the second priority template. In this way, set the Priority, so that when obtaining templates, obtain the template with the highest priority.

步骤5：根据待播放模板信息与所述待播放槽位信息生成待播放自然语言信息，将生成的待播放槽位信息为：小朋友，让我们来听以及待播放槽位信息歌唱祖国结合，从而生成待播放自然语言信息：小朋友，让我们来听歌唱祖国。Step 5: Generate natural language information to be played according to the template information to be played and the slot information to be played, and the generated slot information to be played is: children, let us listen and the slot information to be played sings the combination of the motherland, thereby Generate natural language information to be played: children, let's listen to singing about the motherland.

在本实施例中，待播放自然语言信息包括待播放模板信息(TTSID(PlayMusic))与槽位信息(SongName、Singer)。待播放自然语言信息包括的内容属于现有技术，在此不再赘述。In this embodiment, the to-be-played natural language information includes to-be-played template information (TTSID (PlayMusic)) and slot information (SongName, Singer). The content included in the natural language information to be played belongs to the prior art, and will not be repeated here.

当生成待播放自然语言信息后，还需要考虑采用什么样的语音类型来进行播报，此时，根据车辆内的人员的人脸进行判断，具体而言，获取预设人脸数据库，所述预设人脸数据库包括至少一个预设人脸信息；After the natural language information to be played is generated, it is also necessary to consider what type of voice is used for broadcasting. At this time, judge according to the faces of the people in the vehicle, specifically, obtain a preset face database, and the preset It is assumed that the face database includes at least one preset face information;

举例来说，以上述的三人为例，孩子的人脸脸部图像信息与预设人脸数据库中的预设人脸信息相同，则表示孩子已经登记在预设人脸数据库中，此时，其具有预设特殊语音类型，例如，哆啦A梦的语音类型，此时，通过所述预设特殊语音类型对所述待播放自然语言信息进行播报。For example, taking the above-mentioned three people as an example, if the child's face image information is the same as the preset face information in the preset face database, it means that the child has been registered in the preset face database. At this time, It has a preset special voice type, for example, the voice type of Doraemon. At this time, the to-be-played natural language information is broadcast through the preset special voice type.

在本实施例中，在进行播报时，还要考虑是否其他乘客正在睡觉，例如，三个人中的女性正在睡觉，此时，应当降低声音播放。In this embodiment, when broadcasting, it is also considered whether other passengers are sleeping, for example, the female among the three people is sleeping, at this time, the sound should be lowered.

本申请还提供了一种基于车内用户信息生成自然语言的装置，所述基于车内用户信息生成自然语言的装置包括车内人员语音信息获取模块、车内人员基本信息获取模块、待播放槽位信息获取模块、待播放模板信息获取模块以及待播放自然语言信息生成模块，其中，The present application also provides a device for generating natural language based on in-vehicle user information. The device for generating natural language based on in-vehicle user information includes a voice information acquisition module for in-vehicle personnel, an acquisition module for basic information about in-vehicle personnel, and a slot to be played. Bit information acquisition module, template information acquisition module to be played and natural language information generation module to be played, wherein,

车内人员语音信息获取模块用于获取车内人员语音信息；车内人员基本信息获取模块用于获取车内人员基本信息；待播放槽位信息获取模块用于根据车内人员语音信息获取待播放槽位信息；待播放模板信息获取模块用于根据所述车内人员基本信息以及车内人员语音信息获取待播放模板信息；待播放自然语言信息生成模块用于根据所述待播放模板信息与所述待播放槽位信息生成待播放自然语言信息。The voice information acquisition module of the personnel in the vehicle is used to obtain the voice information of the personnel in the vehicle; the basic information acquisition module of the personnel in the vehicle is used to obtain the basic information of the personnel in the vehicle; the slot information acquisition module to be played is used to obtain the voice information of the personnel in the vehicle to be played Slot information; the template information acquisition module to be played is used to obtain the template information to be played according to the basic information of the personnel in the car and the voice information of the personnel in the vehicle; the natural language information generation module to be played is used to obtain the template information to be played according to the template information to be played and the Describe the slot information to be played to generate natural language information to be played.

图2是本发明一个或多个实施例提供的一种电子设备结构框图。Fig. 2 is a structural block diagram of an electronic device provided by one or more embodiments of the present invention.

如图2所示，本申请还公开了一种电子设备，包括：处理器、通信接口、存储器和通信总线，其中，处理器，通信接口，存储器通过通信总线完成相互间的通信；存储器中存储有计算机程序，当计算机程序被处理器执行时，使得处理器执行基于车内用户信息生成自然语言的方法的步骤。As shown in Figure 2, the present application also discloses an electronic device, including: a processor, a communication interface, a memory, and a communication bus, wherein the processor, the communication interface, and the memory complete communication with each other through the communication bus; There is a computer program which, when executed by the processor, causes the processor to perform the steps of the method of generating natural language based on in-vehicle user information.

本申请还提供了一种计算机可读存储介质，其存储有可由电子设备执行的计算机程序，当计算机程序在电子设备上运行时，使得电子设备执行基于车内用户信息生成自然语言的方法的步骤。The present application also provides a computer-readable storage medium, which stores a computer program executable by an electronic device, and when the computer program runs on the electronic device, the electronic device executes the steps of the method for generating natural language based on in-vehicle user information .

上述电子设备提到的通信总线可以是外设部件互连标准(PeripheralComponentInterconnect，PCI)总线或扩展工业标准结构(ExtendedIndustryStandardArchitecture，EISA)总线等。该通信总线可以分为地址总线、数据总线、控制总线等。为便于表示，图中仅用一条粗线表示，但并不表示仅有一根总线或一种类型的总线。The communication bus mentioned in the above-mentioned electronic device may be a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (Extended Industry Standard Architecture, EISA) bus or the like. The communication bus can be divided into an address bus, a data bus, a control bus, and the like. For ease of representation, only one thick line is used in the figure, but it does not mean that there is only one bus or one type of bus.

电子设备包括硬件层，运行在硬件层之上的操作系统层，以及运行在操作系统上的应用层。该硬件层包括中央处理器(CPU，CentralProcessingUnit)、内存管理单元(MMU，MemoryManagementUnit)和内存等硬件。该操作系统可以是任意一种或多种通过进程(Process)实现电子设备控制的计算机操作系统，例如，Linux操作系统、Unix操作系统、Android操作系统、iOS操作系统或windows操作系统等。并且在本发明实施例中该电子设备可以是智能手机、平板电脑等手持设备，也可以是桌面计算机、便携式计算机等电子设备，本发明实施例中并未特别限定。An electronic device includes a hardware layer, an operating system layer running on the hardware layer, and an application layer running on the operating system. The hardware layer includes hardware such as a central processing unit (CPU, Central Processing Unit), a memory management unit (MMU, Memory Management Unit) and memory. The operating system can be any one or more computer operating systems that realize electronic device control through processes, for example, Linux operating system, Unix operating system, Android operating system, iOS operating system, or windows operating system. And in the embodiment of the present invention, the electronic device may be a handheld device such as a smart phone or a tablet computer, or may be an electronic device such as a desktop computer or a portable computer, which is not particularly limited in the embodiment of the present invention.

本发明实施例中的电子设备控制的执行主体可以是电子设备，或者是电子设备中能够调用程序并执行程序的功能模块。电子设备可以获取到存储介质对应的固件，存储介质对应的固件由供应商提供，不同存储介质对应的固件可以相同可以不同，在此不做限定。电子设备获取到存储介质对应的固件后，可以将该存储介质对应的固件写入存储介质中，具体地是往该存储介质中烧入该存储介质对应固件。将固件烧入存储介质的过程可以采用现有技术实现，在本发明实施例中不做赘述。The execution subject of electronic device control in the embodiment of the present invention may be an electronic device, or a functional module in the electronic device that can call a program and execute the program. The electronic device can obtain the firmware corresponding to the storage medium. The firmware corresponding to the storage medium is provided by the supplier. The firmware corresponding to different storage media may be the same or different, which is not limited here. After the electronic device obtains the firmware corresponding to the storage medium, it may write the firmware corresponding to the storage medium into the storage medium, specifically burn the firmware corresponding to the storage medium into the storage medium. The process of burning the firmware into the storage medium can be realized by using the existing technology, and will not be repeated in the embodiment of the present invention.

电子设备还可以获取到存储介质对应的重置命令，存储介质对应的重置命令由供应商提供，不同存储介质对应的重置命令可以相同可以不同，在此不做限定。The electronic device can also obtain a reset command corresponding to the storage medium. The reset command corresponding to the storage medium is provided by the supplier. The reset commands corresponding to different storage media can be the same or different, which is not limited here.

此时电子设备的存储介质为写入了对应的固件的存储介质，电子设备可以在写入了对应的固件的存储介质中响应该存储介质对应的重置命令，从而电子设备根据存储介质对应的重置命令，对该写入对应的固件的存储介质进行重置。根据重置命令对存储介质进行重置的过程可以现有技术实现，在本发明实施例中不做赘述。At this time, the storage medium of the electronic device is the storage medium in which the corresponding firmware is written, and the electronic device can respond to the reset command corresponding to the storage medium in the storage medium in which the corresponding firmware is written, so that the electronic device can The reset command resets the storage medium in which the corresponding firmware is written. The process of resetting the storage medium according to the reset command can be implemented in the prior art, and will not be described in detail in this embodiment of the present invention.

为了描述的方便，描述以上装置时以功能分为各种单元、模块分别描述。当然在实施本申请时可以把各单元、模块的功能在同一个或多个软件和/或硬件中实现。For the convenience of description, when describing the above devices, the functions are divided into various units and modules and described separately. Of course, when implementing the present application, the functions of each unit and module can be implemented in one or more software and/or hardware.

本技术领域技术人员可以理解，除非另外定义，这里使用的所有术语(包括技术术语和科学术语)，具有与本发明所属领域中的普通技术人员的一般理解相同的意义。还应该理解的是，诸如通用字典中定义的那些术语，应该被理解为具有与现有技术的上下文中的意义一致的意义，并且除非被特定定义，否则不会用理想化或过于正式的含义来解释。Those skilled in the art can understand that, unless otherwise defined, all terms (including technical terms and scientific terms) used herein have the same meaning as commonly understood by those of ordinary skill in the art to which this invention belongs. It should also be understood that terms, such as those defined in commonly used dictionaries, should be understood to have meanings consistent with the meanings in the context of the prior art, and will not be used in an idealized or overly formal sense unless specifically defined to explain.

对于方法实施例，为了简单描述，故将其都表述为一系列的动作组合，但是本领域技术人员应该知悉，本发明实施例并不受所描述的动作顺序的限制，因为依据本发明实施例，某些步骤可以采用其他顺序或者同时进行。其次，本领域技术人员也应该知悉，说明书中所描述的实施例均属于优选实施例，所涉及的动作并不一定是本发明实施例所必须的。For the method embodiment, for the sake of simple description, it is expressed as a series of action combinations, but those skilled in the art should know that the embodiment of the present invention is not limited by the described action order, because according to the embodiment of the present invention , certain steps may be performed in other order or simultaneously. Secondly, those skilled in the art should also know that the embodiments described in the specification belong to preferred embodiments, and the actions involved are not necessarily required by the embodiments of the present invention.

通过以上的实施方式的描述可知，本领域的技术人员可以清楚地了解到本申请可借助软件加必需的通用硬件平台的方式来实现。基于这样的理解，本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来，该计算机软件产品可以存储在存储介质中，如ROM/RAM、磁碟、光盘等，包括若干指令用以使得一台计算机设备(可以是个人计算机，服务器或者网络设备等)执行本申请各个实施方式或者实施方式的某些部分所述的方法。It can be known from the above description of the implementation manners that those skilled in the art can clearly understand that the present application can be implemented by means of software plus a necessary general-purpose hardware platform. Based on this understanding, the essence of the technical solution of this application or the part that contributes to the prior art can be embodied in the form of software products, and the computer software products can be stored in storage media, such as ROM/RAM, disk , optical disc, etc., including several instructions to make a computer device (which may be a personal computer, server or network device, etc.) execute the methods described in various embodiments or some parts of the embodiments of this application.

最后应说明的是：以上各实施例仅用以说明本发明的技术方案，而非对其限制；尽管参照前述各实施例对本发明进行了详细的说明，本领域的普通技术人员应当理解：其依然可以对前述各实施例所记载的技术方案进行修改，或者对其中部分或者全部技术特征进行等同替换；而这些修改或者替换，并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present invention, rather than limiting them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: It is still possible to modify the technical solutions described in the foregoing embodiments, or perform equivalent replacements for some or all of the technical features; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the technical solutions of the various embodiments of the present invention. scope.

Claims

1. A method for generating natural language based on in-vehicle user information, characterized in that, the method for generating natural language based on in-vehicle user information comprises:

Obtain the voice information of the people in the car;

Get the basic information of the people in the car;

Obtain the slot information to be played according to the voice information of the personnel in the vehicle;

Obtain template information to be played according to the basic information of the personnel in the vehicle and the voice information of the personnel in the vehicle;

Generate natural language information to be played according to the to-be-played template information and the to-be-played slot information.

2. the method for generating natural language based on user information in the car as claimed in claim 1, is characterized in that, described according to the personnel voice information in the car to obtain the slot information to be played comprises:

Analyzing the voice information of the people in the vehicle to obtain semantic information;

Judging whether to generate natural language information to be played according to the semantic analysis information, if so, then

Obtain slot information to be played based on semantic information.

3. the method for generating natural language based on user information in the car according to claim 2, is characterized in that, described acquisition basic information of personnel in the car comprises:

Obtain the pressure information transmitted by the pressure sensor on each seat in the car;

Obtain the number of people in the vehicle according to the pressure information.

4. the method for generating natural language based on user information in the car according to claim 2, is characterized in that, described acquisition basic information of personnel in the car comprises:

Obtain the in-vehicle image information captured by the in-vehicle camera device;

Identify the image information to obtain the basic information of the occupants in the vehicle.

5. The method for generating natural language based on in-vehicle user information as claimed in claim 4, wherein the basic information about the occupants in the vehicle includes information on the number of persons, information on facial images of persons, and information on the age of persons.

6. the method for generating natural language based on user information in the car as claimed in claim 5, is characterized in that, described according to the basic information of the people in the car and the voice information of the people in the car to obtain the template information to be played comprises:

Acquiring a preset template database, the preset template database including at least two preset templates and preset personnel conditions, one preset template corresponding to one preset personnel condition;

Judging whether the obtained personnel number information, personnel facial image information and personnel age information conform to a preset personnel condition in the preset template database, if so, then

Obtain the preset template corresponding to the preset personnel condition that meets the preset conditions as the template information to be played.

7. The method for generating natural language based on in-vehicle user information as claimed in claim 6, wherein the method for generating natural language based on in-vehicle user information further comprises:

Acquiring a preset face database, the preset face database including at least one preset face information;

Perform the following operations for each person's face image information:

Carry out similarity calculation between the obtained person's face image information and each preset face information, so as to obtain the similarity value;

Determine whether there is a similarity value greater than the preset threshold, if so, then

Judging whether there is more than one face and face feature information whose similarity value is greater than a preset threshold in each face feature information, if not, then

Obtaining a preset special voice bank, the preset special voice bank includes at least one preset special voice type and preset face information, one preset special voice type corresponds to one preset face information;

Obtaining a preset special voice type corresponding to preset face information whose similarity value is greater than a preset threshold;

The to-be-played natural language information is announced through the preset special voice type.

8. The method for generating natural language based on in-vehicle user information as claimed in claim 7, wherein the method for generating natural language based on in-vehicle user information further comprises:

Judging whether there is more than one face and face feature information with a similarity value greater than a preset threshold in each face feature information, if so, then

Obtaining a personnel relationship graph, the personnel relationship graph includes at least two personnel name information and preset face information, wherein one personnel name information has a priority relationship with at least one other personnel name information except itself, and one The person name information corresponds to a preset face information;

Acquiring the preset face information corresponding to the face feature information whose similarity value is greater than the preset threshold in each face feature information;

Respectively obtain the person name information corresponding to each preset face information;

Judging whether there is a priority relationship between the obtained personnel name information, if so, then

Obtain the preset special voice type corresponding to the preset face information corresponding to the name information of the person with a high priority relationship;

9. The method for generating natural language based on in-vehicle user information as claimed in claim 8, wherein, before the broadcast of the natural language information to be played through the preset special voice type, the The method for generating natural language from user information in the vehicle further includes:

Get a sleep recognition classifier;

Acquiring facial image information of each of the persons;

Extract the feature information in the facial image information of each person;

Inputting each of the characteristic information into the sleep recognition classifier respectively, so as to obtain classification labels, the classification labels including sleep labels;

When there is a classification label as a sleep label, get the volume information of the current system broadcast voice;

Determine whether the volume information exceeds the preset volume threshold, and if so, then

Decreasing the volume information below the preset volume threshold and broadcasting the to-be-played natural language information.

10. A device for generating natural language based on in-vehicle user information, characterized in that the device for generating natural language based on in-vehicle user information comprises:

The voice information acquisition module of the personnel in the vehicle, the voice information acquisition module of the personnel in the vehicle is used to obtain the voice information of the personnel in the vehicle;

The basic information acquisition module of the personnel in the vehicle, the basic information acquisition module of the personnel in the vehicle is used to obtain the basic information of the personnel in the vehicle;

The slot information acquisition module to be played, the slot information acquisition module to be played is used to acquire the slot information to be played according to the voice information of the personnel in the vehicle;

A template information acquisition module to be played, the template information acquisition module to be played is used to obtain the template information to be played according to the basic information of the personnel in the vehicle and the voice information of the personnel in the vehicle;

A to-be-played natural language information generating module, the to-be-played natural language information generating module is used to generate to-be-played natural language information according to the to-be-played template information and the to-be-played slot information.