CN112599130A

CN112599130A - Intelligent conference system based on intelligent screen

Info

Publication number: CN112599130A
Application number: CN202011408172.4A
Authority: CN
Inventors: 李广垒; 陈祖涛
Original assignee: Anhui Baoxin Information Technology Co ltd
Current assignee: Anhui Baoxin Information Technology Co ltd
Priority date: 2020-12-03
Filing date: 2020-12-03
Publication date: 2021-04-02
Anticipated expiration: 2040-12-03
Also published as: CN112599130B

Abstract

The invention discloses an intelligent conference system based on an intelligent screen, comprising: an acquisition and recognition module for acquiring voice information and performing voice recognition on it; a voice amplification module for receiving the voice information sent by the acquisition and recognition module, amplifying it, and transmitting the amplified voice information to a loudspeaker, which converts it into a sound signal and emits it directionally; an information processing module for receiving the voice information sent by the acquisition and recognition module and converting it into text information; and a subtitle display module for receiving the text information sent by the information processing module and displaying it on a screen. The invention improves the accuracy with which a speaker's remarks are conveyed, so that the information is transmitted more effectively.

Description

Intelligent conference system based on intelligent screen

Technical field:

The invention relates to the technical field of intelligent conference systems, and in particular to an intelligent conference system based on an intelligent screen.

Background art:

Since the start of the 21st century, society has steadily entered the age of multimedia information. Mass media now mainly comprise the internet, television, mobile phones and the like, and multimedia information has become an indispensable part of daily life. Live broadcasting, which combines the three elements of a media carrier (sound, text and images), is the most direct way for people to convey and understand information, particularly in settings such as press conferences, large-scale conferences, live television and educational training. Any information that must be conveyed with sound, images and text as carriers, such as interviews, meetings, legal disputes and medical consultations, needs a product system that can put content on screen in real time.

To achieve live broadcasting in these settings, the conventional solution is as follows: during on-site recording, a professional stenography team transcribes and proofreads the audio; once transcription is complete, the text is matched with the video or with pictures and text, and the result is then released. This approach has the following limitations: 1. Delayed information: because the video is released only after manual transcription, there is a time lag relative to the live event. 2. Inefficient information acquisition. 3. Resource consumption in post-processing: when video is broadcast live, manpower is needed to align the transcribed text with the video timestamps to form subtitles.

Summary of the invention:

The invention aims to overcome the above shortcomings and provides an intelligent conference system based on an intelligent screen.

The invention provides an intelligent conference display method based on an intelligent screen, which comprises the following steps:

acquiring the voice signal of a speech;

amplifying the voice signal and transmitting the amplified voice signal to a loudspeaker, where it is converted into a sound signal for output;

recognizing and converting the acquired voice signal into text information, and displaying the converted text information;

wherein the text information is updated in real time as follows: the text information comprises first display text and calibration display text. The first display text consists of keywords selected when the matching degree between the voice information collected in a first preset time period and the corresponding text in the sample information exceeds a first threshold matching degree, and the first display text displays the recognized keywords at intervals. The calibration display text consists of sentences determined when the matching degree between the voice information collected in a second preset time period and the corresponding text in the sample information exceeds a second threshold matching degree. The first preset time period is shorter than the second preset time period, and the first threshold matching degree is lower than the second threshold matching degree. The calibration display text overlays the first display text.
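The two-stage update rule above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the `Candidate` and `SubtitleLine` types, the window lengths and the threshold values are all hypothetical, chosen only to respect the stated ordering (shorter first window, lower first threshold). The match score is taken as a precomputed number between 0 and 1, because the source does not say how it is calculated.

```python
from dataclasses import dataclass
from typing import List, Optional

# Illustrative values only; the source fixes the ordering, not the numbers.
FIRST_WINDOW_S = 0.5      # first preset time period (assumed)
SECOND_WINDOW_S = 3.0     # second preset time period (assumed)
FIRST_THRESHOLD = 0.6     # first threshold for the match score (assumed)
SECOND_THRESHOLD = 0.9    # second threshold for the match score (assumed)

assert FIRST_WINDOW_S < SECOND_WINDOW_S
assert FIRST_THRESHOLD < SECOND_THRESHOLD


@dataclass
class Candidate:
    """A recognition result for one window of audio (hypothetical type)."""
    text: str
    match: float      # match score against the sample information, 0..1
    window_s: float   # length of the audio window that produced it


class SubtitleLine:
    """Holds what is currently shown for one spoken sentence."""

    def __init__(self) -> None:
        self.keywords: List[str] = []
        self.calibrated: Optional[str] = None

    def offer(self, cand: Candidate) -> None:
        """Apply one candidate according to the two-stage rule."""
        if cand.window_s >= SECOND_WINDOW_S:
            # Sentence-level result: accepted only above the higher threshold.
            if cand.match > SECOND_THRESHOLD:
                self.calibrated = cand.text
        elif cand.match > FIRST_THRESHOLD and self.calibrated is None:
            # Keyword-level result: shown quickly, at the lower threshold.
            self.keywords.append(cand.text)

    def render(self) -> str:
        """The calibration text overlays the keywords once it exists."""
        if self.calibrated is not None:
            return self.calibrated
        return "  ".join(self.keywords)  # keywords separated by gaps
```

In this sketch a keyword reaches the screen as soon as a short window clears the lower threshold, and a sentence replaces the keywords only when a longer window clears the higher one, so a weak sentence-level result leaves the keywords in place.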

In another aspect, the present invention provides an intelligent conference system based on an intelligent screen, including:

an acquisition and recognition module, used for acquiring voice information and performing voice recognition on the acquired voice information;

a voice amplification module, used for receiving the voice information sent by the acquisition and recognition module, amplifying it, and transmitting the amplified voice information to a loudspeaker, which converts it into a sound signal and emits it directionally;

an information processing module, used for receiving the voice information sent by the acquisition and recognition module and converting the received voice information into text information;

and a subtitle display module, used for receiving the text information sent by the information processing module and displaying the text information on a screen.
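The data flow between the four modules can be sketched as below. The class names, the `receive`/`capture`/`show` methods and the `recognize` and `loudspeaker` callbacks are hypothetical names introduced only for illustration; the source describes the modules functionally and does not define any programming interface. As a simplification, speech-to-text is placed entirely in the information processing stub.

```python
from typing import Callable, List


class SubtitleDisplay:
    """Subtitle display module: collects text that would be drawn on screen."""

    def __init__(self) -> None:
        self.lines: List[str] = []

    def show(self, text: str) -> None:
        self.lines.append(text)


class InformationProcessor:
    """Information processing module: turns voice information into text."""

    def __init__(self, recognize: Callable[[bytes], str],
                 display: SubtitleDisplay) -> None:
        self._recognize = recognize   # hypothetical speech-to-text callback
        self._display = display

    def receive(self, voice: bytes) -> None:
        text = self._recognize(voice)
        if text:
            self._display.show(text)


class VoiceAmplifier:
    """Voice amplification module: forwards audio to a loudspeaker sink."""

    def __init__(self, loudspeaker: Callable[[bytes], None]) -> None:
        self._loudspeaker = loudspeaker

    def receive(self, voice: bytes) -> None:
        # Gain is omitted here; this stub only shows the routing.
        self._loudspeaker(voice)


class AcquisitionModule:
    """Acquisition and recognition module: fans captured audio out to both paths."""

    def __init__(self, amplifier: VoiceAmplifier,
                 processor: InformationProcessor) -> None:
        self._amplifier = amplifier
        self._processor = processor

    def capture(self, voice: bytes) -> None:
        self._amplifier.receive(voice)   # audible path
        self._processor.receive(voice)   # subtitle path
```

The point of the sketch is the fan-out: the same captured audio feeds the loudspeaker path and the subtitle path, so listeners hear the speaker and see the text from a single acquisition step.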

Further, the information processing module converts voice information into text information that is updated in real time. The text information comprises first display text and calibration display text. The first display text consists of keywords selected when the matching degree between the voice information collected in a first preset time period and the corresponding text in the sample information exceeds a first threshold matching degree, and the first display text displays the recognized keywords at intervals. The calibration display text consists of sentences determined when the matching degree between the voice information collected in a second preset time period and the corresponding text in the sample information exceeds a second threshold matching degree. The first preset time period is shorter than the second preset time period, and the first threshold matching degree is lower than the second threshold matching degree. The calibration display text overlays the first display text.

Further, the acquisition and recognition module comprises a microphone, a sound console and a sound card; the sound console is connected to the microphone through a data line and to the sound card through a data line, and the sound card is used for conveying signals to the voice amplification module.

Another aspect of the invention provides a display device comprising a display, a memory and a processor, wherein:

the processor is used for executing the intelligent conference display method;

the memory is used for storing instructions executable by the processor;

the display is used for displaying the text information processed by the processor.

Another aspect of the present invention provides a storage medium storing a computer program for executing the intelligent conference display method.

The intelligent conference system based on an intelligent screen disclosed by the invention has the following beneficial effects. The voice information produced by a speech can be amplified and transmitted, and at the same time recognized, with the recognized text displayed, so that conference participants can both hear the speaker and read the corresponding text on the display screen. By comparison, people take in text faster than speech, and when reading they can skip content they do not need to attend to, so the text display lets them grasp what is being expressed more quickly. The accuracy of information transmission is thereby improved. When voice information is recognized and converted into text, the accurately recognized keywords in the speaker's sentence are displayed first, so that key information can be responded to immediately; the whole sentence, obtained by whole-sentence recognition, is then displayed after a delay relative to the keywords, which improves completeness. Speech is thus displayed both promptly and completely, making information transmission faster and more accurate.

Drawings

Fig. 1 is a schematic diagram of an intelligent conference system based on an intelligent screen.

Detailed Description

The invention is further illustrated by the following examples. They are implemented on the basis of the technical scheme of the invention, and detailed embodiments and specific operating procedures are given, but the scope of protection of the invention is not limited to these examples:

Exemplary method:

The application provides an intelligent conference display method based on an intelligent screen, which comprises the following steps:

acquiring the voice signal of a speech;

wherein the text information is updated in real time in the following manner, the text information comprising first display text and calibration display text;

the first display text consists of keywords selected when the matching degree between the voice information collected in a first preset time period and the corresponding text in the sample information exceeds a first threshold matching degree, and the first display text displays the recognized keywords at intervals; in some embodiments, the keywords can be displayed in a flashing manner, and they can be recognized and displayed right after the speaker utters them, which improves timeliness; from the keyword text a reader can skip content that need not be attended to, grasp the general meaning, and quickly learn what the speech expresses;

the calibration display text consists of sentences determined when the matching degree between the voice information collected in a second preset time period and the corresponding text in the sample information exceeds a second threshold matching degree, the first preset time period being shorter than the second preset time period and the first threshold matching degree being lower than the second threshold matching degree; the calibration display text overlays the first display text; in some embodiments, the whole sentence is matched over a longer time, with a certain lag, and converted into suitable calibration text, and the more accurate calibration text covers the originally displayed keywords, improving accuracy; and in some embodiments, a plurality of second preset times with different lag times can be added, so that calibration display texts are generated over several different time periods, further improving the accuracy of the text.
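The embodiment with several second preset times can be pictured as a series of calibration passes, each arriving with a longer lag. The sketch below is one possible reading, not something the source specifies: the function name `calibration_timeline`, the tuple layout, the threshold value and the rule that a pass is skipped when its text is already on screen are all assumptions.

```python
from typing import List, Optional, Tuple

SECOND_THRESHOLD = 0.9  # assumed value for the second threshold


def calibration_timeline(
    passes: List[Tuple[float, str, float]],
) -> List[Tuple[float, str]]:
    """Return the (lag, text) overlays that would reach the screen.

    Each pass is (lag_seconds, sentence, match_score). Passes are applied
    in order of increasing lag; a pass overlays the display only if its
    match score exceeds the threshold and its text differs from what is
    already shown.
    """
    shown: Optional[str] = None
    timeline: List[Tuple[float, str]] = []
    for lag, sentence, match in sorted(passes, key=lambda p: p[0]):
        if match > SECOND_THRESHOLD and sentence != shown:
            shown = sentence
            timeline.append((lag, sentence))
    return timeline
```

For example, given passes at lags of 2, 4 and 6 seconds where only the first and last clear the threshold, the screen is updated twice: once with the early sentence and once more when the later, better-matched sentence arrives.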

Exemplary system:

An intelligent conference system based on an intelligent screen, as shown in Fig. 1, comprises:

Specifically, the information processing module converts voice information into text information that is updated in real time. The text information comprises first display text and calibration display text. The first display text consists of keywords selected when the matching degree between the voice information collected in a first preset time period and the corresponding text in the sample information exceeds a first threshold matching degree, and the first display text displays the recognized keywords at intervals. The calibration display text consists of sentences determined when the matching degree between the voice information collected in a second preset time period and the corresponding text in the sample information exceeds a second threshold matching degree. The first preset time period is shorter than the second preset time period, and the first threshold matching degree is lower than the second threshold matching degree. The calibration display text overlays the first display text.

It should be noted that the acquisition and recognition module comprises a microphone, a sound console and a sound card; the sound console is connected to the microphone through a data line and to the sound card through a data line, and the sound card is used for transmitting signals to the voice amplification module.
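As a rough illustration of what the voice amplification module might do with the signal the sound card delivers, the sketch below applies a fixed gain to digital audio samples. The source does not specify a sample format, a gain value or whether amplification is digital at all; 16-bit signed PCM, the function name `amplify_pcm16` and clipping on overflow are assumptions made for this example.

```python
from array import array

INT16_MIN, INT16_MAX = -32768, 32767


def amplify_pcm16(samples: array, gain: float) -> array:
    """Scale 16-bit signed samples by `gain`, clipping instead of wrapping."""
    out = array("h")
    for s in samples:
        scaled = int(round(s * gain))
        # Clamp to the 16-bit range so loud peaks saturate rather than overflow.
        out.append(max(INT16_MIN, min(INT16_MAX, scaled)))
    return out
```

Clipping is used here because appending an out-of-range value to an `array("h")` raises `OverflowError`; a real system would more likely limit or compress the signal before it reaches that point.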

Exemplary device:

A display device comprising a display, a memory and a processor;

the processor is used for executing the intelligent conference display method;

the memory is used for storing instructions executable by the processor;

Exemplary computer program products and computer-readable storage media:

In addition to the above-described methods and apparatus, embodiments of the present disclosure may also be a computer program product comprising computer program instructions that, when executed by a processor, cause the processor to perform the steps in the intelligent conference display method according to the various embodiments of the present disclosure described in the "exemplary method" section of this specification above.

Program code of the computer program product for carrying out operations of embodiments of the present disclosure may be written in any combination of one or more programming languages, including object-oriented programming languages such as Java, C++ or the like and conventional procedural programming languages such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server.

Furthermore, embodiments of the present disclosure may also be a computer-readable storage medium having stored thereon computer program instructions that, when executed by a processor, cause the processor to perform the steps in the intelligent conference display method according to various embodiments of the present disclosure described in the "exemplary methods" section above in this specification.

The computer-readable storage medium may take any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may include, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.

The foregoing is merely an example of the present invention, and features of the schemes that are common general knowledge are not described here in further detail. It should be noted that a person skilled in the art can make several modifications without departing from the invention, and these should also be regarded as falling within the scope of protection of the invention.

Claims

1. An intelligent conference system based on an intelligent screen, characterized in that it comprises:

2. The intelligent conference system based on the intelligent screen as claimed in claim 1, wherein: the information processing module converts voice information into text information that is updated in real time; the text information comprises first display text and calibration display text; the first display text consists of keywords selected when the matching degree between the voice information collected in a first preset time period and the corresponding text in the sample information exceeds a first threshold matching degree, and the first display text displays the recognized keywords at intervals; the calibration display text consists of sentences determined when the matching degree between the voice information collected in a second preset time period and the corresponding text in the sample information exceeds a second threshold matching degree; the first preset time period is shorter than the second preset time period, and the first threshold matching degree is lower than the second threshold matching degree; the calibration display text overlays the first display text.

3. The intelligent conference system based on the intelligent screen as claimed in claim 2, wherein: the acquisition and recognition module comprises a microphone, a sound console and a sound card; the sound console is connected to the microphone through a data line and to the sound card through a data line, and the sound card is used for conveying signals to the voice amplification module.

4. An intelligent conference display method based on an intelligent screen, characterized in that the method comprises the following steps:

acquiring the voice signal of a speech;

5. A display device, characterized in that it comprises a display, a memory and a processor, wherein:

the processor is configured to execute the intelligent conference display method of claim 4;

the memory is used for storing instructions executable by the processor;

6. A storage medium, characterized in that the storage medium stores a computer program for executing the intelligent conference display method according to claim 4.