CN113689855A

CN113689855A - Conference record generation system, method, device and storage medium

Info

Publication number: CN113689855A
Application number: CN202110948154.3A
Authority: CN
Inventors: 黎莎; 徐凡
Original assignee: Beijing Railway Institute of Mechanical and Electrical Engineering Group Co Ltd
Current assignee: Beijing Railway Institute of Mechanical and Electrical Engineering Group Co Ltd
Priority date: 2021-08-18
Filing date: 2021-08-18
Publication date: 2021-11-23

Abstract

The embodiments of the invention disclose a conference record generation system, method, device and storage medium. The system comprises control equipment, data processing equipment and voice acquisition equipment. The control equipment is in communication connection with the data processing equipment and is used for generating a conference marking signal and sending it to the data processing equipment, wherein the conference marking signal comprises an identity. The voice acquisition equipment is in communication connection with the data processing equipment and is used for acquiring audio signals and sending them to the data processing equipment. The data processing equipment is used for generating a conference record based on the identity in at least one conference marking signal and the audio signal corresponding to each conference marking signal. This technical scheme establishes a relationship between the audio information and the identity, so that the text information in the generated conference record carries the identity, which reduces the manual summarizing of conference records and improves the efficiency of generating them.

Description

Conference record generation system, method, device and storage medium

Technical Field

The embodiment of the invention relates to the technical field of voice recognition, in particular to a conference record generation system, a conference record generation method, a conference record generation device and a storage medium.

Background

With the arrival of the information age, conferences of all kinds, particularly major and large ones, currently still need dedicated full-time personnel to take the conference record.

Against this background, devices that record speech and automatically generate conference records have since become available on the market. They can transcribe the speech of speakers in a conference into text, and can guarantee a high accuracy rate in a quiet environment.

However, in an actual conference scene, abnormal situations often make the voice recognition result less than ideal; for example, the conference environment is noisy, or a speech is interrupted midway. Conference recording personnel then need to summarize the completion status of each department from a large, disordered text record, which makes conference recording inefficient.

Disclosure of Invention

The embodiment of the invention provides a system, a method and a device for generating a conference record and a storage medium, so as to improve the efficiency of the conference record.

In a first aspect, an embodiment of the present invention provides a system for generating a conference record, including: control equipment, data processing equipment and voice acquisition equipment, wherein,

the control equipment is in communication connection with the data processing equipment and is used for generating a conference marking signal and sending the conference marking signal to the data processing equipment, wherein the conference marking signal comprises an identity;

the voice acquisition equipment is in communication connection with the data processing equipment and is used for acquiring audio signals and sending the audio signals to the data processing equipment;

the data processing device is configured to receive the conference marking signal and the audio signal, and generate a conference record based on an identity in at least one of the conference marking signals and the audio signal corresponding to each of the conference marking signals.

In a second aspect, an embodiment of the present invention further provides a method for generating a conference record, where the method includes:

acquiring a conference recording signal generated by control equipment, wherein the conference recording signal comprises an identity;

receiving the audio signal acquired by the voice acquisition equipment, and setting the acquired audio signal based on the identity in the conference recording signal to obtain the audio signal configured with the identity;

and generating a conference record based on the audio signal containing the identity mark.

In a third aspect, an embodiment of the present invention further provides a conference record generating apparatus, including:

a signal acquisition module, used for acquiring a conference recording signal generated by control equipment, wherein the conference recording signal comprises an identity;

the audio signal generation module is used for receiving the audio signal acquired by the voice acquisition equipment and setting the acquired audio signal based on the identity in the conference recording signal to obtain the audio signal configured with the identity;

and the conference record generating module is used for generating a conference record based on the audio signal containing the identity identification.

In a fourth aspect, the present invention further provides a storage medium containing computer-executable instructions, which when executed by a computer processor, are configured to perform the conference record generating method according to any one of the embodiments of the present invention.

The invention generates a conference marking signal containing an identity through the control equipment and sends the conference marking signal to the data processing equipment; the voice acquisition equipment acquires an audio signal and sends the audio signal to the data processing equipment; and the data processing equipment generates a conference record by using the identification in the received at least one conference marking signal and the audio signal corresponding to each conference marking signal. By the technical scheme, the relation between the audio information and the identity is established, so that the text information in the generated conference record has the identity, the process of manually summarizing the conference record is reduced, and the generation efficiency of the conference record is improved.

Drawings

In order to more clearly illustrate the technical solutions of the exemplary embodiments of the present invention, a brief description is given below of the drawings used in describing the embodiments. It should be clear that the described figures are only views of some of the embodiments of the invention to be described, not all, and that for a person skilled in the art, other figures can be derived from these figures without inventive effort.

Fig. 1 is a schematic structural diagram of a conference record generating system according to a first embodiment of the present invention;

Fig. 2 is a schematic structural diagram of a conference record generating system according to a second embodiment of the present invention;

Fig. 3 is a schematic flowchart of a method for generating a conference record according to a third embodiment of the present invention;

Fig. 4 is a schematic structural diagram of a conference record generating apparatus according to a fourth embodiment of the present invention.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the invention and are not limiting of the invention.

It should be further noted that, for the convenience of description, only some but not all of the relevant aspects of the present invention are shown in the drawings. Before discussing exemplary embodiments in more detail, it should be noted that some exemplary embodiments are described as processes or methods depicted as flowcharts. Although a flowchart may describe the operations (or steps) as a sequential process, many of the operations can be performed in parallel, concurrently or simultaneously. In addition, the order of the operations may be re-arranged. The process may be terminated when its operations are completed, but may have additional steps not included in the figure. The processes may correspond to methods, functions, procedures, subroutines, and the like.

Example one

Fig. 1 is a schematic structural diagram of a conference record generating system according to a first embodiment of the present invention. The embodiment is applicable to the situation in which a speaker's voice is collected during a conference and converted into a conference record. The system may execute the method for generating a conference record provided by the embodiments of the present application, and may be implemented by software and/or hardware. Fig. 1 is only an example; in other embodiments, the number of voice acquisition devices is not limited. The system comprises: a control device 110, a data processing device 120 and a voice acquisition device 130.

The control device 110 is in communication connection with the data processing device 120, and is configured to generate a conference marking signal, and send the conference marking signal to the data processing device 120, where the conference marking signal includes an identity; the voice acquisition device 130 is in communication connection with the data processing device 120, and is configured to acquire an audio signal and send the audio signal to the data processing device 120; the data processing device 120 is configured to receive the conference marking signal and the audio signal, and generate a conference record based on the identity in at least one conference marking signal and the audio signal corresponding to each conference marking signal.

In this embodiment, the control device 110 includes, but is not limited to, a remote control device, a mobile phone, a computer, or any other device that can generate and transmit the conference marking signal. The conference marking signal may be a signal for identifying conference participants or departments, and includes, but is not limited to, information such as an identity. In some embodiments, the identity may be a department name or a participant name, such as a quality control department, a technical department, or a security department. In another embodiment, the identity may be a serial number such as 1, 2, or 3. Illustratively, when the control device 110 is a remote controller that has only numeric keys, the identity in the conference marking signal is a serial number.
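As a minimal sketch of what such a signal might carry, the snippet below models a conference marking signal as a small record. The class name `ConferenceMarkingSignal`, the `timestamp` field and the helper `signal_from_numeric_key` are hypothetical names chosen for illustration; the source only states that the signal contains an identity, which may be a name or a serial number.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class ConferenceMarkingSignal:
    """Hypothetical in-memory form of a conference marking signal."""
    identity: Union[str, int]  # department/participant name, or a serial number
    timestamp: float           # seconds since recording started (assumed field)


def signal_from_numeric_key(key: int, timestamp: float) -> ConferenceMarkingSignal:
    """A remote controller with only numeric keys can emit serial numbers only."""
    if key < 0:
        raise ValueError("key number must be non-negative")
    return ConferenceMarkingSignal(identity=key, timestamp=timestamp)


# A device with named keys could send the department name directly.
named = ConferenceMarkingSignal(identity="technical department", timestamp=12.5)
# A numeric-only remote controller sends a serial number instead.
numbered = signal_from_numeric_key(2, timestamp=40.0)
```

Making the record immutable is a design choice of this sketch: once emitted, a signal is treated as a fixed event.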

The voice capturing device 130 refers to a device for capturing voice, and may be used to capture audio signals of the participants in the conference. The voice capture device 130 may include, but is not limited to, a wired microphone and a wireless microphone. The audio signal collected by the voice collecting apparatus 130 refers to speech information, i.e., voice information, of conference participants. In some embodiments, the voice capture device 130 may be one. Illustratively, conference participants take turns using one voice capture device 130. In some embodiments, the voice capturing device 130 may be plural. Illustratively, one voice capture device 130 is installed at each conference participant's location.

The data processing device 120 refers to a device for processing data. It may receive the conference marking signal sent by the control device 110 and the audio signal sent by the voice acquisition device 130, and it may be one device or a plurality of devices. In some embodiments, the data processing device 120 may be a host, and the host generates a conference record by using the identity in at least one conference marking signal and the audio signal corresponding to each conference marking signal. In some embodiments, the data processing device 120 may include, but is not limited to, a host, a signal processor, and a voice recognition server, which together use the identity in the at least one conference marking signal and the audio signal corresponding to each conference marking signal to generate the conference record.

It should be noted that the communication connection manner includes, but is not limited to, a wired connection and a wireless connection, and the communication connection manner is not particularly limited in this embodiment. The wired connection may be a communication connection through optical fibers, network cables, or the like, and the wireless connection may be a communication connection through a WiFi technology, a mobile communication technology, an infrared technology, or the like.

For example, when the control device 110 is a remote controller, the remote controller and the data processing device 120 may be communicatively connected by an infrared technology, or when the control device 110 is a computer, the computer and the data processing device 120 may be communicatively connected by a network cable.

On the basis of the above embodiment, there is at least one voice acquisition device 130; the control device 110 is further configured to determine, based on the identity identifier in the conference marking signal, a voice collecting device 130 for collecting the audio signal, generate a collecting trigger signal, and send the collecting trigger signal to the corresponding voice collecting device 130; the voice capture device 130 receives the capture trigger signal and initiates capture of the audio signal in response to the capture trigger signal.

The identity in the conference marking signal and the voice capture device 130 may have a preset corresponding relationship.

For example, if the identifier in the conference marking signal is a technical department, a microphone used by the technical department may be searched for by the identifier in a preset correspondence table. While determining the microphone corresponding to the technical department, the control device 110 generates an acquisition trigger signal for triggering the microphone of the technical department person to start to acquire the speaking content of the technical department person.
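The lookup and trigger described above can be sketched as follows. This is an illustrative sketch only: the table contents, the device ids such as `mic-01`, and the names `Microphone` and `trigger_capture` are assumptions, not part of the source.

```python
from dataclasses import dataclass
from typing import Dict

# Hypothetical preset correspondence table: identity -> microphone id.
MICROPHONE_BY_IDENTITY: Dict[str, str] = {
    "technical department": "mic-01",
    "quality control department": "mic-02",
}


@dataclass
class Microphone:
    device_id: str
    capturing: bool = False

    def on_trigger(self) -> None:
        """Start capturing audio in response to an acquisition trigger signal."""
        self.capturing = True


def trigger_capture(identity: str, microphones: Dict[str, Microphone]) -> Microphone:
    """Find the microphone preset for an identity and send it a trigger."""
    device_id = MICROPHONE_BY_IDENTITY[identity]  # KeyError if identity is unknown
    mic = microphones[device_id]
    mic.on_trigger()
    return mic


mics = {d: Microphone(d) for d in MICROPHONE_BY_IDENTITY.values()}
started = trigger_capture("technical department", mics)
```

Only the microphone mapped to the selected identity starts capturing; the others stay idle.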

On the basis of the above embodiment, the control device 110 is further configured to generate a conference control signal, where the conference control signal includes a recording start signal, a recording pause signal, and a recording stop signal; the data processing device 120 is further configured to receive the conference control signal transmitted by the control device 110, and is responsive to the conference control signal to control a processing state of the received audio signal.

The recording start signal may be used to control the data processing apparatus 120 to start a voice receiving function, and prepare to receive an audio signal. The record pause signal may be used to control the data processing device 120 to pause converting the received audio signal into a conference recording.

For example, if the meeting recorder considers that the voice content in a part of the time period in the meeting does not need to be recorded in the meeting record, the control device 110 may be operated to generate a record pause signal, and at this time, the data processing device 120 may pause converting the voice information into a text, so as to reduce the work of the meeting recorder for sorting the invalid text information.

The recording stop signal may be used to control the data processing device 120 to stop converting the received audio signal into a conference record. It is understood that when the data processing device 120 responds to the recording stop signal, the conference recording is complete; if conference recording is to be performed again, the control device 110 needs to generate the recording start signal again.
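One way to picture how the three control signals drive the processing state is a small state machine. The sketch below is an assumption-laden illustration: the state names, the transition table, and in particular resuming from a pause with a second start signal are not specified in the source.

```python
from enum import Enum


class ControlSignal(Enum):
    START = "recording start"
    PAUSE = "recording pause"
    STOP = "recording stop"


class ProcessingState(Enum):
    IDLE = "idle"
    RECORDING = "recording"
    PAUSED = "paused"


# Assumed transition table; pairs that are absent leave the state unchanged.
TRANSITIONS = {
    (ProcessingState.IDLE, ControlSignal.START): ProcessingState.RECORDING,
    (ProcessingState.RECORDING, ControlSignal.PAUSE): ProcessingState.PAUSED,
    (ProcessingState.PAUSED, ControlSignal.START): ProcessingState.RECORDING,
    (ProcessingState.RECORDING, ControlSignal.STOP): ProcessingState.IDLE,
    (ProcessingState.PAUSED, ControlSignal.STOP): ProcessingState.IDLE,
}


def next_state(state: ProcessingState, signal: ControlSignal) -> ProcessingState:
    """Return the processing state after a conference control signal arrives."""
    return TRANSITIONS.get((state, signal), state)


def should_transcribe(state: ProcessingState) -> bool:
    """Audio is converted into the record only while recording is active."""
    return state is ProcessingState.RECORDING
```

After a stop signal the state returns to idle, so a new start signal is needed before any further audio is converted.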

On the basis of the above embodiment, the control device 110 is configured with a control key and a mark key; alternatively, the control device 110 is configured with control keys and information input keys, or the control device 110 is configured with a touch display screen.

Specifically, in some embodiments, the control device 110 is configured with control keys and a mark key, wherein the control keys include, but are not limited to, a start key, a pause key and a stop key, and the mark key may be a key with a name identifier, such as a technical department key, a quality inspection department key, and the like. In another embodiment, the control device 110 is configured with a control key and an information input key, wherein the information input key may be a numeric keypad or an alphabetic keypad, and the information input key generates the meeting marking signal containing the identity through a numeric key, an alphabetic key, or a combination of the numeric key and the alphabetic key.

In some embodiments, the control device 110 is configured with a touch display screen, where the touch display screen may include a plurality of virtual keys, and functions of the virtual keys may be set according to user requirements. Illustratively, in the conference process, an environmental protection department is newly added for speaking, the department is a newly established department of a company, and conference recording personnel can newly add a virtual key corresponding to the environmental protection department in a touch display screen for conference recording.

According to the technical scheme provided by the embodiment of the invention, the conference marking signal containing the identity is generated by the control equipment, and the conference marking signal is sent to the data processing equipment; the voice acquisition equipment acquires an audio signal and sends the audio signal to the data processing equipment; and the data processing equipment generates a conference record by using the identification in the received at least one conference marking signal and the audio signal corresponding to each conference marking signal. By the technical scheme, the relation between the audio information and the identity is established, so that the text information in the generated conference record has the identity, the process of manually summarizing the conference record is reduced, and the generation efficiency of the conference record is improved.

Example two

Fig. 2 is a schematic structural diagram of another conference record generating system according to a second embodiment of the present invention. Fig. 2 is merely an example; in other embodiments, the number of signal processing devices and the number of voice recognition servers are not limited. Optionally, the data processing device 120 includes a signal processing device and a voice recognition server. It should be noted that this structure of the data processing device 120 is only an example: in some embodiments, a plurality of signal processing devices may correspond to one voice recognition server, or each signal processing device may correspond to its own voice recognition server.

The signal processing device is used for marking the audio signal corresponding to the conference marking signal based on the identification in the conference marking signal and generating the audio signal with the identification, wherein the audio signal corresponding to the conference marking signal is the audio signal collected between the timestamp of the conference marking signal and the timestamp of the next conference marking signal.

Specifically, the signal processing device marks the audio signal corresponding to the conference marker signal with an identity identifier in the received conference marker signal, where the marking mode may be to add the identity identifier to the content information of the audio signal, and the marking mode may also be to add the identity identifier to a file name corresponding to the audio signal, which is not limited in this embodiment.
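The timestamp rule above, where audio belongs to a marking signal until the next marking signal arrives, can be sketched with a sorted lookup. The tuple layout and the function names `identity_at` and `label_chunks` are hypothetical, and the choice to assign audio captured exactly at a marker's timestamp to that marker is an assumption of this sketch.

```python
from bisect import bisect_right
from typing import List, Optional, Tuple

Mark = Tuple[float, str]  # (timestamp in seconds, identity)


def identity_at(marks: List[Mark], t: float) -> Optional[str]:
    """Return the identity of the latest marking signal at or before time t.

    `marks` must be sorted by timestamp. Audio captured before the first
    marking signal has no identity and yields None.
    """
    times = [ts for ts, _ in marks]
    i = bisect_right(times, t)
    return marks[i - 1][1] if i > 0 else None


def label_chunks(marks: List[Mark],
                 chunk_times: List[float]) -> List[Tuple[float, Optional[str]]]:
    """Attach an identity to each audio chunk, given its capture time."""
    ordered = sorted(marks)
    return [(t, identity_at(ordered, t)) for t in chunk_times]


marks = [(0.0, "technical department"), (30.0, "quality control department")]
labelled = label_chunks(marks, [5.0, 29.9, 30.0, 45.0])
```

Here the chunks at 5.0 s and 29.9 s fall to the technical department, and those at 30.0 s and 45.0 s fall to the quality control department.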

In some embodiments, the signal processing device may include, but is not limited to, a host and an audio processor. The audio processor is in communication connection with the voice acquisition device 130, and can be used for receiving audio signals and performing noise reduction, echo cancellation, sound mixing and other operations on them, so that the audio signals are clearer and their quality is improved. The host may be in communication connection with the control device 110, and may receive the conference marking signal and mark the audio signal corresponding to the conference marking signal with the identity in the received conference marking signal.

The voice recognition server is in communication connection with the signal processing device and is used for performing text conversion on the audio signal with the identity identification to generate a conference record.

The voice recognition server may be located in a local server or a cloud server, which is not limited in this embodiment. In some embodiments, the speech recognition server may convert the audio signal to textual information via natural language processing techniques and generate a meeting record from the textual information.

On the basis of the above embodiment, the identity includes identity information and general identification information; the data processing apparatus is to: adding identity information in the conference marking signal at a preset position of an audio signal corresponding to the conference marking signal; or, based on the corresponding relationship between the general identification information and the identity information, the identity information corresponding to the general identification information is determined, and the identity information in the conference marker signal is added at the preset position of the audio signal corresponding to the conference marker signal.

The identity information may be a department name or a participant name, such as a quality inspection department, a technical department, a security department, or the like. The generic identification information may be a serial number identification such as 1, 2, 3, etc.

Specifically, in some embodiments, the identity information in the conference marking signal is added at a preset position of the audio signal corresponding to the conference marking signal, where the preset position may be the start position, the end position, or another markable position of the audio content in the audio signal; this embodiment does not limit this. In some embodiments, the identity information in the conference marking signal may instead be added to the stored file name of the audio signal. Since there may be multiple audio signals with the same identity information, the file name may be composed of the identity information together with the timestamp in the conference marking signal.

In some embodiments, the identity information corresponding to the general identification information may be called in a preset mapping relationship table, and then the identity information in the conference marker signal may be added at a preset position of the audio signal corresponding to the conference marker signal. Illustratively, the conference marking signal containing the general identification information is generated by pressing a number key on the remote controller, and the general identification information is matched in a preset mapping relation table to obtain the identity information of the conference marking signal.
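A minimal sketch of the mapping table lookup, combined with the file-name style of marking, might look like this. The table contents, the `.wav` extension, the timestamp format and the function names are assumptions made for illustration only.

```python
from datetime import datetime
from typing import Dict, Union

# Hypothetical preset mapping table: general identification -> identity information.
IDENTITY_BY_SERIAL: Dict[int, str] = {
    1: "quality inspection department",
    2: "technical department",
    3: "security department",
}


def resolve_identity(identifier: Union[int, str]) -> str:
    """Map a serial number to identity information; pass names through as-is."""
    if isinstance(identifier, int):
        return IDENTITY_BY_SERIAL[identifier]  # KeyError if the number is not preset
    return identifier


def audio_file_name(identifier: Union[int, str], marked_at: datetime) -> str:
    """Build a file name from identity information plus the signal timestamp.

    The timestamp keeps names unique when one identity speaks several times.
    """
    identity = resolve_identity(identifier).replace(" ", "_")
    stamp = marked_at.strftime("%Y%m%dT%H%M%S")
    return f"{identity}_{stamp}.wav"


name = audio_file_name(2, datetime(2021, 8, 18, 9, 30, 0))
```

With this scheme, pressing key 2 on a numeric remote controller yields a file name that already carries the department name.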

On the basis of the foregoing embodiment, the speech recognition server is further configured to: extracting key information from the text information obtained by text conversion to obtain key information corresponding to each identity; and inputting the key information corresponding to each identity into a preset conference record template to obtain a conference record.

The key information may be information that is of interest to the user, such as meeting issues, implementation policies, completion time, or supervising departments. The preset conference recording template may be a conference recording template specially made by a conference recorder, or a conference recording template generated according to key information corresponding to each identity, which is not limited in the present application.

In some embodiments, the key information of the text information may be extracted through a preset information extraction model, specifically, the preset information extraction model is a machine learning model, and the text information obtained by text conversion is input to the preset information extraction model to obtain the key information corresponding to the identity.
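To make the extract-then-fill flow concrete, the sketch below substitutes a simple cue-word filter for the machine learning model and fills a small text template. The cue words, the template layout and the function names are all assumptions; the filter is a stand-in and not the extraction model the embodiment describes.

```python
from string import Template
from typing import Dict, List

# Assumed cue words; the embodiment uses a machine learning model instead.
CUE_WORDS = ("deadline", "complete", "responsible")


def extract_key_information(text: str) -> List[str]:
    """Keep sentences that contain a cue word (a stand-in for the model)."""
    sentences = [s.strip() for s in text.split(".") if s.strip()]
    return [s for s in sentences if any(w in s.lower() for w in CUE_WORDS)]


# Hypothetical conference record template: one entry per identity.
ENTRY_TEMPLATE = Template("[$identity]\n$items")


def render_record(transcripts: Dict[str, str]) -> str:
    """Fill the template with the key information found for each identity."""
    entries = []
    for identity, text in transcripts.items():
        items = "\n".join(f"- {s}" for s in extract_key_information(text))
        entries.append(ENTRY_TEMPLATE.substitute(identity=identity, items=items))
    return "\n\n".join(entries)


record = render_record(
    {"technical department": "We had lunch. The deadline is Friday."}
)
```

Because extraction runs per identity, each department's key points land under its own heading in the rendered record.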

Example three

Fig. 3 is a schematic flowchart of a method for generating a conference record according to a third embodiment of the present invention. The method can be executed by the conference record generation system provided by the embodiments of the invention. As shown in Fig. 3, the conference record generation method of this embodiment may specifically include the following steps:

S310, acquiring a conference recording signal generated by the control device, wherein the conference recording signal comprises an identity.

S320, receiving the audio signal collected by the voice acquisition device, and setting the collected audio signal based on the identity in the conference recording signal to obtain the audio signal configured with the identity.

S330, generating a conference record based on the audio signal containing the identity.
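The three steps above can be sketched end to end as follows. This is a hedged illustration: the dictionary form of the signal, the `TaggedAudio` class and the injected `transcribe` callable are assumptions, and `fake_transcribe` merely decodes bytes in place of real speech recognition.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class TaggedAudio:
    identity: str  # identity taken from the signal
    audio: bytes   # raw audio payload (the format is an assumption)


def acquire_identity(signal: dict) -> str:
    """First step: read the identity carried by the signal."""
    return signal["identity"]


def tag_audio(signal: dict, audio: bytes) -> TaggedAudio:
    """Second step: configure the collected audio with the identity."""
    return TaggedAudio(identity=acquire_identity(signal), audio=audio)


def generate_record(tagged: Iterable[TaggedAudio],
                    transcribe: Callable[[bytes], str]) -> List[str]:
    """Third step: convert each tagged audio signal into a labelled line."""
    return [f"{item.identity}: {transcribe(item.audio)}" for item in tagged]


def fake_transcribe(audio: bytes) -> str:
    """Stand-in for speech recognition: decodes bytes instead of real audio."""
    return audio.decode("utf-8")


tagged = [tag_audio({"identity": "technical department"}, b"design review done")]
record = generate_record(tagged, fake_transcribe)
```

Passing the recognizer in as a parameter keeps the sketch independent of any particular speech recognition service.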

On the basis of the foregoing embodiment, the receiving the audio signal acquired by the voice acquisition device, and setting the acquired audio signal based on the identity identifier in the conference recording signal to obtain the audio signal configured with the identity identifier includes:

and marking the audio signal corresponding to the conference marking signal based on the identity in the conference marking signal, and generating the audio signal with the identity, wherein the audio signal corresponding to the conference marking signal is the audio signal collected between the timestamp of the conference marking signal and the timestamp of the next conference marking signal.

On the basis of the above embodiment, the identity includes identity information and general identification information; the receiving the audio signal acquired by the voice acquisition device, and setting the acquired audio signal based on the identity in the conference recording signal to obtain the audio signal configured with the identity, further includes:

adding identity information in the conference marking signal at a preset position of an audio signal corresponding to the conference marking signal; or, based on the corresponding relationship between the general identification information and the identity information, the identity information corresponding to the general identification information is determined, and the identity information in the conference marker signal is added at the preset position of the audio signal corresponding to the conference marker signal.

On the basis of the above embodiment, the generating a conference record based on the audio signal containing the identity includes:

performing text conversion on the audio signal with the identity mark;

extracting key information from the text information obtained by text conversion to obtain key information corresponding to each identity;

and inputting the key information corresponding to each identity into a preset conference record template to obtain a conference record.

On the basis of the above embodiment, the voice collecting device is at least one, and the method further includes:

determining voice acquisition equipment for acquiring audio signals based on the identity in the conference marking signal, generating acquisition trigger signals, and sending the acquisition trigger signals to corresponding voice acquisition equipment;

and the voice acquisition equipment receives the acquisition trigger signal and starts to acquire an audio signal in response to the acquisition trigger signal.

On the basis of the above embodiment, the method further includes:

generating a conference control signal, wherein the conference control signal comprises a recording start signal, a recording pause signal and a recording stop signal;

and receiving a conference control signal sent by the control equipment, and responding to the conference control signal to control the processing state of the received audio signal.

The embodiment of the invention provides a method for generating a conference record, which comprises the steps of generating a conference marking signal containing an identity through a control device, and sending the conference marking signal to a data processing device; the voice acquisition equipment acquires an audio signal and sends the audio signal to the data processing equipment; and the data processing equipment generates a conference record by using the identification in the received at least one conference marking signal and the audio signal corresponding to each conference marking signal. By the technical scheme, the relation between the audio information and the identity is established, so that the text information in the generated conference record has the identity, the process of manually summarizing the conference record is reduced, and the generation efficiency of the conference record is improved.

Example four

Fig. 4 is a schematic structural diagram of a conference record generating device in a fourth embodiment of the present application, where the conference record generating device provided in this embodiment may be implemented by software and/or hardware, and may be configured in a conference record generating system provided in any embodiment of the present invention. Specifically, as shown in fig. 4, the apparatus may specifically include: a signal acquisition module 410, an audio signal generation module 420, and a conference recording generation module 430.

The signal obtaining module 410 is configured to obtain a conference recording signal generated by a control device, where the conference recording signal includes an identity; an audio signal generating module 420, configured to receive an audio signal acquired by the voice acquisition device, and set the acquired audio signal based on an identity identifier in the conference recording signal to obtain an audio signal configured with the identity identifier; and a conference record generating module 430, configured to generate a conference record based on the audio signal containing the identity.

The embodiment of the invention provides a device for generating a conference record, which generates a conference marking signal containing an identity through a control device and sends the conference marking signal to a data processing device; the voice acquisition equipment acquires an audio signal and sends the audio signal to the data processing equipment; and the data processing equipment generates a conference record by using the identification in the received at least one conference marking signal and the audio signal corresponding to each conference marking signal. By the technical scheme, the relation between the audio information and the identity is established, so that the text information in the generated conference record has the identity, the process of manually summarizing the conference record is reduced, and the generation efficiency of the conference record is improved.

On the basis of any optional technical solution in the embodiment of the present invention, optionally, the audio signal generating module 420 is configured to:

On the basis of any optional technical solution in the embodiment of the present invention, optionally, the identity includes identity information and general identification information; the audio signal generation module 420 is configured to:

On the basis of any optional technical solution in the embodiment of the present invention, optionally, the conference record generating module 430 is configured to:

performing text conversion on the audio signal with the identity mark;

On the basis of any optional technical scheme in the embodiment of the present invention, optionally, at least one voice acquisition device is provided, and the apparatus may further be configured to:

On the basis of any optional technical solution in the embodiment of the present invention, optionally, the apparatus may further be configured to:

The conference record generation device provided by the embodiment of the invention can execute the conference record generation method provided by any embodiment of the invention, and has corresponding functional modules and beneficial effects of the execution method.
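
The optional handling of general identification information can be sketched as follows. The sketch assumes the correspondence is a simple lookup table, that any identifier absent from the table is already identity information, and that the preset position is the start of the audio payload, encoded as a 2-byte big-endian length followed by UTF-8 text. All names (resolve_identity, tag_audio, read_tag) and the byte layout are hypothetical.

```python
import struct
from typing import Dict, Tuple


def resolve_identity(identifier: str, correspondence: Dict[str, str]) -> str:
    """Map general identification information to identity information.

    Identifiers not found in the table are assumed to be identity
    information already and are returned unchanged.
    """
    return correspondence.get(identifier, identifier)


def tag_audio(audio: bytes, identity: str) -> bytes:
    """Prepend the identity to the audio payload (assumed preset position)."""
    encoded = identity.encode("utf-8")
    # 2-byte unsigned length prefix, so the identity is limited to 65535 bytes.
    return struct.pack(">H", len(encoded)) + encoded + audio


def read_tag(tagged: bytes) -> Tuple[str, bytes]:
    """Split a tagged payload back into (identity, audio)."""
    (length,) = struct.unpack_from(">H", tagged, 0)
    identity = tagged[2:2 + length].decode("utf-8")
    return identity, tagged[2 + length:]
```

For example, a table such as {"seat-3": "Wang Lei"} would let a control device send only a seat label while the record still shows the participant's name.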

Embodiment Five

An embodiment of the present invention further provides a storage medium containing computer-executable instructions which, when executed by a computer processor, perform a conference record generation method, the method including:

Computer storage media for embodiments of the invention may employ any combination of one or more computer-readable media. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.

A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.

Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.

Computer program code for carrying out operations for embodiments of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).

It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the present invention has been described in greater detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.

Claims

1. A conference record generation system, comprising: a control device, a data processing device and a voice acquisition device, wherein,

2. The system of claim 1, wherein the data processing device comprises: a signal processing device and a voice recognition server, wherein,

the signal processing device is configured to label, based on the identity in the conference marking signal, the audio signal corresponding to the conference marking signal, and to generate an audio signal carrying the identity, where the audio signal corresponding to the conference marking signal is the audio signal acquired between the timestamp of the conference marking signal and the timestamp of the next conference marking signal;

3. The system of claim 2, wherein the identity comprises identity information and general identification information;

the data processing device is configured to: add the identity information in the conference marking signal at a preset position of the audio signal corresponding to the conference marking signal;

or,

determine the identity information corresponding to the general identification information based on the correspondence between the general identification information and the identity information, and add the determined identity information at the preset position of the audio signal corresponding to the conference marking signal.

4. The system of claim 2, wherein the speech recognition server is further configured to:

5. The system of claim 1, wherein there is at least one voice acquisition device;

the control device is further configured to determine, based on the identity in the conference marking signal, the voice acquisition device that is to acquire the audio signal, to generate an acquisition trigger signal, and to send the acquisition trigger signal to the corresponding voice acquisition device;

6. The system of claim 1, wherein the control device is further configured to generate conference control signals, wherein the conference control signals include a recording start signal, a recording pause signal, and a recording stop signal;

the data processing device is further configured to receive the conference control signal sent by the control device and to respond to the conference control signal so as to control the processing state of the received audio signal.

7. The system according to any one of claims 1 to 6,

the control device is provided with a control key and a marking key; or,

the control device is provided with a control key and an information input key; or,

the control device is provided with a touch display screen.

8. A method for generating a conference record, comprising:

9. A conference record generating apparatus, comprising:

10. A storage medium containing computer-executable instructions which, when executed by a computer processor, perform the conference record generation method of claim 8.