TW201624467A

TW201624467A - Meeting minutes device and method thereof for automatically creating meeting minutes

Info

Publication number: TW201624467A
Application number: TW103146227A
Authority: TW
Inventors: 劉揚偉
Original assignee: 鴻海精密工業股份有限公司
Priority date: 2014-12-30
Filing date: 2014-12-30
Publication date: 2016-07-01
Also published as: TWI616868B; US20160189713A1

Abstract

A meeting minutes device and a method thereof for automatically creating meeting minutes are provided by this disclosure. The method for creating meeting minutes includes the following steps. Converting voice data from a meeting to characters. Determining whether the converted characters include calibration objects. If the converted characters include one or more calibration objects, correcting the one or more calibration objects to corresponding useful expressions based on a useful expression database. Creating an original meeting minutes according to the corrected characters and a meeting minutes template.

Description

Conference recording device and method for automatically generating conference record

本發明涉及一種會議記錄裝置及其自動生成會議記錄的方法。The present invention relates to a conference recording apparatus and a method thereof for automatically generating a conference record.

現有的會議中報告及記錄的方法,通常是利用攝像機、麥克風、錄音筆等設備對會議過程中各人員的發言進行錄音及錄影。會後做會議記錄的人員可以查看、重播錄音及錄影以整理會議記錄。然而，通過人工對語音資料進行標注和提取，對使用者來說，費時且極為不便。The methods of reporting and recording in existing conferences usually use video cameras, microphones, voice recorders and other equipment to record and record the speeches of each person during the conference. Those who do the meeting minutes after the meeting can view and replay the recordings and videos to organize the meeting minutes. However, manually tagging and extracting voice data is time consuming and extremely inconvenient for the user.

鑒於此，有必要提供一種會議記錄裝置及自動生成會議記錄的方法，能夠自動生成會議記錄，以解決上述問題。In view of this, it is necessary to provide a conference recording device and a method for automatically generating a conference record, which can automatically generate a conference record to solve the above problem.

本發明提供一種會議記錄裝置，包括記憶體和處理器。所述會議記錄裝置還包括由所述處理器控制的且存儲於所述記憶體中的如下模組：辨識模組：用於根據一會議上接收的語音對應的所述語音信號以及所述記憶體中存儲的使用者語音特徵表，辨識出所述語音信號對應的一或多個用戶；轉換模組：用於將所述語音信號轉換為包含所述一或多個使用者的用戶名的文字；及生成模組：用於根據轉換得到的所述文字以及所述記憶體中存儲的會議記錄範本生成一原始會議記錄。The invention provides a conference recording device comprising a memory and a processor. The conference recording device further includes a module controlled by the processor and stored in the memory: an identification module: the voice signal corresponding to the voice received on a conference and the memory a user voice feature table stored in the body, identifying one or more users corresponding to the voice signal; and a conversion module: configured to convert the voice signal into a user name including the one or more users And a generating module: configured to generate an original meeting record according to the converted text and the meeting record template stored in the memory.

本發明還提供一種自動生成會議記錄的方法，運行於包括記憶體和處理器的至少一裝置中。所述方法包括由所述處理器控制所述記憶體中存儲的模組執行的如下步驟：辨識步驟：根據一會議上接收的語音對應的語音信號以及所述記憶體中存儲的使用者語音特徵表，辨識出所述語音信號對應的一或多個用戶；轉換步驟：將所述語音信號轉換為包含所述一或多個使用者的用戶名的文字；及生成步驟：根據轉換得到的所述文字以及所述記憶體中存儲的會議記錄範本生成一原始會議記錄。The present invention also provides a method of automatically generating a meeting record, running in at least one device including a memory and a processor. The method includes the following steps performed by the processor to control a module stored in the memory: an identification step: according to a voice signal corresponding to a voice received at a conference and a user voice feature stored in the memory a table, identifying one or more users corresponding to the voice signal; converting step: converting the voice signal into a text including a username of the one or more users; and generating step: according to the converted The text and the conference record template stored in the memory generate an original conference record.

本發明所述的會議記錄裝置及其自動生成會議記錄的方法，可根據預設的會議記錄範本自動生成會議記錄，因而，相較於現有的方式更省時、方便及人性化。The conference recording device and the method for automatically generating the conference record according to the present invention can automatically generate a conference record according to a preset conference record template, thereby being more time-saving, convenient and user-friendly than the existing method.

圖1為本發明一實施方式的會議記錄裝置的應用環境示意圖。FIG. 1 is a schematic diagram of an application environment of a conference recording apparatus according to an embodiment of the present invention.

圖2為圖1所示的會議記錄裝置的一實施方式的功能模組圖。FIG. 2 is a functional block diagram of an embodiment of the conference recording apparatus shown in FIG. 1. FIG.

圖3為本發明一實施方式中，生成的原始會議記錄及編輯後的會議記錄的示意圖。FIG. 3 is a schematic diagram of an original conference record and an edited conference record generated in an embodiment of the present invention.

圖4-圖7分別為本發明不同實施方式的自動生成會議記錄的方法的步驟流程圖。4-7 are flow charts showing the steps of a method for automatically generating a conference record according to different embodiments of the present invention.

請參閱圖1，其為本發明的一實施方式的會議記錄裝置100的應用環境示意圖。本實施方式中，會議記錄裝置100可與一雲端裝置200相連接。其中，會議記錄裝置100處於各使用者1的附近，可接收各使用者1在會議或報告上的語音，即使用者1的發言。會議記錄裝置100和/或雲端裝置200具備根據會議記錄裝置100接收的語音自動生成會議記錄的功能。使用者1為會議或報告的參與者。為了描述方便，以下將會議或報告統一稱為會議。Please refer to FIG. 1 , which is a schematic diagram of an application environment of a conference recording apparatus 100 according to an embodiment of the present invention. In this embodiment, the conference recording device 100 can be connected to a cloud device 200. The conference recording device 100 is located in the vicinity of each user 1, and can receive the voice of each user 1 in the conference or report, that is, the speech of the user 1. The conference recording device 100 and/or the cloud device 200 has a function of automatically generating a conference record based on the voice received by the conference recording device 100. User 1 is a participant in a meeting or report. For convenience of description, the following conference or report is collectively referred to as a conference.

在一實施方式中，會議記錄裝置100具有自動生成會議記錄的功能，即，可以自行生成會議記錄。且會議記錄裝置100不依賴雲端裝置200，而自行根據其接收的語音自動生成會議記錄。當多個使用者1舉行會議或報告時，會議記錄裝置100可自動記錄各使用者1的語音，並自動將識別各使用者的語音，並將識別到的語音轉換為文字後，按照預設的會議記錄範本自動生成會議記錄，並按照預設的方式自動發送至相關人員。相關人員包括各使用者1和/或其他會議相關人員，例如待辦事項負責人、相關主管等人員。從而實現自動記錄、生成及發送會議記錄的功能。In an embodiment, the conference recording apparatus 100 has a function of automatically generating a conference record, that is, the conference record can be generated by itself. And the conference recording device 100 does not rely on the cloud device 200, but automatically generates a conference record according to the voice it receives. When a plurality of users 1 hold a conference or report, the conference recording device 100 can automatically record the voices of the users 1 and automatically recognize the voices of the users, and convert the recognized voices into texts, according to the preset. The meeting record template automatically generates the meeting record and automatically sends it to the relevant personnel according to the preset method. Relevant personnel include each user 1 and/or other meeting related personnel, such as the person in charge of the to-do list, the relevant supervisor, and the like. Thereby realizing the function of automatically recording, generating and transmitting meeting records.

為說明方便，本段落中的以下括弧中的文字為其前面的文字的簡化的功能說明。具體的請參見如下的說明。會議記錄裝置100可以自動辨識接收的語音的各相應的使用者（辨識語音中的使用者），然後將接收的語音轉換為包括辨識出的使用者的使用者名的文字，或者，將接收的語音自動轉換為文字（語音轉換為文字），然後從文字中識別出各使用者1的使用者名（辨識文字中的使用者）。之後根據上述從語音和/或文字中識別出的使用者名對文字進行段落劃分（根據文字劃分段落）之後，再根據預設的會議記錄範本自動生成會議記錄（生成會議記錄）。會議記錄裝置100還可以根據接收的語音自動識別其中的無聲片段（根據語音辨識無聲片段），根據識別出的無聲片段將語音劃分為多個語音片段（根據語音劃分段落），然後分別將該多個語音片段轉換為對應的文字（語音轉換為文字），再根據預設的會議記錄範本自動生成會議記錄（生成會議記錄）。會議記錄裝置100還可以自動辨識語音和/或文字資訊中多次重複出現的詞句，並存儲於常用語資料庫中，因而在生成會議記錄的過程中，可以自動將文字記錄中的詞句校對成常用的詞句。For convenience of explanation, the text in the following brackets in this paragraph is a simplified functional description of the text preceding it. See the instructions below for details. The conference recording device 100 can automatically recognize each corresponding user of the received voice (identifying the user in the voice), and then convert the received voice into a text including the recognized user's username, or will receive the received text. The voice is automatically converted into text (voice converted to text), and then the user name of each user 1 (the user in the recognized text) is recognized from the text. Then, according to the above-mentioned user name recognized from the voice and/or the text, the text is segmented (the paragraph is divided according to the text), and then the conference record is automatically generated according to the preset conference record template (generating the conference record). The conference recording device 100 can also automatically recognize the silent segment (according to the voice recognition silent segment) according to the received voice, divide the voice into a plurality of voice segments according to the recognized silent segment (divide the segment according to the voice), and then respectively The voice segments are converted into corresponding texts (voice converted to text), and the conference records are automatically generated according to the preset conference record template (generating conference records). The conference recording device 100 can also automatically recognize the repeated words and phrases in the voice and/or text information and store them in the common language database. Therefore, in the process of generating the conference record, the words in the text record can be automatically collated into Commonly used words.

在另一實施方式中，會議記錄裝置100可以與雲端裝置200進行資料通讯，從而由會議記錄裝置100和雲端裝置200一起或由雲端裝置200單獨根據會議記錄裝置100接收的語音自動生成會議記錄。因而，本發明還可以是由會議記錄裝置100對會議進行錄音，並將所錄的語音轉換為語音信號，將轉換的到的語音信號和/或其他資料（例如根據語音信號轉換得到的文字等）傳輸至至雲端裝置200，而由會議記錄裝置100和/或及雲端裝置200分別執行在上一實施方式中全部由會議記錄裝置100執行的以下功能中的全部或一部分：語音轉換為文字、辨識語音和/或文字中的使用者、根據語音和/或文字識別無聲片段、根據語音和/或文字中劃分段落、生成會議記錄、辨識語音和/或文字中的常用詞句、存儲常用詞句於常用語資料庫，以及根據常用詞句自動校對/編輯文字或會議記錄。In another embodiment, the conference recording device 100 can perform data communication with the cloud device 200, so that the conference record is automatically generated by the conference recording device 100 and the cloud device 200 or by the cloud device 200 alone according to the voice received by the conference recording device 100. Therefore, the present invention may also be that the conference recording device 100 records the conference, converts the recorded voice into a voice signal, and converts the converted voice signal and/or other materials (for example, text converted according to the voice signal). Transferring to the cloud device 200, and the conference recording device 100 and/or the cloud device 200 respectively perform all or a part of the following functions performed by the conference recording device 100 in the previous embodiment: voice conversion to text, Identify users in speech and/or text, identify silent segments based on speech and/or text, segment paragraphs based on speech and/or text, generate meeting notes, recognize common words in speech and/or text, store common words and phrases in A common language database, as well as automatic proofreading/editing of text or meeting minutes based on common phrases.

請參閱圖2，其為本發明一實施方式的。需要說明的是，圖2所示僅僅是本發明的一實施方式中的會議記錄裝置100的功能模組圖，對應以上所描述的實現本發明的各實施方式，會議記錄裝置100還可以是只包括圖2中示出的一部分的功能單元/模組。而雲端裝置200則可以包括圖2所示的其他功能單元/模組。例如，在單獨由雲端裝置200執行自動生成會議記錄的功能的實施方式中，會議記錄裝置100可以包括圖2所示的語音輸入單元20、通讯單元40、處理器60，雲端裝置200可以包括相應的通讯單元、處理器以及記憶體10中存儲的模組12-19。以下在需要時將作相應的描述。Please refer to FIG. 2, which is an embodiment of the present invention. It should be noted that FIG. 2 is only a functional module diagram of the conference recording apparatus 100 in an embodiment of the present invention. The conference recording apparatus 100 may also be only corresponding to the embodiments for implementing the present invention described above. A functional unit/module including a portion shown in FIG. The cloud device 200 may include other functional units/modules as shown in FIG. 2. For example, in an embodiment in which the function of automatically generating the conference record is performed by the cloud device 200 alone, the conference recording device 100 may include the voice input unit 20, the communication unit 40, and the processor 60 shown in FIG. 2, and the cloud device 200 may include corresponding The communication unit, the processor, and the modules 12-19 stored in the memory 10. The following description will be made as needed.

本實施方式中，會議記錄裝置100包括一記憶體10、語音輸入單元20、觸控式螢幕30、通讯單元40、定位模組50和處理器60。記憶體10、語音輸入單元20、觸控式螢幕30、通讯單元40通過信號線和資料線分別連接於處理器60。會議記錄裝置100為一智慧手機，在其他實施方式中，會議記錄裝置100還可以是平板電腦、筆記型電腦、臺式電腦以及會議電話等裝置。In the present embodiment, the conference recording device 100 includes a memory 10, a voice input unit 20, a touch screen 30, a communication unit 40, a positioning module 50, and a processor 60. The memory 10, the voice input unit 20, the touch screen 30, and the communication unit 40 are respectively connected to the processor 60 through signal lines and data lines. The conference recording device 100 is a smart phone. In other embodiments, the conference recording device 100 can also be a device such as a tablet computer, a notebook computer, a desktop computer, and a conference phone.

本實施方式中，會議記錄裝置100可獨立自動生成會議記錄。會議記錄裝置100自動根據其語音輸入單元20所接收到的參加會議的使用者1的語音，將接收的語音轉換為文字，之後再根據預設的會議記錄範本自動生成一會議記錄。具體的，會議記錄裝置100可以執行前述的將接收的語音自動轉換為文字、自動辨識接收的語音或轉換後的文字中的使用者、根據辨識出的使用者名對文字進行段落劃分，再根據預設的會議記錄範本自動生成會議記錄。會議記錄裝置100還可以根據接收的語音自動識別其中的無聲片段，根據識別出的無聲片段將語音劃分為多個語音片段，然後分別將該多個語音片段轉換為對應的文字，再根據預設的會議記錄範本自動生成會議記錄。會議記錄裝置100還可以自動辨識語音和/或文字資訊中多次重複出現的詞句，並存儲於常用語資料庫中，因而在生成會議記錄的過程中，可以自動將文字記錄中的詞句校對成常用的詞句。會議記錄裝置100還可以將生成的會議記錄和/或待辦事項根據預設方式自動發送至相關人員的通訊位址。其中，該預設方式包括預設的發送格式、預設的發送時間等等。相關人員的通訊位址至少包括以下中的一種：電子郵寄地址、電話號碼、社交帳號（例如QQ號碼、微信帳號）等等。In the present embodiment, the conference recording apparatus 100 can automatically generate a conference record independently. The conference recording device 100 automatically converts the received voice into text according to the voice of the user 1 participating in the conference received by the voice input unit 20, and then automatically generates a conference record according to the preset conference record template. Specifically, the conference recording device 100 can perform the foregoing method of automatically converting the received voice into a text, automatically recognizing the received voice or the converted text, segmenting the text according to the recognized user name, and then The preset meeting record template automatically generates the meeting record. The conference recording device 100 can also automatically recognize the silent segment according to the received voice, divide the voice into a plurality of voice segments according to the recognized silent segment, and then respectively convert the plurality of voice segments into corresponding texts, and then according to the preset. The meeting record template automatically generates meeting minutes. The conference recording device 100 can also automatically recognize the repeated words and phrases in the voice and/or text information and store them in the common language database. Therefore, in the process of generating the conference record, the words in the text record can be automatically collated into Commonly used words. The conference recording device 100 can also automatically send the generated conference record and/or to-do item to the communication address of the relevant person according to a preset manner. The preset mode includes a preset sending format, a preset sending time, and the like. The communication address of the relevant person includes at least one of the following: an electronic mailing address, a telephone number, a social account number (for example, a QQ number, a WeChat account number), and the like.

記憶體10中存儲了一使用者語音特徵表，該語音特徵表記錄了多個使用者名及其語音特徵參數的一一對應關係。本實施方式中，使用者名可以是使用者的真實姓名，也可以是昵稱或代號等。該使用者語音特徵表可以預先訓練得到，即，在會議/報告開始之前的一時間內，對各使用者進行語音訓練、採集而得到。記憶體10中還可以存儲由使用者或系統預設的會議記錄範本。記憶體10還可以用於存儲錄製的語音資料、語音文字轉換所需的語音文字資料庫等，以及常用語資料庫。其中，常用語資料庫是在會議記錄裝置100執行其自動生成會議記錄的功能的過程中，累積、篩選存儲的，也可以是從一常用語資料庫中下載並存儲的。The memory 10 stores a user voice feature table, and the voice feature table records a one-to-one correspondence between a plurality of user names and their voice feature parameters. In this embodiment, the user name may be the real name of the user, or may be a nickname or a code number. The user voice feature table can be pre-trained, that is, the voice training and acquisition are performed for each user within a time before the start of the conference/report. The memory record 10 can also store a conference record template preset by the user or the system. The memory 10 can also be used for storing recorded voice data, a voice text database required for voice text conversion, and a common language database. The common language database is accumulated or filtered in the process of the conference recording device 100 performing its function of automatically generating the conference record, and may also be downloaded and stored from a common language database.

本實施方式中，語音輸入單元20用於採集會議時各使用者的語音，並將採集到的語音轉換為語音信號。語音輸入單元20為一麥克風。通讯單元40用於回應處理器60的控制而與雲端裝置200進行資料通讯。定位模組50用於提供會議記錄裝置100的即時位置資訊，其可以是一GPS定位模組。In this embodiment, the voice input unit 20 is configured to collect the voice of each user during the conference, and convert the collected voice into a voice signal. The voice input unit 20 is a microphone. The communication unit 40 is configured to perform data communication with the cloud device 200 in response to the control of the processor 60. The positioning module 50 is configured to provide real-time location information of the conference recording device 100, which may be a GPS positioning module.

在一實施方式中，會議記錄裝置100還包括一觸控式螢幕30。In an embodiment, the conference recording device 100 further includes a touch screen 30.

在本實施方式中，記憶體10中還存儲了多個功能模組，該多個功能模組被配置成由一個或多個處理器（本實施方式為一個處理器60）執行，以完成本發明。例如，參閱圖1所示，記憶體10中存儲了錄音模組11、轉換模組12、辨識模組13、判斷模組14、校對編輯模組15、生成模組16、發送模組17、分割模組18和控制模組19。在其他實施方式中，記憶體10中存儲的功能模組還可以根據實際需要作相應的變化，例如，當語音轉換為文字、自動辨識語音和/或文字中的常用詞句、存儲常用詞句於常用語資料庫，以及根據常用詞句自動校對文字等功能中的一或多個功能被設置為由雲端裝置200來執行時，會議記錄裝置100的記憶體10中可以不存儲執行該功能所需的功能模組。本發明所稱的模組是完成一特定功能的程式段，比程式更適合於描述軟體在處理器60中的執行過程。關於各模組的功能將在圖4-圖7的流程圖中具體描述。In the embodiment, a plurality of function modules are further stored in the memory 10, and the plurality of function modules are configured to be executed by one or more processors (one processor 60 in the embodiment) to complete the present invention. For example, as shown in FIG. 1 , the memory 10 stores a recording module 11 , a conversion module 12 , an identification module 13 , a determination module 14 , a proofreading module 15 , a generation module 16 , and a transmission module 17 . The module 18 and the control module 19 are divided. In other embodiments, the function module stored in the memory 10 can also be changed according to actual needs, for example, when the voice is converted into text, the common words in the voice and/or text are automatically recognized, and the commonly used words are stored in the common language. When the language database and one or more functions of the functions such as automatic proofreading of the common words are set to be executed by the cloud device 200, the functions required to perform the function may not be stored in the memory 10 of the conference recording device 100. Module. The module referred to in the present invention is a program segment that performs a specific function, and is more suitable than the program to describe the execution process of the software in the processor 60. The function of each module will be specifically described in the flowcharts of Figs.

需要說明的是，為說明方便，以下關於自動生成會議記錄的方法的介紹中，均是以該方法運行於一包括相應的單元和/或功能模組的會議記錄裝置（例如會議記錄裝置100）中來進行介紹的。根據前面的介紹可知，以下的各自動生成會議記錄的方法中，某些步驟還可以設置由一與會議記錄裝置連接的雲端裝置（例如雲端裝置200）來執行，因此，相應的，需要時，可以在下述的各自動生成會議記錄的方法的步驟中增加會議記錄裝置將語音信號/資料、文字資料和/或其他資料傳輸至該雲端裝置，以及該雲端裝置接收信號/資料的步驟。因該些為本領域技術人員可以根據本說明書所揭露的內容實施得到的一些技術手段，因此，為節約篇幅起見，將不在本說明書中一一具體詳細的描述。It should be noted that, for convenience of description, the following description of the method for automatically generating the conference record is performed by the method in a conference recording device (for example, the conference recording device 100) including the corresponding unit and/or function module. Introduced in the middle. According to the foregoing description, in the following methods for automatically generating a conference record, some steps may also be performed by a cloud device (for example, the cloud device 200) connected to the conference recording device, and accordingly, when needed, The step of transmitting the voice signal/data, text data, and/or other data to the cloud device and the step of receiving the signal/data by the cloud device may be added to the steps of the method for automatically generating the conference record described below. Therefore, some technical means that can be implemented by those skilled in the art according to the contents disclosed in the present specification, therefore, for the sake of space saving, will not be specifically described in detail in this specification.

如圖4所示，是本發明一實施方式的自動生成會議記錄的方法400的流程圖。自動生成會議記錄的方法400是在一會議記錄裝置（例如會議記錄裝置100）和/或雲端裝置（例如雲端裝置200）的會議記錄功能被開啟後，運行於該會議記錄裝置和/或雲端裝置的，其可以開始於步驟S401、步驟S402或步驟S403。As shown in FIG. 4, it is a flowchart of a method 400 for automatically generating a conference record according to an embodiment of the present invention. The method 400 of automatically generating a meeting record is to run on the meeting recording device and/or the cloud device after the meeting recording function of the meeting recording device (for example, the meeting recording device 100) and/or the cloud device (for example, the cloud device 200) is turned on. It can start at step S401, step S402 or step S403.

步驟S401，接收步驟：語音輸入單元20接收語音並將接收的語音轉換為相應的語音信號。本實施方式中，會議記錄裝置100設在會議的使用者1附近，語音輸入單元20為設置於會議記錄裝置100中的麥克風。Step S401, receiving step: the voice input unit 20 receives the voice and converts the received voice into a corresponding voice signal. In the present embodiment, the conference recording device 100 is provided in the vicinity of the user 1 of the conference, and the voice input unit 20 is a microphone provided in the conference recording device 100.

在另一實施方式中，還可以在本步驟S401同時或之前執行如下步驟：控制模組19控制開啟定位模組50以獲取一會議記錄裝置100的位置資訊及當前的會議時間資訊，並將獲取的位置資訊及時間資訊存儲於記憶體10中。在其他實施方式中，會議記錄裝置100還可以接收經由觸控式螢幕30輸入的當前會議的相關資訊並存儲，例如，會議日期、時間、地點以及參加會議的人員名等等。In another embodiment, the following steps may be performed at the same time or before the step S401: the control module 19 controls the opening of the positioning module 50 to obtain the location information of the conference recording device 100 and the current meeting time information, and acquires The location information and time information are stored in the memory 10. In other embodiments, the conference recording device 100 can also receive related information of the current conference input via the touch screen 30 and store, for example, the date, time, location, and name of the person attending the conference, and the like.

步驟S402，錄音步驟：錄音模組11將所述語音信號錄製成語音資料，並將錄製好的語音資料存儲於記憶體10。在一實施方式中，回應使用者的選擇，本步驟也可以省略，而直接執行步驟S403。Step S402, recording step: the recording module 11 records the voice signal into voice data, and stores the recorded voice data in the memory 10. In an embodiment, in response to the user's selection, this step may also be omitted, and step S403 is directly executed.

步驟S403，辨識步驟：辨識模組13根據所述語音信號以及記憶體10中存儲的使用者語音特徵表，識別出所述語音信號對應的一或多個使用者。本實施方式中，辨識模組13根據所述語音信號分析得到一或多個語音特徵，並從所述語音特徵表中查詢到相同/最相近的語音特徵對應的一或多個使用者，從而得到語音資料中對應的一或多個使用者。會議或報告進行時，當有多個使用者發言/說話的時候，辨識模組13即可根據所述語音信號及所述語音特徵表識別出所述語音資料中包含了哪個使用者的聲音。Step S403, the step of identifying: the identification module 13 identifies one or more users corresponding to the voice signal according to the voice signal and the user voice feature table stored in the memory 10. In this embodiment, the identification module 13 analyzes one or more speech features according to the speech signal, and queries one or more users corresponding to the same/closest speech features from the speech feature table, thereby The corresponding one or more users in the voice data are obtained. When a conference or a report is being executed, when a plurality of users speak/speak, the identification module 13 can identify which user's voice is included in the voice data according to the voice signal and the voice feature table.

在另一實施方式中，辨識模組13還給不同的使用者的語音片段加上不同的標籤，同一使用者的語音片段加上相同的標籤。In another embodiment, the recognition module 13 also adds different labels to the voice segments of different users, and the same user's voice segments are added with the same label.

步驟S404，轉換步驟：轉換模組12將所述語音信號轉換為包含所述一或多個使用者的使用者名的文字。本實施方式中，轉換模組12根據所述語音信號以及記憶體10中存儲的語音文字資料庫，將所述語音信號轉換為文字，並在辨識模組13識別到的一或多個使用者的各使用者的語音信號對應的轉換得到的文字的一預設位置自動添加對應的使用者的使用者名，本實施方式中，預設位置為各使用者的語音信號對應的轉換得到的文字的最前端。Step S404, a conversion step: the conversion module 12 converts the voice signal into a text containing a username of the one or more users. In this embodiment, the conversion module 12 converts the voice signal into text according to the voice signal and the voice text database stored in the memory 10, and the one or more users identified by the recognition module 13 In the preset position of the converted text corresponding to the voice signal of each user, the user name of the corresponding user is automatically added. In this embodiment, the preset position is the converted text corresponding to the voice signal of each user. The front end.

在另一實施方式中，在辨識模組13給不同的使用者的語音片段加上了些標籤時，轉換模組12轉換得到的所述文字還包括了該些標籤。In another embodiment, when the identification module 13 adds a label to a voice segment of a different user, the text converted by the conversion module 12 further includes the labels.

步驟S405，生成步驟：生成模組16根據轉換得到的所述文字以及記憶體10中存儲的會議記錄範本生成一原始會議記錄。請參閱圖3所示，其示出有一實施方式中，生成模組16生成的一原始會議記錄310。Step S405, a generating step: the generating module 16 generates an original meeting record according to the converted text and the meeting record template stored in the memory 10. Referring to FIG. 3, an original conference record 310 generated by the generation module 16 is shown in an embodiment.

在一實施方式中，生成模組16還將定位模組50所獲取的位置資訊及時間資訊自動添加到生成的原始會議記錄中。例如，將時間資訊添加到會議記錄範本中的會議日期/時間的欄位元中，將位置資訊添加到會議記錄範本中的會議地點的欄位中，等等。In an embodiment, the generating module 16 also automatically adds the location information and time information acquired by the positioning module 50 to the generated original meeting record. For example, add time information to the session date/time field in the meeting record template, add location information to the meeting place field in the meeting record template, and so on.

生成模組16還可以將使用者通過觸控式螢幕30輸入的會議參加者/出席者自動添加到會議記錄範本中的出席者/與會者的欄位中。The generation module 16 can also automatically add the conference participants/attendees input by the user through the touch screen 30 to the attendees/participants in the conference record template.

在另一實施方式中，生成模組16還可以根據辨識模組13識別到的所述文字中包含的使用者名或辨識模組13根據語音信號辨識得到的發出所述語音信號對應的語音的使用者的使用者名，自動將該些使用者名添加到會議記錄範本中的出席者/與會者的欄位中。In another embodiment, the generating module 16 may further generate the voice corresponding to the voice signal according to the voice signal recognized by the user name or the recognition module 13 included in the text recognized by the recognition module 13 The user's username automatically adds the username to the attendee/participant field in the meeting record template.

步驟S406，校對編輯步驟：校對編輯模組15根據預設的校對編輯規則對所述原始會議記錄進行校對和/或編輯，以得到一會議記錄。Step S406, the proofreading editing step: the proofreading editing module 15 collates and/or edits the original meeting record according to a preset proofreading editing rule to obtain a meeting record.

本實施方式中，所述預設的校對編輯規則為從所述文字中的每一使用者名處對文字進行段落劃分。辨識模組13還從轉換得到的所述文字中辨識/識別出使用者的使用者名，校對編輯模組15則根據辨識模組13識別到的所述文字中包含的使用者名對所述原始會議記錄進行段落劃分。例如，校對編輯模組15以使用者名的第一個或最後一個字為界來劃分段落。當所述文字中包含使用者名為王大明時，校對編輯模組15則從以王大明這三個文字作為段落的段首。需要說明的是，本實施方式中，優選的，此處所說的使用者名均是由辨識模組13通過辨識語音而得到的使用者的使用者名。在另一實施方式中，該些使用者名還可以是辨識模組13根據記憶體10中原先存儲的使用者名，從所述文字中自動識別出來的。請參閱圖3所示，其示出有一實施方式中，校對編輯模組15對原始會議記錄310進行校對和/或編輯後得到的編輯後的會議記錄320。In this embodiment, the preset proofreading rule is to segment the text from each user name in the text. The identification module 13 further identifies/recognizes the user name of the user from the converted text, and the proofreading editing module 15 performs the user name pair included in the text recognized by the recognition module 13 The original meeting record is divided into paragraphs. For example, the proofreading editing module 15 divides the paragraphs by the first or last word of the username. When the text includes the user name Wang Daming, the proofreading editing module 15 takes the first paragraph of the paragraph from Wang Daming as the paragraph. It should be noted that, in the present embodiment, it is preferable that the user names mentioned herein are the user names of the users obtained by the recognition module 13 by recognizing the voice. In another embodiment, the user names may be automatically recognized by the recognition module 13 from the text according to the user name originally stored in the memory 10. Referring to FIG. 3, there is shown an edited conference record 320 obtained by the proofreading module 15 after proofreading and/or editing the original conference record 310 in one embodiment.

在另一實施方式中，所述預設的校對編輯規則為根據辨識模組13給不同的使用者的語音片段加上的標籤，從每一語音片段起始處所對應的文字處對文字段落進行切分。In another embodiment, the preset proofreading rule is a label added to the voice segment of different users according to the recognition module 13, and the text paragraph is performed from the text corresponding to the beginning of each voice segment. Segmentation.

在再一實施方式中，校對編輯模組15還將校對編輯後的所述會議記錄存儲於所述記憶體10中。或者，發送模組17控制通過通讯單元40將校對編輯後的所述會議記錄發送至所述雲端裝置200，以控制將所述會議記錄存儲於所述雲端裝置200。In still another embodiment, the proofreading editing module 15 also stores the collated edited meeting record in the memory 10. Alternatively, the sending module 17 controls the conference record edited by the communication unit 40 to be sent to the cloud device 200 to control storing the conference record in the cloud device 200.

在其他實施方式中，校對編輯模組15還根據觸控式螢幕30生成的編輯信號對會議記錄進行編輯。例如，使用者可以通過觸控式螢幕30輸入對原始會議記錄的編輯內容和/或編輯操作，從而提供了供使用者手動編輯原始會議記錄的功能。此外，所述預設的校對編輯規則還包括智慧識別校對文字等，具體請結合以下根據圖5進行的說明。In other embodiments, the proofreading editing module 15 also edits the meeting record according to the editing signal generated by the touch screen 30. For example, the user can input the editing content and/or editing operation of the original meeting record through the touch screen 30, thereby providing a function for the user to manually edit the original meeting record. In addition, the preset proofreading rules further include smart identification of proofreading characters, etc., in particular, the following description according to FIG.

步驟S407，發送步驟：發送模組17根據預設的發送規則將經校對和/或編輯後的所述會議記錄自動發送至會議相關人員的通訊位址。本實施方式中，所述預設的發送規則可以為立即發送（即，會議記錄生成後即發送）至會議相關人員的通訊位址，也可以是在會議記錄生成後的一預設時間點發送至會議相關人員的通訊位址。所述會議相關人員可以包括以下人員中的一或多個：會議出席者、會議記錄中出現了其使用者名的使用者、會議記錄中涉及/提及的使用者（例如，待辦事項的使用者）、預設的主管、負責人、責任人等等。Step S407, the sending step: the sending module 17 automatically sends the collated and/or edited meeting record to the communication address of the meeting related person according to the preset sending rule. In this implementation manner, the preset sending rule may be immediately sent (that is, sent after the conference record is generated) to the communication address of the conference related person, or may be sent at a preset time point after the conference record is generated. The communication address of the person to the meeting. The meeting related person may include one or more of the following: a meeting attendee, a user whose username appears in the meeting record, or a user referred to/reported in the meeting record (eg, to-do list) User), default supervisor, responsible person, responsible person, etc.

在另一實施方式中，所述預設的發送規則還可以包括在待辦事項的預設到期日前的預設天數發送生成的所述會議記錄至待辦事項相關的人員的通訊位址，例如，可以包括待辦事項的直接責任人、相關主管及與該待辦事項相關的其他相關人員。In another embodiment, the preset sending rule may further include sending, by the preset number of days before the preset expiration date of the to-do item, the generated meeting record to the communication address of the to-do person related to the to-do list. For example, it may include the person directly responsible for the to-do list, the relevant supervisor, and other related personnel related to the to-do list.

在其他實施方式中，還可以不設置本步驟S407，而由使用者直接手動發送會議記錄至會議相關人員的通訊位址；或者，在雲端裝置200接收並存儲了該會議記錄時，由雲端裝置200將該會議記錄發送至會議相關人員。In other embodiments, the step S407 may not be set, but the user directly sends the conference record to the communication address of the conference related person manually; or, when the cloud device 200 receives and stores the conference record, the cloud device 200 sends the meeting record to the relevant person of the meeting.

如圖5所示，是本發明一實施方式的自動生成會議記錄的方法500的流程圖。自動生成會議記錄的方法500是在一會議記錄裝置（例如會議記錄裝置100）的會議記錄功能被開啟後，運行於該會議記錄裝置的。需要說明的是，圖5所示的自動生成會議記錄的方法500與圖4所示的自動生成會議記錄的方法400中執行的步驟中，有一部分相同或相類似的，因此，上述對圖4中的自動生成會議記錄的方法400進行描述時，針對某步驟進行說明的一些替代的、可同時執行的其他實施方式也是適用於圖5中的自動生成會議記錄的方法500中相同或相類似的步驟，在此就不再一一贅述。自動生成會議記錄的方法500可以開始於步驟S501。As shown in FIG. 5, it is a flowchart of a method 500 of automatically generating a conference record according to an embodiment of the present invention. The method 500 of automatically generating a meeting record is performed on the meeting recording device after the meeting recording function of the meeting recording device (e.g., the meeting recording device 100) is turned on. It should be noted that some of the steps performed in the method 500 for automatically generating the conference record shown in FIG. 5 and the method 400 for automatically generating the conference record shown in FIG. 4 are the same or similar, and therefore, the foregoing FIG. 4 While the method 400 of automatically generating a meeting record is described, some alternative, simultaneously executable, other embodiments that are described for a certain step are also the same or similar in the method 500 of the automatically generated meeting record of FIG. The steps are not repeated here. The method 500 of automatically generating a meeting record can begin in step S501.

步驟S501，接收步驟：語音輸入單元20接收語音並將接收的語音轉換為相應的語音信號。Step S501, receiving step: the voice input unit 20 receives the voice and converts the received voice into a corresponding voice signal.

步驟S502，錄音步驟：錄音模組11將所述語音信號錄製成語音資料，並將錄製好的語音資料存儲於記憶體10。在一實施方式中，回應使用者的選擇，本步驟也可以省略，而直接執行步驟S503。Step S502, recording step: the recording module 11 records the voice signal into voice data, and stores the recorded voice data in the memory 10. In an embodiment, in response to the user's selection, this step may also be omitted, and step S503 is directly executed.

步驟S503，辨識步驟：辨識模組13根據所述語音信號識別出所述語音資料中的無聲片段。本實施方式中，所述無聲片段即為所述語音資料中的為靜音資料的片段，即，為所述語音中為靜音的片段。例如，當所述語音信號中某部分對應的語音資料的語音片段的音量小於一預設的無聲臨界值時，辨識模組13即識別該語音片段為無聲片段。所述語音資料中可能包含了多個無聲片段。Step S503, an identification step: the recognition module 13 identifies the silent segment in the voice data according to the voice signal. In this embodiment, the silent segment is a segment of the voice data that is a mute data, that is, a segment that is muted in the voice. For example, when the volume of the voice segment corresponding to a certain portion of the voice signal is less than a predetermined silent threshold, the recognition module 13 recognizes the voice segment as a silent segment. The voice material may contain a plurality of silent segments.

在一實施方式中，當未包含步驟S502時，本步驟中，辨識模組13根據所述語音信號識別出所述語音中的無聲片段。In an embodiment, when step S502 is not included, in this step, the recognition module 13 identifies the silent segment in the voice according to the voice signal.

步驟S504，判斷步驟：判斷模組14判斷所述無聲片段所歷經的時間是否大於一預設值，如果是，則執行步驟S505，否則，流程結束。在一實施方式中，所述預設值為3秒。Step S504, the determining step: the determining module 14 determines whether the time elapsed by the silent segment is greater than a preset value, and if yes, executing step S505; otherwise, the process ends. In an embodiment, the preset value is 3 seconds.

步驟S505，分割步驟：分割模組18根據所述無聲片段將所述語音資料分割為多個語音資料片段。本實施方式中，分割模組18從所述無聲片段處對所述語音資料進行分割，當所述語音資料中包含歷經的時間均大於所述預設值的多個無聲片段時，分割模組18根據多個無聲片段將所述語音資料分割為多個語音資料片段。Step S505, the dividing step: the segmentation module 18 divides the voice data into a plurality of voice data segments according to the silent segment. In this embodiment, the segmentation module 18 segments the voice data from the silent segment, and when the voice data includes a plurality of silent segments whose time is greater than the preset value, the segmentation module 18 segmenting the speech data into a plurality of speech data segments based on a plurality of silent segments.

步驟S506，辨識步驟：辨識模組13根據分割得到的多個語音資料片段對應的語音信號以及記憶體10中存儲的使用者語音特徵表，識別出所述多個語音資料片段中對應的一或多個使用者。在一實施方式中，本自動生成會議記錄的方法500還可以不包括本步驟。Step S506, the identification step: the recognition module 13 identifies a corresponding one of the plurality of voice data segments according to the voice signal corresponding to the plurality of voice data segments obtained by the segmentation and the user voice feature table stored in the memory 10. Multiple users. In an embodiment, the method 500 for automatically generating a conference record may not include this step.

步驟S507，轉換步驟：轉換模組12將分割得到的多個語音資料片段對應的語音信號轉換為包含多個段落的文字。本實施方式中，轉換模組12根據所述多個語音資料片段對應的語音信號、辨識模組13識別到的一或多個使用者以及記憶體10中存儲的語音文字資料庫，將所述多個語音資料片段對應的語音信號轉換為包含與各語音資料片段一一對應的多個段落的文字。Step S507, a conversion step: the conversion module 12 converts the voice signals corresponding to the plurality of segmented speech data segments into characters including a plurality of paragraphs. In the embodiment, the conversion module 12 is configured according to the voice signal corresponding to the plurality of voice data segments, the one or more users identified by the recognition module 13, and the voice text database stored in the memory 10. The speech signals corresponding to the plurality of speech data segments are converted into words including a plurality of paragraphs corresponding to the respective speech data segments.

步驟S508，生成步驟：生成模組16根據轉換得到的所述包含多個段落的文字以及記憶體10中存儲的會議記錄範本生成一原始會議記錄。本步驟S509具體的方式與自動生成會議記錄的方法400可以相同，在此就不在贅述。Step S508, a generating step: the generating module 16 generates an original meeting record according to the converted text containing the plurality of paragraphs and the meeting record template stored in the memory 10. The specific manner of this step S509 may be the same as the method 400 for automatically generating a conference record, and is not described herein.

在本實施方式中，在本步驟S508之後還可以執行自動生成會議記錄的方法400中的步驟S406（校對編輯步驟）及步驟S407（發送步驟），在此就不再贅述。In the present embodiment, step S406 (proofreading editing step) and step S407 (transmission step) in the method 400 for automatically generating the conference record may be performed after the step S508, and details are not described herein again.

如圖6所示，是本發明一實施方式的自動生成會議記錄的方法600的流程圖。自動生成會議記錄的方法600是在一會議記錄裝置（例如會議記錄裝置100）的會議記錄功能被開啟後，運行於該會議記錄裝置的。需要說明的是，圖6所示的自動生成會議記錄的方法600與圖5及圖4所示的自動生成會議記錄的方法中所執行的步驟中，有一部分是相同或相類似的，因此，上述對圖4中的自動生成會議記錄的方法400以及對圖5中的自動生成會議記錄的方法500進行描述時，針對某步驟進行說明的一些替代的、可同時執行的其他實施方式也是適用於圖6中的自動生成會議記錄的方法600中相同或相類似的步驟，在此也不再一一贅述。自動生成會議記錄的方法600可以開始於步驟S601。As shown in FIG. 6, a flowchart of a method 600 of automatically generating a conference record according to an embodiment of the present invention. The method 600 of automatically generating a meeting record is run by the meeting recording device after the meeting recording function of the meeting recording device (e.g., the meeting recording device 100) is turned on. It should be noted that some of the steps performed in the method 600 for automatically generating a conference record shown in FIG. 6 and the method for automatically generating a conference record shown in FIG. 5 and FIG. 4 are the same or similar. In the above description of the method 400 of automatically generating a conference record in FIG. 4 and the method 500 of automatically generating a conference record in FIG. 5, some alternative embodiments that can be performed simultaneously for a certain step are also applicable to The same or similar steps in the method 600 of automatically generating a conference record in FIG. 6 are not repeated here. The method 600 of automatically generating a meeting record can begin in step S601.

步驟S601，接收步驟：語音輸入單元20接收語音並將接收的語音轉換為相應的語音信號。Step S601, receiving step: the voice input unit 20 receives the voice and converts the received voice into a corresponding voice signal.

步驟S602，錄音步驟：錄音模組11將所述語音信號錄製成包含錄音時間戳記的語音資料，並將錄製好的語音資料存儲於記憶體10。在一實施方式中，回應使用者的選擇，本步驟也可以省略，而直接執行步驟S603。Step S602, recording step: the recording module 11 records the voice signal into voice data including a recording time stamp, and stores the recorded voice data in the memory 10. In an embodiment, in response to the user's selection, this step may also be omitted, and step S603 is directly executed.

步驟S603，辨識步驟：辨識模組13根據所述語音信號以及記憶體10中存儲的使用者語音特徵表，識別出所述語音信號中對應的一或多個使用者。在一實施方式中，辨識模組13根據所述包含錄音時間戳記的語音資料以及記憶體10中存儲的使用者語音特徵表，識別出所述語音信號對應的一或多個使用者。在另一實施方式中，自動生成會議記錄的方法600也可以不包括本步驟。In step S603, the identification module 13 identifies the corresponding one or more users in the voice signal according to the voice signal and the user voice feature table stored in the memory 10. In one embodiment, the identification module 13 identifies one or more users corresponding to the voice signal according to the voice data including the recording time stamp and the user voice feature table stored in the memory 10. In another embodiment, the method 600 of automatically generating a meeting record may also not include this step.

步驟S604，轉換步驟：轉換模組12將所述語音信號轉換為包含所述錄音時間戳記及所述一或多個使用者的使用者名的文字。本實施方式中，轉換模組12將所述語音信號轉換為包含所述錄音時間戳記及所述一或多個使用者的使用者名的文字。轉換模組12根據所述語音信號、錄音模組11所錄製的包含了錄音時間戳記的語音資料、辨識模組13識別到的一或多個使用者以及記憶體10中存儲的語音文字資料庫，將所述語音信號轉換為包含了所述錄音時間戳記的文字，並在各使用者的語音信號轉換得到的文字的最前端自動添加對應的使用者的使用者名。在另一實施方式中，轉換模組12根據所述語音信號、錄音模組11所錄製的包含了錄音時間戳記的語音資料以及記憶體10中存儲的語音文字資料庫，將所述語音信號轉換為包含了所述錄音時間戳記的文字。Step S604, a conversion step: the conversion module 12 converts the voice signal into a text including the recording time stamp and the user name of the one or more users. In this embodiment, the conversion module 12 converts the voice signal into a text including the recording time stamp and the user name of the one or more users. The conversion module 12 is based on the voice signal, the voice data recorded by the recording module 11 and including the recording time stamp, the one or more users identified by the recognition module 13, and the voice text database stored in the memory 10. And converting the voice signal into a character including the recording time stamp, and automatically adding a corresponding user name to the front end of the text converted by each user's voice signal. In another embodiment, the conversion module 12 converts the voice signal according to the voice signal, the voice data recorded by the recording module 11 and including the voice time data stored in the memory 10, and the voice text database stored in the memory 10. Is the text that contains the recording timestamp.

步驟S605，判斷步驟：判斷模組14根據轉換後的所述文字，判斷是否有相鄰的文字對應的錄音時間戳記所記載的時間間隔達到一預設值，如果是，則執行步驟S606，否則，流程結束。在一實施方式中，所述預設值為3秒。所述包含所述錄音時間戳記的相鄰的文字中可能包含有多個時間間隔達到該預設值的。Step S605, the determining step: the determining module 14 determines, according to the converted character, whether the time interval recorded by the recording time stamp corresponding to the adjacent text reaches a preset value, and if yes, executing step S606, otherwise The process ends. In an embodiment, the preset value is 3 seconds. The adjacent text including the recording time stamp may include a plurality of time intervals to reach the preset value.

步驟S606，分割步驟：分割模組18將所述對應的錄音時間戳記所記載的時間間隔達到所述預設值的相鄰的文字為界劃分文字段落。本實施方式中，具體的，該相鄰的文字分別被劃分到前一個段落以及相鄰的後一個段落，直至所有的對應的錄音時間戳記所記載的時間間隔達到所述預設值的各相鄰的文字均被劃分到不同的段落。Step S606, the dividing step: the segmentation module 18 divides the text paragraphs by the adjacent characters whose time intervals indicated by the corresponding recording time stamps reach the preset value. In this embodiment, specifically, the adjacent characters are respectively divided into a previous paragraph and an adjacent subsequent paragraph until all time intervals recorded by the corresponding recording time stamps reach the preset values. The adjacent text is divided into different paragraphs.

步驟S607，生成步驟：生成模組16根據劃分段落後的所述文字以及記憶體10中存儲的會議記錄範本生成一原始會議記錄。本步驟S607具體的方式與自動生成會議記錄的方法500可以相同，在此就不在贅述。Step S607, a generating step: the generating module 16 generates an original meeting record according to the text after dividing the paragraph and the meeting record template stored in the memory 10. The specific manner of this step S607 may be the same as the method 500 for automatically generating a conference record, and is not described herein.

如圖7所示，是本發明一實施方式的自動生成會議記錄的方法700的流程圖。自動生成會議記錄的方法700是在一會議記錄裝置（例如會議記錄裝置100）的會議記錄功能被開啟後，運行於該會議記錄裝置的。需要說明的是，圖7所示的自動生成會議記錄的方法700與圖5及圖4所示的自動生成會議記錄的方法中所執行的步驟中，有一部分是相同或相類似的，因此，上述對圖4中的自動生成會議記錄的方法400以及對圖5中的自動生成會議記錄的方法500進行描述時，針對某步驟進行說明的一些替代的、可同時執行的其他實施方式也是適用於圖7中的自動生成會議記錄的方法700中相同或相類似的步驟，在此也不再一一贅述。本自動生成會議記錄的方法700可以開始於步驟S701。As shown in FIG. 7, it is a flowchart of a method 700 for automatically generating a conference record according to an embodiment of the present invention. The method 700 of automatically generating a meeting record is performed on the meeting recording device after the meeting recording function of the meeting recording device (e.g., the meeting recording device 100) is turned on. It should be noted that some of the steps performed in the method 700 for automatically generating a conference record shown in FIG. 7 and the method for automatically generating a conference record shown in FIG. 5 and FIG. 4 are the same or similar. In the above description of the method 400 of automatically generating a conference record in FIG. 4 and the method 500 of automatically generating a conference record in FIG. 5, some alternative embodiments that can be performed simultaneously for a certain step are also applicable to The same or similar steps in the method 700 of automatically generating a conference record in FIG. 7 will not be repeated here. The method 700 of automatically generating a meeting record can begin in step S701.

步驟S701，建庫步驟：控制模組19建立一包含常用語及其校正對象的常用語資料庫，並將所述常用語資料庫存儲於記憶體10中。本實施方式中，可以是當會議記錄裝置100為首次使用自動生成會議記錄的功能時，控制模組19自動建立所述常用語資料庫。所述常用語資料庫中包含至少一常用語及其校正對象的對應關係，每一常用語至少與一校正對象對應。所述常用語包括了以下中的一或多種：常用字、常用詞、常用句子等，還可以是語音資料或文字資料。每一常用語的校正對象可以是在使用者手動編輯、修改會議記錄過程中累積、記載下來的。校正對象包括以下語音資料和/或文字資料中的以下中的一或多種：字、詞、句子等。Step S701, the database building step: the control module 19 creates a common language database containing the common language and its correction object, and stores the common language data in the memory 10. In this embodiment, when the conference recording device 100 is configured to automatically generate a conference record for the first time, the control module 19 automatically establishes the common language database. The common language database includes a correspondence between at least one common language and its correction object, and each common language corresponds to at least one correction object. The common language includes one or more of the following: common words, common words, common sentences, etc., and may also be voice data or text data. The correction object of each common language may be accumulated and recorded during the user's manual editing and modification of the meeting record. The correction object includes one or more of the following voice data and/or text data: words, words, sentences, and the like.

在另一實施方式中，本自動生成會議記錄的方法700還可以不包括本步驟S701。而是在該會議記錄裝置中預先存儲有一常用語資料庫，常用語資料庫是在會議記錄裝置100執行其自動生成會議記錄的功能的過程中，累積、篩選存儲的，也可以是從一常用語資料庫中下載並存儲的。In another embodiment, the method 700 for automatically generating a conference record may not include this step S701. Rather, a common language database is pre-stored in the conference recording device. The common language database is accumulated, filtered, or used in the process of the conference recording device 100 performing its function of automatically generating the conference record. Downloaded and stored in the language database.

步驟S702，接收步驟：語音輸入單元20接收語音並將接收的語音轉換為相應的語音信號。Step S702, receiving step: the voice input unit 20 receives the voice and converts the received voice into a corresponding voice signal.

步驟S703，轉換步驟：轉換模組12將所述語音信號轉換為文字。在一實施方式中，還可以包括自動生成會議記錄的方法400、500及600中任一方法中所包含的接收步驟至轉換步驟之間的其他步驟。即，可以包含前面所描述的各種實施方式的將語音信號轉換為文字的步驟。Step S703, a conversion step: the conversion module 12 converts the voice signal into a text. In an embodiment, other steps between the receiving step and the converting step included in any of the methods 400, 500, and 600 of automatically generating the meeting record may also be included. That is, the steps of converting the speech signal into text can be included in the various embodiments described above.

步驟S704，識別存儲常用詞步驟：判斷模組14在識別判斷出所述語音資料和/或所述文字中包含重複出現一預設次數的詞句時，將所述重複出現該預設次數的詞句作為常用語存儲於所述常用語資料庫中。本實施方式中，重複出現該預設次數的詞句可以為字、詞、句子等語音資料和/或文字資料。在一實施方式中，本步驟S704還可以省略。所述預設次數為20次。Step S704: Identify a step of storing a common word: the determining module 14 displays the repeated occurrence of the predetermined number of words when the voice data is identified and/or the word contains a predetermined number of times of repeated occurrences It is stored as a common language in the common language database. In this embodiment, the words and phrases that are repeated for the preset number of times may be voice data and/or text data such as words, words, sentences, and the like. In an embodiment, this step S704 can also be omitted. The preset number of times is 20 times.

步驟S705，判斷步驟：判斷模組14判斷轉換後的所述文字否包含一校正對象，如果是，則執行步驟S706，否則，流程結束。Step S705, the determining step: the determining module 14 determines whether the converted text contains a correction object, and if yes, executing step S706; otherwise, the flow ends.

步驟S706，校對步驟：校對編輯模組15根據所述常用語資料庫自動將所述文字包含的校正對象校正為對應的常用語。在一實施方中，本步驟S706還可以在步驟S707之後執行。Step S706, the proofreading step: the proofreading editing module 15 automatically corrects the corrected object included in the text to the corresponding common language according to the common language database. In an implementation, this step S706 can also be performed after step S707.

步驟S707，生成步驟：生成模組16根據校正後的所述文字以及記憶體10中存儲的會議記錄範本生成一原始會議記錄。本步驟S707具體的方式與自動生成會議記錄的方法500可以相同，在此就不在贅述。Step S707, a generating step: the generating module 16 generates an original meeting record according to the corrected text and the meeting record template stored in the memory 10. The specific manner of this step S707 can be the same as the method 500 for automatically generating a conference record, and is not described herein.

本發明提供的上述會議記錄裝置100及其自動生成會議記錄的方法，可根據預設的會議記錄範本自動生成會議記錄，並可對會議記錄進行智慧的語音文字識別、內容格式化及編輯校對。而且，還可以根據預設的規則將會議記錄發送至相關人員。因而，相較於現有的方式更省時、方便及人性化。The conference recording apparatus 100 and the method for automatically generating the conference record provided by the invention can automatically generate a conference record according to a preset conference record template, and can perform intelligent voice text recognition, content formatting and editing proofreading on the conference record. Moreover, the meeting record can also be sent to the relevant personnel according to a preset rule. Therefore, it is more time-saving, convenient and user-friendly than the existing method.

本技術領域的普通技術人員應當認識到，以上的實施方式僅是用來說明本發明，而並非用作為對本發明的限定，只要在本發明的實質精神範圍之內，對以上實施例所作的適當改變和變化都落在本發明要求保護的範圍之內。It is to be understood by those skilled in the art that the above embodiments are only intended to illustrate the invention, and are not intended to limit the invention, as long as it is within the spirit of the invention Changes and modifications are intended to fall within the scope of the invention.

100‧‧‧會議記錄裝置100‧‧‧Conference recording device

200‧‧‧雲端裝置200‧‧‧Cloud device

1‧‧‧使用者1‧‧‧Users

310‧‧‧原始會議記錄310‧‧‧ Original meeting minutes

320‧‧‧編輯後的會議記錄320‧‧‧edited meeting minutes

400、500、600、700‧‧‧自動生成會議記錄的方法400, 500, 600, 700‧‧‧ Method of automatically generating meeting minutes

10‧‧‧記憶體10‧‧‧ memory

11‧‧‧錄音模組11‧‧‧Recording module

12‧‧‧轉換模組12‧‧‧Transition module

13‧‧‧辨識模組13‧‧‧ Identification Module

14‧‧‧判斷模組14‧‧‧Judgement module

15‧‧‧校對編輯模組15‧‧‧ proofreading module

16‧‧‧生成模組16‧‧‧Generation Module

17‧‧‧發送模組17‧‧‧Transmission module

18‧‧‧分割模組18‧‧‧Segment Module

19‧‧‧控制模組19‧‧‧Control module

20‧‧‧語音輸入單元20‧‧‧Voice input unit

30‧‧‧觸控式螢幕30‧‧‧Touch screen

40‧‧‧通讯單元40‧‧‧Communication unit

50‧‧‧定位模組50‧‧‧ Positioning Module

60‧‧‧處理器60‧‧‧ processor

S401-S407、S501-S508、S601-S607、S701-S707‧‧‧步驟S401-S407, S501-S508, S601-S607, S701-S707‧‧‧ steps

無no

400‧‧‧自動生成會議記錄的方法 400‧‧‧How to automatically generate meeting minutes

S401-S407‧‧‧步驟 S401-S407‧‧‧Steps

Claims

A method of automatically generating a meeting record, running in at least one device comprising a memory and a processor, the improvement comprising the step of: controlling, by the processor, the module stored in the memory to perform the following steps:
An identification step of: identifying one or more users corresponding to the voice signal according to a voice signal corresponding to the voice received at a conference and a user voice feature table stored in the memory;
a converting step of: converting the voice signal into a text including a user name of the one or more users; and generating a step of: generating an original according to the converted text and the meeting record template stored in the memory Meeting minutes.

The method of claim 1, further comprising an editing step of editing the original meeting record according to a preset proofreading editing rule to obtain a meeting record.

The method of claim 1, wherein the converting step comprises:
Converting the voice signal into text according to the voice signal and a voice text database stored in the memory; and automatically at a preset position of the text corresponding to the voice signal of the one or more users Add the username of the corresponding user.

The method of claim 1, wherein the generating step further comprises:
Automatically adding location information and time information acquired by a positioning module of the device to the generated meeting record;
Identifying a user name included in the text, or identifying a user name of the user who issued the corresponding voice according to the voice signal; and automatically adding the recognized user name to the attendee's field in the conference record template.

The method of claim 2, wherein: the proofreading editing rule is to segment the text from each user name in the text.

The method of any one of claims 2 to 4, further comprising:
Recording step: recording the voice signal into voice data, and storing the recorded voice data in the memory; and sending step: automatically transmitting the edited conference record to the conference related personnel according to a preset sending rule Communication address.

The method of any one of claims 1 to 4, further comprising:
a determining step: determining whether the text includes a correction object; and a proofreading step: when the text includes a correction object, automatically correcting the correction object included in the text to a corresponding common language according to a common language database.

The method of claim 7, further comprising the step of: identifying a stored common word: when identifying and determining that the voice and/or the text includes a repeated occurrence of a predetermined number of words, The words repeating the preset number of times are stored as common words in the common language database.

The method of claim 1 or 2, wherein the preset proofreading rule is to divide a text segment according to the plurality of voice data segments, the method further comprising:
Identifying a silent segment in the voice; and when determining that the time that the silent segment has elapsed is greater than a preset value, segmenting the voice data corresponding to the voice into a plurality of voice data segments according to the silent segment.

A conference recording device comprising a memory and a processor, the improvement comprising: the following module controlled by the processor and stored in the memory:
An identification module: configured to identify one or more users corresponding to the voice signal according to the voice signal corresponding to the voice received in a conference and the user voice feature table stored in the memory;
a conversion module: for converting the voice signal into a text including a user name of the one or more users; and a generating module: the text obtained according to the conversion and the stored in the memory The meeting record template generates an original meeting record.

The conference recording device of claim 10, further comprising a proofreading editing module, configured to edit the original meeting record according to a preset proofreading editing rule to obtain a meeting record.

The conference recording device of claim 10, wherein the conversion module is further configured to:
Converting the voice signal into text according to the voice signal and a voice text database stored in the memory; and automatically at a preset position of the text corresponding to the voice signal of the one or more users Add the username of the corresponding user.

The conference recording device of claim 10, wherein the generating module further comprises:
Automatically adding location information and time information acquired by a positioning module of the device to the generated meeting record;
Identifying a user name included in the text, or identifying a user name of the user who issued the corresponding voice according to the voice signal; and automatically adding the recognized user name to the attendee's field in the conference record template.

The conference recording device of claim 11, wherein: the proofreading editing rule is to segment the text from each user name in the text.

The conference recording device according to any one of claims 11-13, further comprising:
Recording module: for recording the voice signal into voice data, and storing the recorded voice data in the memory; and sending module: for recording the edited meeting record according to a preset sending rule Automatically sent to the communication address of the meeting person.

The conference recording device according to any one of claims 10-13, further comprising a determining module, configured to determine whether the text includes a correction object; and the proofreading editing module is further used to When the text includes a correction object, the correction object included in the text is automatically corrected to the corresponding common language according to a common language database.

The conference recording device of claim 16, wherein the determining module is further configured to: when it is determined that the voice and/or the text includes a repeated occurrence of a predetermined number of words, The phrase that repeats the preset number of times is stored as a common language in the common language database.

The conference recording device of claim 10 or 11, wherein:
The preset proofreading editing rule is to divide the text paragraph according to the plurality of voice data segments;
The conference recording device further includes a segmentation module;
The identification module is further configured to identify a silent segment in the voice;
The determining module is further configured to determine whether the time elapsed by the silent segment is greater than a preset value; and the segmentation module is configured to divide the voice data corresponding to the voice into multiple voices according to the silent segment Fragment of data.