CN109599115A - Minutes method and apparatus for audio collecting device and user terminal - Google Patents

Minutes method and apparatus for audio collecting device and user terminal Download PDF

Info

Publication number
CN109599115A
CN109599115A CN201811585400.8A CN201811585400A CN109599115A CN 109599115 A CN109599115 A CN 109599115A CN 201811585400 A CN201811585400 A CN 201811585400A CN 109599115 A CN109599115 A CN 109599115A
Authority
CN
China
Prior art keywords
audio
data
user terminal
collecting device
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811585400.8A
Other languages
Chinese (zh)
Other versions
CN109599115B (en
Inventor
张蓓蓓
张计锋
赵恒艺
孙岩
周祥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AI Speech Ltd
Original Assignee
AI Speech Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by AI Speech Ltd filed Critical AI Speech Ltd
Priority to CN201811585400.8A priority Critical patent/CN109599115B/en
Publication of CN109599115A publication Critical patent/CN109599115A/en
Application granted granted Critical
Publication of CN109599115B publication Critical patent/CN109599115B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1095Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems

Abstract

The present invention discloses the minutes method and apparatus for audio collecting device and user terminal, wherein a kind of minutes method is used for audio collecting device, comprising: audio collecting device and user terminal establish connection;Audio collecting device acquires audio data in real time;The audio data is sent to cloud transcription service and obtains the text data that the cloud transcription service returns, wherein the cloud transcription service turns text-processing for carrying out voice to the audio data;Via multiterminal collaboration services by the text data real-time synchronization to the user terminal.Scheme provided by the embodiments of the present application solves the problems, such as that multiterminal content of edit is synchronous.But product design is more helped to be, multiterminal cooperation provides good support for real-time edition during audio transcription, so that user can change in the usage scenarios such as meeting, interview when listening, is finally reached the purpose of quickly output destination document.

Description

Minutes method and apparatus for audio collecting device and user terminal
Technical field
The invention belongs to voice data technical fields, more particularly, to the meeting note of audio collecting device and user terminal Recording method and device.
Background technique
In the related technology, the minutes scheme that certain schemes provide can support mobile phone terminal audio collection and real-time transcription Text can support the end APP and the end Web both ends after the completion of recording while edit, and can support in the end APP and web terminal editor The bi-directional synchronization of appearance.
Inventor has found that above scheme at least has the following deficiencies: during realizing the application
1, text editing is not supported while audio real-time transcription, do not meet the actual use habit that user remembers when listening.
2, transcription text is only just supported to be synchronized to the end Web after recording is completed to save, in conference scenario inconvenient user and When check.
3, only mobile phone terminal is supported to acquire audio, radio reception effect is bad, causes to record unintelligible, has an effect on transcription result.
Summary of the invention
The embodiment of the present invention provides a kind of minutes method and apparatus for audio collecting device and user terminal, uses In at least one of solution above-mentioned technical problem.
In a first aspect, the embodiment of the present invention provides a kind of minutes method, it is used for audio collecting device, comprising: audio Acquisition equipment and user terminal establish connection;Audio collecting device acquires audio data in real time;The audio data is sent to Cloud transcription service simultaneously obtains the text data that cloud transcription service returns, wherein the cloud transcription service for pair The audio data carries out voice and turns text-processing;Via multiterminal collaboration services by the text data real-time synchronization to the use Family terminal.
Second aspect, the embodiment of the present invention provide a kind of minutes method, are used for user terminal, comprising: user terminal Connection is established with audio collecting device;It receives via the first synchronous text data of multiterminal collaboration services and by first text Data are inserted into the end of history text data;And/or in response to user to the editor of history text data, Xiang Suoshu multiterminal association Make service real-time Transmission change after history text data with by the history text real time data synchronization after the change to other User terminal.
The third aspect, the embodiment of the present invention provide a kind of minutes device for audio collecting device, comprising: first Link block, is configured to audio collecting device and user terminal establishes connection;It is real-time to be configured to audio collecting device for acquisition module Acquire audio data;Transcription module is configured to that the audio data is sent to cloud transcription service and obtains the cloud to turn Write the text data that service returns, wherein the cloud transcription service turns at text for carrying out voice to the audio data Reason;And real-time synchronization module, it is configured to via multiterminal collaboration services that the text data real-time synchronization is whole to the user End.
Fourth aspect, the embodiment of the present invention provide a kind of minutes device for user terminal, comprising: the second connection Module, is configured to user terminal and audio collecting device establishes connection;Insertion module is received, reception is configured to and cooperates via multiterminal First text data of service synchronization and the end that first text data is inserted into history text data;And/or change Synchronization module is configured to the editor in response to user to history text data, the change of Xiang Suoshu multiterminal collaboration services real-time Transmission History text data afterwards are with by the history text real time data synchronization after the change to other users terminal.
5th aspect, provides a kind of electronic equipment comprising: at least one processor, and with described at least one Manage the memory of device communication connection, wherein the memory is stored with the instruction that can be executed by least one described processor, institute It states instruction to be executed by least one described processor, so that at least one described processor is able to carry out any embodiment of the present invention The minutes method for audio collecting device and user terminal the step of.
6th aspect, the embodiment of the present invention also provide a kind of computer program product, and the computer program product includes The computer program being stored on non-volatile computer readable storage medium storing program for executing, the computer program include program instruction, when When described program instruction is computer-executed, make computer execution any embodiment of the present invention is used for audio collecting device The step of with the minutes method of user terminal.
Minutes scheme provided by the embodiments of the present application for audio collecting device and user terminal solves multiterminal The synchronous problem of content of edit.But product design is more helped to be, multiterminal cooperation is real-time during audio transcription Editor provides good support, so that user can change in the usage scenarios such as meeting, interview when listening, is finally reached quickly Export the purpose of destination document.
Detailed description of the invention
In order to illustrate the technical solution of the embodiments of the present invention more clearly, required use in being described below to embodiment Attached drawing be briefly described, it should be apparent that, drawings in the following description are some embodiments of the invention, for ability For the those of ordinary skill of domain, without creative efforts, it can also be obtained according to these attached drawings other attached Figure.
Fig. 1 is a kind of flow chart for minutes method for audio collecting device that one embodiment of the invention provides;
Fig. 2 is a kind of flow chart for minutes method for user terminal that one embodiment of the invention provides;
Fig. 3 is the flow chart for the minutes method that the another kind that one embodiment of the invention provides is used for user terminal;
Fig. 4 is used for the flow chart of the minutes method of user terminal for another that one embodiment of the invention provides;
Fig. 5 is a kind of each end interaction of a specific embodiment of minutes scheme that one embodiment of the invention provides Figure;
Fig. 6 is a kind of block diagram for minutes device for audio collecting device that one embodiment of the invention provides;
Fig. 7 is a kind of block diagram for minutes device for user terminal that one embodiment of the invention provides;
Fig. 8 is the structural schematic diagram for the electronic equipment that one embodiment of the invention provides.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is A part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art Every other embodiment obtained without creative efforts, shall fall within the protection scope of the present invention.
Referring to FIG. 1, it illustrates the minutes methods one for audio collecting device and user terminal of the application The minutes method of the flow chart of embodiment, the present embodiment can be adapted for audio collecting device, such as recording pen, call it is precious, Meeting treasured etc., there is no limit herein by the application.
As shown in Figure 1, in a step 101, audio collecting device and user terminal establish connection;
In a step 102, audio collecting device acquires audio data in real time;
In step 103, audio data is sent to cloud transcription service and obtains the text that transcription service in cloud returns Data, wherein cloud transcription service turns text-processing for carrying out voice to audio data;
At step 104, via multiterminal collaboration services by text data real-time synchronization to user terminal.
In the present embodiment, for step 101, audio collecting device establishes connection with each user terminal first.Later, right In step 102, audio collecting device acquires the audio data of user in real time, later, for step 103, by what is acquired in real time Audio data is sent to cloud transcription service, then obtains the text data that transcription service in cloud returns, the cloud transcription service Turn text-processing for carrying out voice to the audio data of acquisition.Then, for step 104, audio collecting device is via multiterminal Collaboration services are by the text data real-time synchronization of transcription to user terminal.For example, audio collecting device is individual hardware, such as Meeting is precious, after meeting treasured is turned on, establishes connection with the user terminal in range first, such as build with cell phone application end, the end Web Vertical connection can be and establish bluetooth or WiFi connection, later, the meeting based on same account again later by logging in same account The audio data of the precious acquisition user in real time of view, the audio data of acquisition are uploaded to cloud transcription service and carry out voice and turn text being formed Text data, text data are transmitted back to meeting treasured and are transmitted to multiterminal collaboration services by meeting treasured again, later via multiterminal collaboration services Real-time synchronization to the end APP and the end Web, thus complete the real-time transcription of meeting with it is synchronous.
The method of the present embodiment, by acquiring in real time, transcription with it is synchronous, may be implemented the transcription of conference content and editor Be synchronized to each terminal.Transcription provides support with synchronous for the real time inspection of conference content.Further, cloud collaboration services The modification of user in a certain terminal can also be synchronized to other terminals, so that realizing the synchronization of modification between each terminal.
In some alternative embodiments, after audio collection terminal acquires audio data in real time, method further include: warp Audio data is synchronized to large data center by cloud transcription service.To, audio data is synchronized to cloud large data center, Allow user terminal to download original audio data at any time, minutes are confirmed and are corrected.
In some alternative embodiments, after audio collecting device and user terminal establish connection, method further include: The account information of user terminal is obtained via connection to carry out between audio collecting device and user terminal based on account information Data transmission.It is associated with to which audio collecting device is based on the generation of identical account information with user terminal, convenient for the transmission of data With the safety of data.
In other optional embodiments, audio collecting device includes user terminal, and audio collecting device and user are whole It includes that user terminal and other users terminal establish connection that connection is established at end.It is set to which user terminal can also be used as audio collection It is standby, or it can be direct in the case where forgetting to carry professional audio collecting device in the case where of less demanding to recording quality Audio collection is carried out using portable devices such as mobile phone or computers.
In other optional embodiments, above-mentioned connection may include that bluetooth connection is connected with WiFi.So as to select Select the connection between bluetooth or WiFi progress audio collecting device and user terminal.
Referring to FIG. 2, it illustrates a kind of minutes methods for user terminal that one embodiment of the application provides. Which is suitable for intelligent subscriber equipment, such as mobile phone, pad, computer, and there is no limit herein by the application.
As shown in Fig. 2, in step 201, user terminal and audio collecting device establish connection;
In step 202, it receives via the first synchronous text data of multiterminal collaboration services and inserts the first text data Enter to the end of history text data;And/or
In step 203, in response to user to the editor of history text data, the change of Xiang Duoduan collaboration services real-time Transmission History text data afterwards are with the history text real time data synchronization after changing to other users terminal.
In the present embodiment, the connection sides such as bluetooth, WiFi are passed through for step 201, user terminal and audio collecting device Formula establishes connection, wherein the audio collecting device can be other user terminals, and there is no limit herein by the application.Later, right In step 202, user terminal receives the first text data being sent by audio collecting device, synchronizing via multiterminal collaboration services (transcription text data), and first text data is inserted into the end of history text data, if nothing in history text data Any data are then placed directly in the beginning of history text data, and there is no limit herein by the application.
For step 203, if user has carried out editing and updating to history text data, passed in real time to multiterminal collaboration services History text data after defeated change, thus via multiterminal collaboration services by the history text real time data synchronization after change to its His user terminal.
On the one hand the synchronous transcription text data of multiterminal collaboration services can be inserted into history text by the method for this implementation On the one hand the text data of user's edit-modify can be synchronized to other users end via multiterminal collaboration services by the end of data End.There are two focus, a focuses to be always positioned at the end of history text data for setting, should for being inserted into minutes data Focus can be set to invisible;The cursor can be placed on history text for the cursor of editor, user by another focus Any position of data, so that the data to any position are edited.
The synchronizing of content of edit can be content real-time synchronization that will modify to other users terminal, due to editor and Transcription is two different focuses, so will not interact, such as one of content will not occur and cover another content The problem of.Certainly, since computer end editor is more convenient, and mobile phone terminal editor is inconvenient, can also only have when meeting One terminal can use editting function, or only support an equipment editor in some time, when this equipment editor Other equipment are by the state that can not be edited, and there is no limit herein by the application.
With further reference to Fig. 3, it illustrates the meeting notes that the another kind that one embodiment of the application provides is used for user terminal Recording method.The flow chart is mainly the flow chart to the supplementary explanation of the additional technical feature of process Fig. 2.The flow chart is mainly User terminal is used for the process for the step of acquiring audio.
As shown in figure 3, in step 301, acquiring the audio data of user in real time in response to the record command of user;
In step 302, audio data is sent to cloud transcription service and obtains transcription service in cloud returns second Text data, wherein cloud transcription service turns text-processing for carrying out voice to audio data;
In step 303, via multiterminal collaboration services by the second text data real-time synchronization to other users terminal.
In the present embodiment, for step 301, user terminal is in response to the record command of user, such as user is in meeting When making a speech in the process, the audio data of user is acquired in real time.Later, for step 302, the audio data acquired in real time is sent It is serviced to cloud transcription and carries out the second text data that voice turns text-processing and obtains cloud transcription service return, second text Notebook data is also transcription text data.It is via multiterminal collaboration services that second text data is real-time finally, for step 303 It is synchronized to other users terminal, such as is synchronized to the end APP from the end Web, or be synchronized to the end Web from the end APP, the application does not have herein It is restricted.
The method of the present embodiment is recorded by user terminal and the audio data is then turned context synchronization to other users end End, so that the user of any user terminal can make a speech during meeting, the meeting being more in line in real life, user's body It tests more preferable.
With further reference to Fig. 4, another provided it illustrates one embodiment of the application is used for the meeting note of user terminal Recording method.The step of flow chart mainly further limits the additional technical feature after process Fig. 2 step 202.
As shown in figure 4, in step 401, being obtained in response to the audio acquisition instruction of user to large data center transmission audio Take request;
In step 402, the audio data that large data center returns is received.
In the present embodiment, for step 401, user terminal receives the audio data acquisition instruction of user, then to big Data center sends audio acquisition request.Later, for step 402, the audio data that large data center returns is received.
The method of the present embodiment obtains the approach of audio data by providing for user, user can be allowed to remember using meeting Record scheme can also obtain original audio data to the text data of minutes while conveniently record to conference content The confirmation or modification assisted.
In some alternative embodiments, after user terminal and audio collecting device establish connection, method further include: The account information of user terminal is sent to be based on account information in user terminal, other users to audio collecting device via connection Carry out data transmission between terminal and audio collecting device.To which user terminal and audio collecting device can pass through same account It number establishes connection and carries out data transmission, convenient integration to data and the safety for ensureing audio data.
Further alternative, the mode of above-mentioned connection includes that bluetooth connection is connected with WiFi.
It should be noted that above method step be not limited to each step execute sequence, in fact, certain steps The opposite sequence execution or certain steps to limit with step may be performed simultaneously perhaps substantially and without successive Sequentially, there is no limit herein by the application.
Below to some problems encountered in the implementation of the present invention by description inventor and to finally determination One specific embodiment of scheme is illustrated, so that those skilled in the art more fully understand the scheme of the application.
Inventors have found that in order to solve drawbacks described above existing in the prior art, the portioned product of the prior art may lead to Following method is crossed to solve:
Certain existing schemes increase the page of abstract at the end APP.Audio is made up by way of editor's abstract in recording The defect that cannot be edited in transcription, but edit and need to switch the page, it is not easy-to-use enough in actual use.
When the acquisition of other existing scheme sound intermediate frequencies is using hardware supported, complexity increases from original only both ends interaction To multiterminal interaction, abnormal stream process increases in design, more complicated.In addition, real-time edition is burnt because there is editor in recording transcription The collision problem of point and transcription text focus, like product lack deep thinking in the design to this problem.
The mentality of designing of this programme is as follows:
1, solve the problems, such as that mobile phone reception is not clear enough using special audio collection hardware.
2, multiterminal live collaboration service is established, audio collection hardware acquisition audio is simultaneously uploaded to cloud identification service in real time, Transcription text is pushed into the end APP and Web in real time by collaboration services again after identification service returns results, reaches transcription content Real-time synchronization.
3, terminal editor creates 2 focuses in audio transcription, edits focus and transcription text is inserted into focus.Ensure to edit burnt Point does not influence the position of text insertion content.
Referring to FIG. 5, it illustrates a specific embodiment of the minutes multiterminal collaboration mode of the scheme of the application, Although it should be noted that refer to some specific examples in following embodiment, the scheme being not intended to limit this application.
Step 1: audio collection hardware and mobile phone terminal establish binding relationship by bluetooth and connect device network.Mobile phone and After equipment establishes binding relationship, APP account can be synchronized to equipment end, with will pass through account accurately carry out audio collection hardware, Three ends of APP, Web connect.
Step 2: equipment recording is opened.There are many modes for opening equipment recording, since current 3 end is everywhere in connection Middle state, user may be selected to carry out recording unlatching from audio collection hardware, APP, Web any end, and other termination will be real after unlatching Apply synchronization state.
Step 3: equipment end uploads audio to identification service in real time and is identified.
Step 4: identification service, which identifies and returns to recognition result, gives audio collection hardware.
Step 5: audio collection hardware real-time synchronization recognition result and audio to live collaboration service, live collaboration service Content is pushed into the end APP and the end Web.The real time service of transcription for during users conference real time inspection and modification provide base Plinth.
Step 6: user can carry out real-time edition by the end APP and the end Web during transcription, when editor the end APP and The end Web real-time informing content of edit is to live collaboration service.
Step 7: the content of edit after change is pushed into each end by live collaboration service in real time.
It should be noted that, although the scheme recorded is illustrated only using the audio collecting device of profession in Fig. 5, but It is it will be understood by those in the art that the sound pick-up outfit can also directly use the terminals such as mobile phone, pad, computer.Sound pick-up outfit is to use The schematic diagram of family terminal no longer provides herein.
During realizing application scheme, inventor also attempted some other schemes, such as in product design At the beginning of, considered the scheme that timing automatically saves.Timing automatically saves, and is a kind of pseudo- real-time proposals in fact, when multiterminal cooperate still It so has and causes the problem of content of edit is by improper covering due to saving time difference problem, so abandoning.
The solution of multiterminal live collaboration provided by the embodiments of the present application solves the problems, such as that multiterminal content of edit is synchronous. But product design is more helped to be, multiterminal cooperation provides good branch for real-time edition during audio transcription It holds, so that user can change in the usage scenarios such as meeting, interview when listening, is finally reached the mesh of quickly output destination document 's.
Referring to FIG. 6, it illustrates the minutes devices for audio collecting device that one embodiment of the invention provides Block diagram.
As shown in fig. 6, including the first link block 610, acquisition module for audio collecting device minutes device 600 620, transcription module 630 and real-time synchronization module 640.
Wherein, the first link block 610, is configured to audio collecting device and user terminal establishes connection;Acquisition module 620, it is configured to audio collecting device and acquires audio data in real time;Transcription module 630 is configured to for the audio data being sent to Cloud transcription service simultaneously obtains the text data that cloud transcription service returns, wherein the cloud transcription service for pair The audio data carries out voice and turns text-processing;And real-time synchronization module 640, it is configured to institute via multiterminal collaboration services Text data real-time synchronization is stated to the user terminal.
Referring to FIG. 7, the frame of the minutes device for user terminal provided it illustrates one embodiment of the invention Figure.
As shown in fig. 7, being used for the minutes device 700 of user terminal, including the second link block 710, reception insertion Module 720 and/or change synchronization module 730.
Wherein, the second link block 710, is configured to user terminal and audio collecting device establishes connection;Receive insertion mould Block 720 is configured to receive via the first synchronous text data of multiterminal collaboration services and be inserted into first text data The end of history text data;And/or change synchronization module 730, it is configured to the editor in response to user to history text data, To the history text data after multiterminal collaboration services real-time Transmission change with the history text data after the change are real When be synchronized to other users terminal.
It should be appreciated that all modules recorded in Fig. 6 and Fig. 7 are opposite with each step in the method with reference to described in Fig. 1 It answers.The operation above with respect to method description and feature and corresponding technical effect are equally applicable in Fig. 6 and Fig. 7 as a result, All modules, details are not described herein.
It is worth noting that, the module in embodiment of the disclosure is not limited to the scheme of the disclosure, such as acquire Module can be described as the module that audio collecting device acquires audio data in real time.Furthermore it is also possible to by hardware processor come Realize that related function module, such as acquisition module can also realize that details are not described herein with processor.
In further embodiments, the embodiment of the invention also provides a kind of nonvolatile computer storage medias, calculate Machine storage medium is stored with computer executable instructions, which can be performed in above-mentioned any means embodiment The minutes method for audio collecting device and user terminal;
As an implementation, nonvolatile computer storage media of the invention is stored with the executable finger of computer It enables, computer executable instructions setting are as follows:
Audio collecting device and user terminal establish connection;
Audio collecting device acquires audio data in real time;
The audio data is sent to cloud transcription service and obtains the text data that the cloud transcription service returns, Wherein, the cloud transcription service turns text-processing for carrying out voice to the audio data;
Via multiterminal collaboration services by the text data real-time synchronization to the user terminal.
As an implementation, nonvolatile computer storage media of the invention is stored with the executable finger of computer It enables, computer executable instructions setting are as follows:
User terminal and audio collecting device establish connection;
It receives via the first synchronous text data of multiterminal collaboration services and first text data is inserted into history The end of text data;And/or
History in response to user to the editor of history text data, after the change of Xiang Suoshu multiterminal collaboration services real-time Transmission Text data is with by the history text real time data synchronization after the change to other users terminal.
Non-volatile computer readable storage medium storing program for executing may include storing program area and storage data area, wherein storage journey It sequence area can application program required for storage program area, at least one function;Storage data area can be stored according to for audio The minutes device of acquisition equipment and user terminal uses created data etc..In addition, non-volatile computer is readable Storage medium may include high-speed random access memory, can also include nonvolatile memory, for example, at least a disk Memory device, flush memory device or other non-volatile solid state memory parts.In some embodiments, non-volatile computer can It includes the memory remotely located relative to processor that it is optional, which to read storage medium, these remote memories can pass through network connection To the minutes device for being used for audio collecting device and user terminal.The example of above-mentioned network include but is not limited to internet, Intranet, local area network, mobile radio communication and combinations thereof.
The embodiment of the present invention also provides a kind of computer program product, and computer program product is non-volatile including being stored in Computer program on computer readable storage medium, computer program include program instruction, when program instruction is held by computer When row, computer is made to execute the minutes method that any of the above-described is used for audio collecting device and user terminal.
Fig. 8 is the structural schematic diagram of electronic equipment provided in an embodiment of the present invention, as shown in figure 8, the equipment includes: one Or multiple processors 810 and memory 820, in Fig. 8 by taking a processor 810 as an example.For audio collecting device and user The equipment of the minutes method of terminal can also include: input unit 830 and output device 840.Processor 810, memory 820, input unit 830 can be connected with output device 840 by bus or other modes, to be connected by bus in Fig. 8 For.Memory 820 is above-mentioned non-volatile computer readable storage medium storing program for executing.Processor 810 is stored in storage by operation Non-volatile software program, instruction and module in device 820, at the various function application and data of server The minutes method that reason, i.e. realization above method embodiment are used for audio collecting device and user terminal.Input unit 830 can The number or character information of input are received, and generates key related with the user setting of meeting recording device and function control Signal input.Output device 840 may include that display screen etc. shows equipment.
Method provided by the embodiment of the present invention can be performed in the said goods, has the corresponding functional module of execution method and has Beneficial effect.The not technical detail of detailed description in the present embodiment, reference can be made to method provided by the embodiment of the present invention.
As an implementation, above-mentioned electronic apparatus application is used in audio collecting device in minutes device, It include: at least one processor;And the memory being connect at least one processor communication;Wherein, be stored with can for memory The instruction executed by least one processor, instruction executed by least one processor so that at least one processor can:
Audio collecting device and user terminal establish connection;
Audio collecting device acquires audio data in real time;
The audio data is sent to cloud transcription service and obtains the text data that the cloud transcription service returns, Wherein, the cloud transcription service turns text-processing for carrying out voice to the audio data;
Via multiterminal collaboration services by the text data real-time synchronization to the user terminal.
As an implementation, above-mentioned electronic apparatus application is used for user terminal in minutes device, comprising: At least one processor;And the memory being connect at least one processor communication;Wherein, be stored with can be by extremely for memory The instruction that a few processor executes, instruction are executed by least one processor so that at least one processor can:
User terminal and audio collecting device establish connection;
It receives via the first synchronous text data of multiterminal collaboration services and first text data is inserted into history The end of text data;And/or
History in response to user to the editor of history text data, after the change of Xiang Suoshu multiterminal collaboration services real-time Transmission Text data is with by the history text real time data synchronization after the change to other users terminal.
The electronic equipment of the embodiment of the present application exists in a variety of forms, including but not limited to:
(1) mobile communication equipment: the characteristics of this kind of equipment is that have mobile communication function, and to provide speech, data Communication is main target.This Terminal Type includes: smart phone (such as iPhone), multimedia handset, functional mobile phone and low Hold mobile phone etc..
(2) super mobile personal computer equipment: this kind of equipment belongs to the scope of personal computer, there is calculating and processing function Can, generally also have mobile Internet access characteristic.This Terminal Type includes: PDA, MID and UMPC equipment etc., such as iPad.
(3) portable entertainment device: this kind of equipment can show and play multimedia content.Such equipment include: audio, Video player (such as iPod), handheld device, e-book and intelligent toy and portable car-mounted navigation equipment.
(4) server: providing the equipment of the service of calculating, and the composition of server includes that processor, hard disk, memory, system are total Line etc., server is similar with general computer architecture, but due to needing to provide highly reliable service, in processing energy Power, stability, reliability, safety, scalability, manageability etc. are more demanding.
(5) other electronic devices with data interaction function.
The apparatus embodiments described above are merely exemplary, wherein unit can be as illustrated by the separation member Or may not be and be physically separated, component shown as a unit may or may not be physical unit, i.e., It can be located in one place, or may be distributed over multiple network units.It can select according to the actual needs therein Some or all of the modules achieves the purpose of the solution of this embodiment.Those of ordinary skill in the art are not paying creative labor In the case where dynamic, it can understand and implement.
Through the above description of the embodiments, those skilled in the art can be understood that each embodiment can It realizes by means of software and necessary general hardware platform, naturally it is also possible to pass through hardware.Based on this understanding, on Stating technical solution, substantially the part that contributes to existing technology can be embodied in the form of software products in other words, should Computer software product may be stored in a computer readable storage medium, such as ROM/RAM, magnetic disk, CD, including several fingers It enables and using so that a computer equipment (can be personal computer, server or the network equipment etc.) executes each implementation The method of certain parts of example or embodiment.
Finally, it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;Although Present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art should understand that: it still may be used To modify the technical solutions described in the foregoing embodiments or equivalent replacement of some of the technical features; And these are modified or replaceed, technical solution of various embodiments of the present invention that it does not separate the essence of the corresponding technical solution spirit and Range.

Claims (10)

1. a kind of minutes method is used for audio collecting device, comprising:
Audio collecting device and user terminal establish connection;
Audio collecting device acquires audio data in real time;
The audio data is sent to cloud transcription service and obtains the text data that the cloud transcription service returns, In, the cloud transcription service turns text-processing for carrying out voice to the audio data;
Via multiterminal collaboration services by the text data real-time synchronization to the user terminal.
2. according to the method described in claim 1, wherein, after the audio collection terminal acquires audio data in real time, institute State method further include:
The audio data is synchronized to large data center via the cloud transcription service.
3. according to the method described in claim 1, wherein, after the audio collecting device and user terminal establish connection, The method also includes:
The account information of the user terminal is obtained via the connection to set based on the account information in the audio collection Carry out data transmission between the standby and described user terminal.
4. method according to any one of claim 1-3, wherein the audio collecting device includes user terminal, institute Stating audio collecting device and establishing connection with user terminal includes that user terminal and other users terminal establish connection.
5. according to the method described in claim 4, wherein, the connection includes that bluetooth connection is connected with WiFi.
6. a kind of minutes method is used for user terminal, comprising:
User terminal and audio collecting device establish connection;
It receives via the first synchronous text data of multiterminal collaboration services and first text data is inserted into history text The end of data;And/or
History text in response to user to the editor of history text data, after the change of Xiang Suoshu multiterminal collaboration services real-time Transmission Data are with by the history text real time data synchronization after the change to other users terminal.
7. according to the method described in claim 6, wherein, the method also includes:
Acquire the audio data of user in real time in response to the record command of user;
The audio data is sent to cloud transcription service and obtains the second text data that the cloud transcription service returns, Wherein, the cloud transcription service turns text-processing for carrying out voice to the audio data;
Via multiterminal collaboration services by the second text data real-time synchronization to other users terminal.
8. according to the method described in claim 6, wherein, receiving first textual data synchronous via multiterminal collaboration services described It is inserted into after the end of history text data according to and by first text data, the method also includes:
Audio acquisition request is sent to large data center in response to the audio acquisition instruction of user;
Receive the audio data that large data center returns.
9. method a method according to any one of claims 6-8, wherein established in the user terminal and audio collecting device After connection, the method also includes:
The account information of the user terminal is sent via described connect to the audio collecting device to believe based on the account Breath carries out data transmission between the user terminal, the other users terminal and the audio collecting device.
10. according to the method described in claim 9, wherein, the connection includes that bluetooth connection is connected with WiFi.
CN201811585400.8A 2018-12-24 2018-12-24 Conference recording method and device for audio acquisition equipment and user terminal Active CN109599115B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811585400.8A CN109599115B (en) 2018-12-24 2018-12-24 Conference recording method and device for audio acquisition equipment and user terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811585400.8A CN109599115B (en) 2018-12-24 2018-12-24 Conference recording method and device for audio acquisition equipment and user terminal

Publications (2)

Publication Number Publication Date
CN109599115A true CN109599115A (en) 2019-04-09
CN109599115B CN109599115B (en) 2022-03-22

Family

ID=65964430

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811585400.8A Active CN109599115B (en) 2018-12-24 2018-12-24 Conference recording method and device for audio acquisition equipment and user terminal

Country Status (1)

Country Link
CN (1) CN109599115B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110246501A (en) * 2019-07-02 2019-09-17 苏州思必驰信息科技有限公司 Audio recognition method and system for minutes
CN111177353A (en) * 2019-12-27 2020-05-19 拉克诺德(深圳)科技有限公司 Text record generation method and device, computer equipment and storage medium
CN112637147A (en) * 2020-12-13 2021-04-09 青岛希望鸟科技有限公司 Method, terminal and server for establishing and connecting communication service through audio
CN113571061A (en) * 2020-04-28 2021-10-29 阿里巴巴集团控股有限公司 System, method, device and equipment for editing voice transcription text
WO2022135254A1 (en) * 2020-12-22 2022-06-30 华为技术有限公司 Text editing method, electronic device and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140081635A1 (en) * 2008-02-22 2014-03-20 Apple Inc. Providing Text Input Using Speech Data and Non-Speech Data
CN105245355A (en) * 2015-10-14 2016-01-13 安徽声讯信息技术有限公司 Intelligent voice shorthand conference system
CN108074570A (en) * 2017-12-26 2018-05-25 安徽声讯信息技术有限公司 Surface trimming, transmission, the audio recognition method preserved
CN108133710A (en) * 2017-12-26 2018-06-08 安徽声讯信息技术有限公司 Long-range record refreshes and the high in the clouds data processing system of multiport synchronous vacations
CN108597518A (en) * 2018-03-21 2018-09-28 安徽咪鼠科技有限公司 A kind of minutes intelligence microphone system based on speech recognition

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140081635A1 (en) * 2008-02-22 2014-03-20 Apple Inc. Providing Text Input Using Speech Data and Non-Speech Data
CN105245355A (en) * 2015-10-14 2016-01-13 安徽声讯信息技术有限公司 Intelligent voice shorthand conference system
CN108074570A (en) * 2017-12-26 2018-05-25 安徽声讯信息技术有限公司 Surface trimming, transmission, the audio recognition method preserved
CN108133710A (en) * 2017-12-26 2018-06-08 安徽声讯信息技术有限公司 Long-range record refreshes and the high in the clouds data processing system of multiport synchronous vacations
CN108597518A (en) * 2018-03-21 2018-09-28 安徽咪鼠科技有限公司 A kind of minutes intelligence microphone system based on speech recognition

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110246501A (en) * 2019-07-02 2019-09-17 苏州思必驰信息科技有限公司 Audio recognition method and system for minutes
CN110246501B (en) * 2019-07-02 2022-02-01 思必驰科技股份有限公司 Voice recognition method and system for conference recording
CN111177353A (en) * 2019-12-27 2020-05-19 拉克诺德(深圳)科技有限公司 Text record generation method and device, computer equipment and storage medium
CN113571061A (en) * 2020-04-28 2021-10-29 阿里巴巴集团控股有限公司 System, method, device and equipment for editing voice transcription text
CN112637147A (en) * 2020-12-13 2021-04-09 青岛希望鸟科技有限公司 Method, terminal and server for establishing and connecting communication service through audio
CN112637147B (en) * 2020-12-13 2022-08-05 青岛希望鸟科技有限公司 Method, terminal and server for establishing and connecting communication service through audio
WO2022135254A1 (en) * 2020-12-22 2022-06-30 华为技术有限公司 Text editing method, electronic device and system

Also Published As

Publication number Publication date
CN109599115B (en) 2022-03-22

Similar Documents

Publication Publication Date Title
CN109599115A (en) Minutes method and apparatus for audio collecting device and user terminal
US9449523B2 (en) Systems and methods for narrating electronic books
CN109951743A (en) Barrage information processing method, system and computer equipment
US20120297284A1 (en) Media presentation playback annotation
CN109117235B (en) A kind of business data processing method, device and relevant device
CN111049996A (en) Multi-scene voice recognition method and device and intelligent customer service system applying same
CN104394437B (en) A kind of online live method and system that start broadcasting
CN109361527B (en) Voice conference recording method and system
CN108460120A (en) Data save method, device, terminal device and storage medium
CN105975063B (en) A kind of method and apparatus controlling intelligent terminal
CN103905216A (en) Team-building method, client, server and system
CN108271096A (en) A kind of task executing method, device, intelligent sound box and storage medium
CN110136713A (en) Dialogue method and system of the user in multi-modal interaction
CN104464743B (en) Method for playing background music in voice chat room and mobile terminal
CN102427465A (en) Voice service proxy method and device and system for integrating voice application through proxy
CN102737690B (en) Method and terminal that music application starts
CN108320761B (en) Audio recording method, intelligent recording device and computer readable storage medium
CN109271503A (en) Intelligent answer method, apparatus, equipment and storage medium
JP2020514936A (en) Method and device for quick insertion of voice carrier text
CN110517692A (en) Hot word audio recognition method and device
CN112581965A (en) Transcription method, device, recording pen and storage medium
CN102882565B (en) A kind of data process, sending method and relevant device
CN108228134A (en) A kind of processing method, device, intelligent sound box and the storage medium of task voice
CN104702758B (en) A kind of terminal and its method for managing multimedia notepad
KR101351264B1 (en) System and method for message translation based on voice recognition

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province

Applicant after: Sipic Technology Co.,Ltd.

Address before: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province

Applicant before: AI SPEECH Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant