CN109492206A - PPT presentation file recording method, device, computer equipment and storage medium - Google Patents
- Publication number
- CN109492206A (application number CN201811177454.0A)
- Authority
- CN
- China
- Prior art keywords
- document
- ppt
- speech
- presentation file
- text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/62—Text, e.g. of license plates, overlay texts or captions on TV images
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Character Discrimination (AREA)
Abstract
This application relates to a PPT presentation file recording method, device, computer equipment and storage medium. The method includes: performing customized shooting of a PPT presentation file during a presentation to obtain document pictures of the PPT presentation file and speech audio of the speaker; uploading the document pictures and the speech audio to a server; controlling the server to perform text recognition on the document pictures to generate a target PPT document; and controlling the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain a speech document. The method automatically converts the speaker's spoken content into text for recording, so no manual collation is needed afterwards.
Description
Technical field
This application relates to the field of computer technology, and in particular to a PPT (PowerPoint) presentation file recording method, device, computer equipment and storage medium.
Background
At present, PPT documents have become an important means of expression for all kinds of reports and teaching. The speaker presents the main content of the speech to the audience by projecting the PPT onto a projection screen or display. The audience understands the gist of the speech mainly through the displayed PPT document and the speaker's oral explanation. Audience members mainly use recording equipment to capture the speaker's oral explanation, and use a camera to photograph or video-record the PPT document content during the speech. However, whether audio recording, photographing or video recording is used, the material must be collated manually afterwards to form a reasonably complete document. In particular, the speaker's oral explanation has to be transcribed by hand into a document, and this collation work consumes a great deal of time and effort.
Summary of the invention
Based on this, it is necessary, in view of the above technical problem, to provide a PPT presentation file recording method, device, computer equipment and storage medium that can automatically convert the speaker's spoken content into text for recording.
A PPT presentation file recording method, the method comprising:
performing customized shooting of a PPT presentation file during a presentation to obtain document pictures of the PPT presentation file and speech audio of the speaker;
uploading the document pictures and the speech audio to a server;
controlling the server to perform text recognition on the document pictures to generate a target PPT document; and
controlling the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain a speech document.
In one embodiment, the step of shooting the PPT presentation file during the presentation specifically includes: performing customized shooting of the PPT document within the camera frame using a four-corner selection frame; and taking pictures continuously at a preset time interval while recording the speaker.
In one embodiment, before the step of uploading the document pictures and the speech audio to the server, the method further includes: performing deduplication on the document pictures.
In one embodiment, the method further includes performing shape correction on the document pictures using an image processing algorithm.
In one embodiment, the step of controlling the server to perform text recognition on the document pictures to generate the target PPT document includes: controlling the server to perform text recognition on each document picture using an OCR recognition algorithm to obtain document text and layout information; and generating the target PPT document according to the document text and the layout information.
In one embodiment, the method further includes: entering the speech text into the corresponding target PPT document according to the timestamp of the target PPT document and the timeline information of the speech text.
A PPT presentation file recording device, the device comprising:
a shooting module, configured to perform customized shooting of a PPT presentation file during a presentation to obtain document pictures of the PPT presentation file and speech audio of the speaker;
an uploading module, configured to upload the document pictures and the speech audio to a server;
a text recognition module, configured to control the server to perform text recognition on the document pictures to generate a target PPT document; and
a speech recognition module, configured to control the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain a speech document.
A computer equipment, including a memory and a processor, the memory storing a computer program, wherein the processor implements the following steps when executing the computer program:
shooting a PPT presentation file during a presentation to obtain document pictures of the PPT presentation file and speech audio of the speaker;
uploading the document pictures and the speech audio to a server;
controlling the server to perform text recognition on the document pictures to generate a target PPT document; and
controlling the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain a speech document.
A computer-readable storage medium storing a computer program, wherein the computer program implements the following steps when executed by a processor:
shooting a PPT presentation file during a presentation to obtain document pictures of the PPT presentation file and speech audio of the speaker;
uploading the document pictures and the speech audio to a server;
controlling the server to perform text recognition on the document pictures to generate a target PPT document; and
controlling the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain a speech document.
With the above PPT presentation file recording method, device, computer equipment and storage medium, the photographic device obtains document pictures and speech audio by shooting the PPT presentation file, performs text recognition on the document pictures to obtain the target PPT document, and performs speech recognition on the speech audio to obtain the speech document. The speaker's spoken content is thus automatically converted into text for recording, with no manual collation needed afterwards.
Brief description of the drawings
Fig. 1 is a diagram of the application environment of the PPT presentation file recording method in one embodiment;
Fig. 2 is a flow diagram of the PPT presentation file recording method in one embodiment;
Fig. 3 is a shooting picture of the PPT presentation file recording method in one embodiment;
Fig. 4 is an effect picture of the PPT presentation file recording method in one embodiment;
Fig. 5 is a structural block diagram of the PPT presentation file recording device in one embodiment;
Fig. 6 is an internal structure diagram of the computer equipment in one embodiment.
Detailed description
To make the objectives, technical solutions and advantages of this application clearer, the application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the application and are not intended to limit it.
The PPT presentation file recording method provided by this application can be applied in a terminal: the terminal shoots the projected PPT presentation file and processes the document pictures and speech audio obtained by shooting. The terminal may be, but is not limited to, a personal computer, laptop, smartphone or tablet computer. The method can also be applied in the application environment shown in Fig. 1. The photographic device 102 may communicate with the server 104 over a network, or through an electrical connection. When the speaker presents the projected PPT presentation file 106, the photographic device 102 captures the PPT presentation material and uploads it to the server for processing. The server 104 may be implemented as an independent server or as a server cluster composed of multiple servers.
In one embodiment, as shown in Fig. 2, a PPT presentation file recording method is provided. The method is described as applied to the photographic device 102 in Fig. 1, and includes the following steps:
Step 202: shoot the PPT presentation file during the presentation to obtain document pictures of the PPT presentation file and speech audio of the speaker.
Before the method is carried out, the user can place the photographic device in a position where its view of the projected PPT presentation file is not blocked, aim its camera at the PPT presentation file, and adjust the device so that it captures the PPT presentation file completely and clearly. When the speaker begins presenting the PPT presentation file, the user starts the photographic device to shoot it. The photographic device photographs the projected PPT presentation file while recording the speaker, obtaining multiple document pictures and the speaker's speech audio. The document pictures and speech audio obtained by shooting can be stored in the local memory of the photographic device.
In one embodiment, customized shooting of the PPT document within the camera frame is performed using a four-corner selection frame; pictures are taken continuously at a preset time interval while the speaker is recorded.
The user can operate the photographic device to preset a time interval for automatic photographing, for example 2 s. When the speaker starts the PPT presentation, the user starts the photographic device, which automatically and continuously photographs the projected PPT presentation file at the preset time interval to obtain multiple document pictures. While photographing the PPT presentation file, the photographic device also records the speaker to obtain the speech audio. When the speech ends, the user stops the shooting operation of the photographic device. In this embodiment, automatic continuous shooting of the PPT presentation file at a set interval frees the user's hands, since the user does not need to operate the photographic device manually for every shot.
In one embodiment, before the step of uploading the document pictures and the speech audio to the server, the method further includes: performing deduplication on the document pictures.
When the photographic device has obtained multiple document pictures, it performs deduplication on them. Specifically, the photographic device compares the feature similarity of all document pictures and deletes document pictures whose similarity is greater than a threshold. Deletion is based on the shooting timestamp: among multiple document pictures whose similarity is greater than the threshold, the one with the earliest shooting time is retained and the others are deleted. The photographic device may use a perceptual hash algorithm to compare the feature similarity of the document pictures. The threshold can be set by the user, for example 95%. In this embodiment, deduplication eliminates the repeated document pictures that the photographic device takes of the same page of the PPT presentation file, so that the document pictures correspond one-to-one with the pages of the PPT presentation file.
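The deduplication described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: it uses an average hash, one simple member of the perceptual hash family, and represents each document picture as a nested list of grayscale values so that no image library is needed. The names `average_hash`, `similarity` and `deduplicate` are hypothetical.

```python
from typing import List, Sequence, Tuple

Gray = Sequence[Sequence[int]]  # 2D grid of grayscale values (0-255); assumed input format


def average_hash(pixels: Gray, size: int = 8) -> int:
    """Block-average the picture down to size x size cells, then set one bit per
    cell: 1 if the cell is brighter than the mean of all cells, else 0."""
    h, w = len(pixels), len(pixels[0])
    cells = []
    for r in range(size):
        r0 = r * h // size
        r1 = max((r + 1) * h // size, r0 + 1)  # at least one row per cell
        for c in range(size):
            c0 = c * w // size
            c1 = max((c + 1) * w // size, c0 + 1)  # at least one column per cell
            block = [pixels[y][x] for y in range(r0, r1) for x in range(c0, c1)]
            cells.append(sum(block) / len(block))
    mean = sum(cells) / len(cells)
    bits = 0
    for value in cells:
        bits = (bits << 1) | (1 if value > mean else 0)
    return bits


def similarity(hash_a: int, hash_b: int, nbits: int = 64) -> float:
    """Fraction of matching bits between two hashes (1.0 means identical)."""
    return 1 - bin(hash_a ^ hash_b).count("1") / nbits


def deduplicate(shots: List[Tuple[float, Gray]],
                threshold: float = 0.95) -> List[Tuple[float, Gray]]:
    """Keep the earliest picture of each group whose similarity exceeds threshold.
    Each shot is a (shooting timestamp, picture) pair."""
    kept = []  # (timestamp, picture, hash) of retained pictures, earliest first
    for ts, picture in sorted(shots, key=lambda shot: shot[0]):
        h = average_hash(picture)
        if all(similarity(h, kept_hash) <= threshold for _, _, kept_hash in kept):
            kept.append((ts, picture, h))
    return [(ts, picture) for ts, picture, _ in kept]
```

Because each new picture is compared with every retained picture, a page that the speaker returns to later is also treated as a duplicate. Comparing only with the most recently retained picture would instead keep revisited pages; the source does not say which behavior is intended.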
In one embodiment, the PPT presentation file recording method further includes performing shape correction on the document pictures using an image processing algorithm.
Fig. 3 shows a photo obtained by customized shooting with the photographic device. Because of the shooting angle of the photographic device, the PPT 301 contained in the captured document picture may be deformed relative to the original PPT presentation file, and needs to be corrected so that its shape is consistent with that of the PPT presentation file. In this embodiment, the photographic device performs shape correction on the document picture using an image processing algorithm so that its shape is consistent with that of the PPT presentation file. The corrected document picture is shown in Fig. 4, where 401 is the PPT contained in the corrected document picture.
In this embodiment, shape correction restores the deformed document picture to its normal shape, which facilitates subsequent text recognition and normal viewing of formulas, illustrations and the like in the document picture.
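The source does not name the image processing algorithm used for shape correction. One common choice for undoing a shooting-angle deformation is a perspective (homography) transform that maps the four corners of the photographed PPT onto an upright rectangle. The sketch below assumes that approach and assumes the four corners are already known, for example from the four-corner selection frame. All function names are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate corner configuration")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def homography(src: Sequence[Point], dst: Sequence[Point]) -> List[float]:
    """Return the 3x3 homography (row-major, h33 fixed to 1) mapping each of the
    four src corners onto the matching dst corner."""
    rows, rhs = [], []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        rhs.append(u)
        rows.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        rhs.append(v)
    return solve_linear(rows, rhs) + [1.0]


def warp_point(h: Sequence[float], p: Point) -> Point:
    """Apply the homography to one point of the photographed picture."""
    x, y = p
    w = h[6] * x + h[7] * y + h[8]
    return ((h[0] * x + h[1] * y + h[2]) / w,
            (h[3] * x + h[4] * y + h[5]) / w)
```

To correct a whole picture, an implementation would typically compute the inverse mapping and sample the photographed picture once for every pixel of the upright output, which avoids holes in the result.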
Step 204: upload the document pictures and the speech audio to the server.
The photographic device uploads the document pictures and the speech audio to the server. Specifically, the photographic device packages all document pictures into a document in PPT format, and uploads the PPT-format document and the speech audio to the server.
Step 206: control the server to perform text recognition on the document pictures to generate the target PPT document.
The photographic device controls the server to perform text recognition on the document pictures to obtain document text, and the target PPT document is generated according to the document text. The text in the target PPT document is independently editable text.
In one embodiment, the step of controlling the server to perform text recognition on the document pictures to generate the target PPT document includes: controlling the server to perform text recognition on each document picture using an OCR (optical character recognition) recognition algorithm to obtain document text and layout information; and generating the target PPT document according to the document text and the layout information.
The photographic device controls the server to perform text recognition on each document picture using the OCR recognition algorithm. The server scans and analyzes the document picture to obtain document text and layout information. The document text includes attribute information of the text, namely its format, size, color and the spatial position of the characters. The server generates the corresponding target PPT document according to the document text and its attribute information. Further, the server takes screenshots of formulas, illustrations and the like in the document picture and inserts them into the target PPT document at the positions corresponding to their positions in the document picture. In this embodiment, text recognition is performed on the document pictures using the OCR recognition algorithm, and the target PPT document is regenerated from the recognized document text and layout information. The text in a PPT document obtained this way can be edited again.
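The document text, its attribute information and the cropped formulas or illustrations could be held in a small page model before the target PPT document is written out. The sketch below is only one possible shape for that data. The class and field names (`TextBlock`, `ImageCrop`, `Slide`, `build_slide`, `to_relative`) are hypothetical, and the OCR step itself is assumed to have already produced the blocks.

```python
from dataclasses import dataclass, field
from typing import List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # left, top, width, height in picture pixels


@dataclass
class TextBlock:
    """One recognized run of text together with its attribute information."""
    text: str
    box: Box
    font_size: float
    color: str = "#000000"


@dataclass
class ImageCrop:
    """A region kept as a picture (formula, illustration) instead of text."""
    box: Box
    kind: str  # e.g. "formula" or "illustration"


@dataclass
class Slide:
    width: int
    height: int
    texts: List[TextBlock] = field(default_factory=list)
    crops: List[ImageCrop] = field(default_factory=list)


def build_slide(width: int, height: int,
                texts: Sequence[TextBlock],
                crops: Sequence[ImageCrop]) -> Slide:
    """Assemble one page, ordering text top-to-bottom and then left-to-right."""
    ordered = sorted(texts, key=lambda t: (t.box[1], t.box[0]))
    return Slide(width, height, ordered, list(crops))


def to_relative(box: Box, slide: Slide) -> Tuple[float, float, float, float]:
    """Express a pixel box as fractions of the page, so it can be placed on a
    target slide of any size."""
    left, top, w, h = box
    return (left / slide.width, top / slide.height,
            w / slide.width, h / slide.height)
```

Storing positions as fractions of the page is a design choice of this sketch: it keeps the layout independent of the resolution of the photographed picture.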
Step 208: control the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain the speech document.
The photographic device controls the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain the speech document. Specifically, the photographic device controls the server to filter the speech audio, perform speech recognition on the filtered speech audio, and store the recognized speech text as the speech document. The speech document may be in txt, doc, docx, ppt or another format.
In this embodiment, the photographic device obtains document pictures and speech audio by shooting the PPT presentation file, performs text recognition on the document pictures to obtain the target PPT document, and performs speech recognition on the speech audio to obtain the speech document. The speaker's spoken content is thus automatically converted into text for recording, with no manual collation needed afterwards.
In one embodiment, the PPT presentation file recording method further includes: entering the speech text into the corresponding target PPT document according to the timestamp of the target PPT document and the timeline information of the speech text.
The timestamp of the target PPT document is generated according to the shooting time. The shooting time may be the real time, for example the current Beijing time. The shooting time may also be a relative time: for example, the photographic device sets the start of the shooting operation to 0 and assigns a time to each captured document picture according to the preset time interval. If the preset time interval is denoted t, the time of the first document picture on the timestamp is 0, that of the second document picture is t, that of the third is 2t, and so on, so that the time of the n-th document picture is (n-1)t.
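As a quick check of the relative-time rule, the timestamps of the first few document pictures can be computed directly. The function name `relative_timestamps` is hypothetical, and the 2 s interval is the example value given earlier in the description.

```python
from typing import List


def relative_timestamps(count: int, interval: float) -> List[float]:
    """Timestamp of the n-th picture (1-based) is (n - 1) * interval."""
    return [(n - 1) * interval for n in range(1, count + 1)]


# With a 2 s interval, the first four pictures fall at 0, 2, 4 and 6 seconds.
print(relative_timestamps(4, 2.0))
```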
In this embodiment, after controlling the server to perform speech recognition on the speech audio, the photographic device attaches timeline information to the obtained speech text according to the recording time of the speech audio. Further, the photographic device controls the server to match the timestamp of the target PPT document with the timeline information of the speech text and to enter the speech text into the corresponding target PPT document. Specifically, the speech text is entered into the notes of the target PPT document.
In this embodiment, matching the speaker's speech text and recording it in the target PPT document combines the content of the PPT presentation file with the content of the speech, making the resulting target PPT document clearer and easier to read and browse.
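The matching of speech text to pages can be sketched as an interval lookup: each piece of speech text goes to the page whose timestamp is the latest one not later than the moment that piece was spoken. The source does not spell out the matching rule, so this rule, the segment format (start time, text) and the function name `assign_speech_to_slides` are all assumptions made for illustration.

```python
from bisect import bisect_right
from typing import Dict, List, Sequence, Tuple


def assign_speech_to_slides(slide_times: Sequence[float],
                            segments: Sequence[Tuple[float, str]]) -> Dict[int, str]:
    """Map each page index to the speech text spoken while that page was shown.

    slide_times: ascending timestamps of the retained document pictures.
    segments:    (start time, recognized text) pairs on the same time axis.
    """
    if not slide_times:
        raise ValueError("at least one page timestamp is required")
    notes: Dict[int, List[str]] = {i: [] for i in range(len(slide_times))}
    for start, text in sorted(segments):
        # Index of the last page whose timestamp is <= start; speech recorded
        # before the first page is attached to page 0.
        index = max(bisect_right(slide_times, start) - 1, 0)
        notes[index].append(text)
    return {i: " ".join(parts) for i, parts in notes.items()}


if __name__ == "__main__":
    pages = [0.0, 30.0, 75.0]
    speech = [(1.5, "Welcome."), (28.0, "Agenda."), (30.0, "First topic."), (80.0, "Summary.")]
    for index, note in assign_speech_to_slides(pages, speech).items():
        print(index, note)
```

The resulting string for each page is what would be written into that page's notes in the target PPT document.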
It should be understood that although the steps in the flowchart of Fig. 2 are shown in sequence as indicated by the arrows, these steps are not necessarily executed in the order indicated by the arrows. Unless explicitly stated herein, the execution of these steps is not strictly limited in order, and they may be executed in other orders. Moreover, at least some of the steps in Fig. 2 may include multiple sub-steps or stages, which are not necessarily completed at the same moment but may be executed at different times. These sub-steps or stages are also not necessarily executed sequentially, but may be executed in turn or alternately with other steps or with at least part of the sub-steps or stages of other steps.
In one embodiment, as shown in Fig. 5, a PPT presentation file recording device is provided, comprising: a shooting module 510, an uploading module 520, a text recognition module 530 and a speech recognition module 540, wherein:
the shooting module 510 is configured to shoot the PPT presentation file during the presentation to obtain document pictures of the PPT presentation file and speech audio of the speaker;
the uploading module 520 is configured to upload the document pictures and the speech audio to the server;
the text recognition module 530 is configured to control the server to perform text recognition on the document pictures to generate the target PPT document; and
the speech recognition module 540 is configured to control the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain the speech document.
In one embodiment, the shooting module is specifically configured to: perform customized shooting of the PPT document within the camera frame using a four-corner selection frame; and take pictures continuously at a preset time interval while recording the speaker.
In one embodiment, the PPT presentation file recording device further includes a deduplication module, configured to perform deduplication on the document pictures before the document pictures and the speech audio are uploaded to the server.
In one embodiment, the PPT presentation file recording device further includes a correction module, configured to perform shape correction on the document pictures using an image processing algorithm.
In one embodiment, the text recognition module 530 is further configured to control the server to perform text recognition on each document picture using an OCR recognition algorithm to obtain document text and layout information, and to generate the target PPT document according to the document text and the layout information.
In one embodiment, the PPT presentation file recording device further includes a merging module, configured to enter the speech text into the corresponding target PPT document according to the timestamp of the target PPT document and the timeline information of the speech text.
For the specific limitations of the PPT presentation file recording device, refer to the limitations of the PPT presentation file recording method above, which are not repeated here. Each module in the above PPT presentation file recording device may be implemented in whole or in part by software, hardware or a combination thereof. Each module may be embedded in, or independent of, the processor of the computer equipment in hardware form, or stored in the memory of the computer equipment in software form, so that the processor can call and execute the operations corresponding to each module.
In one embodiment, a computer equipment is provided. The computer equipment may be a photographic device, and its internal structure may be as shown in Fig. 6. The computer equipment includes a processor, a memory, a network interface, a display screen and an input unit connected through a system bus. The processor of the computer equipment provides computing and control capabilities. The memory of the computer equipment includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for running the operating system and the computer program stored in the non-volatile storage medium. The network interface of the computer equipment is used to communicate with an external photographic device or server through a network connection. The computer program, when executed by the processor, implements a PPT presentation file recording method. The display screen of the computer equipment may be a liquid crystal display or an electronic ink display. The input unit of the computer equipment may be a touch layer covering the display screen, a key, trackball or trackpad provided on the housing of the computer equipment, or an external keyboard, trackpad or mouse.
Those skilled in the art will understand that the structure shown in Fig. 6 is only a block diagram of the part of the structure relevant to the solution of this application, and does not limit the computer equipment to which the solution is applied. A specific computer equipment may include more or fewer components than shown in the figure, combine certain components, or have a different arrangement of components.
In one embodiment, a computer equipment is provided, including a memory and a processor. The memory stores a computer program, and the processor implements the following steps when executing the computer program: shooting the PPT presentation file during the presentation to obtain document pictures of the PPT presentation file and speech audio of the speaker; uploading the document pictures and the speech audio to the server; controlling the server to perform text recognition on the document pictures to generate the target PPT document; and controlling the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain the speech document.
In one embodiment, the step of shooting the PPT presentation file during the presentation specifically includes: performing customized shooting of the PPT document within the camera frame using a four-corner selection frame; and taking pictures continuously at a preset time interval while recording the speaker.
In one embodiment, before the step of uploading the document pictures and the speech audio to the server, the steps further include: performing deduplication on the document pictures.
In one embodiment, the processor further implements the following step when executing the computer program: performing shape correction on the document pictures using an image processing algorithm.
In one embodiment, the step of controlling the server to perform text recognition on the document pictures to generate the target PPT document includes: controlling the server to perform text recognition on each document picture using an OCR recognition algorithm to obtain document text and layout information; and generating the target PPT document according to the document text and the layout information.
In one embodiment, the processor further implements the following step when executing the computer program: entering the speech text into the corresponding target PPT document according to the timestamp of the target PPT document and the timeline information of the speech text.
In one embodiment, a computer-readable storage medium is provided, storing a computer program. The computer program implements the following steps when executed by a processor: shooting the PPT presentation file during the presentation to obtain document pictures of the PPT presentation file and speech audio of the speaker; uploading the document pictures and the speech audio to the server; controlling the server to perform text recognition on the document pictures to generate the target PPT document; and controlling the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain the speech document.
In one embodiment, the step of shooting the PPT presentation file during the presentation specifically includes: performing customized shooting of the PPT document within the camera frame using a four-corner selection frame; and taking pictures continuously at a preset time interval while recording the speaker.
In one embodiment, before the step of uploading the document pictures and the speech audio to the server, the steps further include: performing deduplication on the document pictures.
In one embodiment, the computer program further implements the following step when executed by the processor: performing shape correction on the document pictures using an image processing algorithm.
In one embodiment, the step of controlling the server to perform text recognition on the document pictures to generate the target PPT document includes: controlling the server to perform text recognition on each document picture using an OCR recognition algorithm to obtain document text and layout information; and generating the target PPT document according to the document text and the layout information.
In one embodiment, the computer program further implements the following step when executed by the processor: entering the speech text into the corresponding target PPT document according to the timestamp of the target PPT document and the timeline information of the speech text.
Those of ordinary skill in the art will understand that all or part of the processes in the methods of the above embodiments can be completed by a computer program instructing the relevant hardware. The computer program can be stored in a non-volatile computer-readable storage medium and, when executed, may include the processes of the embodiments of the above methods. Any reference to memory, storage, a database or other media used in the embodiments provided by this application may include non-volatile and/or volatile memory. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM) or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), Rambus direct RAM (RDRAM), direct Rambus dynamic RAM (DRDRAM) and Rambus dynamic RAM (RDRAM).
The technical features of the above embodiments can be combined arbitrarily. For brevity, not every possible combination of these technical features is described; however, as long as a combination of technical features involves no contradiction, it should be considered within the scope of this specification.
The above embodiments express only several implementations of the present application. Their description is relatively specific and detailed, but they should not therefore be construed as limiting the scope of the patent. It should be noted that those of ordinary skill in the art can make various modifications and improvements without departing from the concept of the present application, and these all fall within the scope of protection of the present application. Therefore, the scope of protection of this patent application shall be subject to the appended claims.
Claims (10)
1. A PPT presentation file recording method, the method comprising:
performing customized shooting of a PPT presentation file during a presentation to obtain document pictures of the PPT presentation file and speech audio of the speaker;
uploading the document pictures and the speech audio to a server;
controlling the server to perform text recognition on the document pictures and generate a target PPT document; and
controlling the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain a speech document.
2. The method according to claim 1, characterized in that the step of shooting the PPT presentation file during the presentation specifically comprises: performing customized shooting of the PPT document using a quadrilateral selection frame in the camera viewfinder; and taking pictures continuously at a preset time interval while making an audio recording of the speaker.
3. The method according to claim 1, characterized in that, before the step of uploading the document pictures and the speech audio to the server, the method further comprises: performing deduplication on the document pictures.
4. The method according to claim 1, characterized in that the method further comprises correcting the document pictures using an image processing algorithm.
5. The method according to claim 1, characterized in that the step of controlling the server to perform text recognition on the document pictures and generate the target PPT document comprises:
controlling the server to perform text recognition on each document picture using an OCR recognition algorithm, obtaining document text and layout information; and
generating the target PPT document from the document text and the layout information.
6. The method according to claim 1, characterized in that the method further comprises entering the speech text into the corresponding target PPT document according to the timestamps of the target PPT document and the timeline information of the speech text.
7. A PPT presentation file recording device, characterized in that the device comprises:
a shooting module, configured to perform customized shooting of a PPT presentation file during a presentation to obtain document pictures of the PPT presentation file and speech audio of the speaker;
an uploading module, configured to upload the document pictures and the speech audio to a server;
a text recognition module, configured to control the server to perform text recognition on the document pictures and generate a target PPT document; and
a speech recognition module, configured to control the server to perform speech recognition on the speech audio, converting the speech audio into speech text to obtain a speech document.
8. A PPT presentation file recording device, characterized in that the shooting module is specifically configured to: perform customized shooting of the PPT document during the presentation using a quadrilateral selection frame in the camera viewfinder; and take pictures continuously at a preset time interval while making an audio recording of the speaker.
9. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor, when executing the computer program, implements the steps of the method according to any one of claims 1 to 5.
10. A computer-readable storage medium storing a computer program, characterized in that the computer program, when executed by a processor, implements the steps of the method according to any one of claims 1 to 5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811177454.0A CN109492206A (en) | 2018-10-10 | 2018-10-10 | PPT presentation file method for recording, device, computer equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109492206A (en) | 2019-03-19 |
Family
ID=65690218
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811177454.0A Pending CN109492206A (en) | 2018-10-10 | 2018-10-10 | PPT presentation file method for recording, device, computer equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109492206A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140282030A1 (en) * | 2013-03-14 | 2014-09-18 | Prateek Bhatnagar | Method and system for outputting information |
CN105120195A (en) * | 2015-09-18 | 2015-12-02 | 谷鸿林 | Content recording and reproducing system and method |
CN107920280A (en) * | 2017-03-23 | 2018-04-17 | 广州思涵信息科技有限公司 | The accurate matched method and system of video, teaching materials PPT and voice content |
Non-Patent Citations (2)
Title |
---|
赵怡 等: "《融媒时代艺术类院校教学模式创新》", 31 October 2016 * |
高凡: "《图书馆变革与发展 四川省文化厅图书情报学与文献学规划项目论文集》", 30 September 2013, 西南交通大学出版社 * |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110060033A (en) * | 2019-04-25 | 2019-07-26 | 大连海事大学 | Lecture full information acquires and is intelligently embedded in the on-demand distribution management system of audio |
CN110070084A (en) * | 2019-04-25 | 2019-07-30 | 大连海事大学 | Lecture PPT intellectual analysis, storage and on-demand dissemination system |
CN110347848A (en) * | 2019-07-11 | 2019-10-18 | 深圳云智教育科技有限公司 | A kind of PowerPoint management method and device |
CN110493640A (en) * | 2019-08-01 | 2019-11-22 | 东莞理工学院 | A kind of system and method that the Video Quality Metric based on video processing is PPT |
CN112784106A (en) * | 2019-11-04 | 2021-05-11 | 阿里巴巴集团控股有限公司 | Content data processing method, report data processing method, computer device, and storage medium |
CN112784106B (en) * | 2019-11-04 | 2024-05-14 | 阿里巴巴集团控股有限公司 | Content data processing method, report data processing method, computer device, and storage medium |
CN110909737A (en) * | 2019-11-14 | 2020-03-24 | 武汉虹旭信息技术有限责任公司 | Picture character recognition method and system |
CN111275048A (en) * | 2020-01-15 | 2020-06-12 | 济南浪潮高新科技投资发展有限公司 | PPT reproduction method based on OCR character recognition technology |
CN111275048B (en) * | 2020-01-15 | 2023-04-18 | 山东浪潮科学研究院有限公司 | PPT reproduction method based on OCR character recognition technology |
CN111832455A (en) * | 2020-06-30 | 2020-10-27 | 北京小米松果电子有限公司 | Method, device, storage medium and electronic equipment for acquiring content image |
CN114077823A (en) * | 2020-08-05 | 2022-02-22 | Oppo广东移动通信有限公司 | Image processing method, terminal and storage medium |
CN112581965A (en) * | 2020-12-11 | 2021-03-30 | 天津讯飞极智科技有限公司 | Transcription method, device, recording pen and storage medium |
CN112784085A (en) * | 2021-01-19 | 2021-05-11 | 杭州睿胜软件有限公司 | Method for generating file by using shared picture, server side and readable storage medium |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN109492206A (en) | PPT presentation file method for recording, device, computer equipment and storage medium | |
US9799375B2 (en) | Method and device for adjusting playback progress of video file | |
US7779355B1 (en) | Techniques for using paper documents as media templates | |
CN111523293A (en) | Method and device for assisting user in information input in live broadcast teaching | |
CN110781328A (en) | Video generation method, system, device and storage medium based on voice recognition | |
JP2007004784A (en) | Method, system, and device for digital information processing | |
US20220028424A1 (en) | Systems for optimized presentation capture | |
DE112018006727B4 (en) | ELECTRONIC DEVICE FOR COMBINING MUSIC WITH PHOTOGRAPHY AND CONTROL METHODS THEREFOR | |
US20140156651A1 (en) | Automatic summarizing of media content | |
US10339204B2 (en) | Converting electronic documents having visible objects | |
US20140364982A1 (en) | Methods and systems for media file management | |
CN111881904A (en) | Blackboard writing recording method and system | |
US11355155B1 (en) | System and method to summarize one or more videos based on user priorities | |
US20150111189A1 (en) | System and method for browsing multimedia file | |
CN110992960A (en) | Control method, control device, electronic equipment and storage medium | |
CN104243886A (en) | High-speed image analyzing and video generating technology based on plug-in technology | |
US20160335500A1 (en) | Method of and system for generating metadata | |
CN117171369A (en) | Content generation method, device, computer equipment and storage medium | |
US10902047B2 (en) | Information processing method for displaying a plurality of images extracted from a moving image | |
CN104092553A (en) | Data processing method and device and conference system | |
CN114341866A (en) | Simultaneous interpretation method, device, server and storage medium | |
CN110110103A (en) | Acquisition methods, device, computer equipment and the storage medium of media resource | |
EP4099711A1 (en) | Method and apparatus and storage medium for processing video and timing of subtitles | |
KR101783872B1 (en) | Video Search System and Method thereof | |
CN113709521B (en) | System for automatically matching background according to video content |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20190319 |