CN109492206A - PPT presentation file method for recording, device, computer equipment and storage medium - Google Patents

PPT presentation file method for recording, device, computer equipment and storage medium Download PDF

Info

Publication number
CN109492206A
CN109492206A CN201811177454.0A CN201811177454A CN109492206A CN 109492206 A CN109492206 A CN 109492206A CN 201811177454 A CN201811177454 A CN 201811177454A CN 109492206 A CN109492206 A CN 109492206A
Authority
CN
China
Prior art keywords
document
ppt
speech
presentation file
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811177454.0A
Other languages
Chinese (zh)
Inventor
管明雷
汪驰升
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Ronghui Technology Co ltd
Original Assignee
Shenzhen Ronghui Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Ronghui Technology Co ltd filed Critical Shenzhen Ronghui Technology Co ltd
Priority to CN201811177454.0A priority Critical patent/CN109492206A/en
Publication of CN109492206A publication Critical patent/CN109492206A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Character Discrimination (AREA)

Abstract

This application involves a kind of PPT presentation file method for recording, device, computer equipment and storage mediums.The described method includes: shooting to PPT being customized of presentation file in demonstration, the document picture of PPT presentation file and the speech audio of speaker are obtained;Document picture and speech audio are uploaded to server;Control server carries out Text region to document picture, generates target PPT document;Control server carries out speech recognition to speech audio, and speech audio is converted into speech text, obtains speech document.It is recorded in the form of the speech content of demonstrator can be automatically converted into text by this method, is not necessarily to later period manual sorting, it is very convenient.

Description

PPT presentation file method for recording, device, computer equipment and storage medium
Technical field
This application involves field of computer technology, more particularly to a kind of PPT(PowerPoint, PowerPoint) demonstration text Shelves method for recording, device, computer equipment and storage medium.
Background technique
Currently, PPT document has become the important expression way of various reports and teaching.Speaker is by projecting PPT The main contents that will give a lecture on to projection screen or display present to audience.Spectators mainly pass through the PPT document showed Speech purport is understood with the oral content told about of speaker.Wherein, spectators are mainly using sound pick-up outfit come oral to speaker Content is told about to be acquired;It is taken pictures using camera or video record records PPT document content in speech.However, not Pipe is recording, takes pictures or record a video, and requires that a more complete document could be formed to its manual sorting afterwards, wherein right In speaker it is oral tell about content need to manually listen write as extracts at document, this part of housekeeping needs to consume plenty of time and essence Power.
Summary of the invention
Based on this, it is necessary to which in view of the above technical problems, providing one kind can be by the speech content automatic conversion of demonstrator PPT presentation file method for recording, device, computer equipment and the storage medium recorded at the form of text.
A kind of PPT presentation file method for recording, which comprises
To PPT being customized of the presentation file shooting in demonstration, the document picture of PPT presentation file and drilling for speaker are obtained Say audio;
Document picture and speech audio are uploaded to server;
Control server carries out Text region to document picture, generates target PPT document;
Control server carries out speech recognition to speech audio, and speech audio is converted into speech text, obtains speech document.
The step of shooting in one of the embodiments, to the PPT presentation file in demonstration specifically includes: to demonstration In PPT presentation file, select frame to shoot PPT being customized of document in camera lens using quadrangle;According between the preset time Every continuously being taken pictures, while recording to speaker.
Before the step of document picture and speech audio are uploaded to server in one of the embodiments, further include: Duplicate removal processing is carried out to document picture.
The method also includes carrying out figure school to document picture using image processing algorithm in one of the embodiments, Just.
Control server carries out Text region to document picture in one of the embodiments, generates target PPT document Step, which includes: control server, carries out Text region to each document picture using OCR recognizer, obtain document text and Layout information;Target PPT document is generated according to document text and layout information.
The method also includes the timestamps and speech text according to target PPT document in one of the embodiments, Timeline information, in the corresponding target PPT document of the Characters that will give a lecture.
A kind of PPT presentation file record device, described device include:
Shooting module obtains the document map of PPT presentation file for shooting to PPT being customized of presentation file in demonstration The speech audio of piece and speaker;
Uploading module, for document picture and speech audio to be uploaded to server;
Text region module carries out Text region to document picture for control server, generates target PPT document;
Speech recognition module carries out speech recognition to speech audio for control server, speech audio is converted into speech text This, obtains speech document.
A kind of computer equipment, including memory, processor, the memory are stored with computer program, the processing Device performs the steps of when executing the computer program
PPT presentation file in demonstration is shot, the document picture of PPT presentation file and the speech sound of speaker are obtained Frequently;
Document picture and speech audio are uploaded to server;
Control server carries out Text region to document picture, generates target PPT document;
Control server carries out speech recognition to speech audio, and speech audio is converted into speech text, obtains speech document.
A kind of computer readable storage medium, is stored thereon with computer program, and the computer program is held by processor It is performed the steps of when row
PPT presentation file in demonstration is shot, the document picture of PPT presentation file and the speech sound of speaker are obtained Frequently;
Document picture and speech audio are uploaded to server;
Control server carries out Text region to document picture, generates target PPT document;
Control server carries out speech recognition to speech audio, and speech audio is converted into speech text, obtains speech document.
Above-mentioned PPT presentation file method for recording, device, computer equipment and storage medium, photographic device pass through shooting PPT Presentation file obtains document picture and speech audio, and carries out Text region to document picture and obtain target PPT document, to speech Audio carries out speech recognition, obtains speech document, the form that the speech content of demonstrator is automatically converted into text is recorded, It is very convenient without later period manual sorting.
Detailed description of the invention
Fig. 1 is the applied environment figure of PPT presentation file method for recording in one embodiment;
Fig. 2 is the flow diagram of PPT presentation file method for recording in one embodiment;
Fig. 3 is the shooting figure of PPT presentation file method for recording in one embodiment;
Fig. 4 is the effect picture of PPT presentation file method for recording in one embodiment;
Fig. 5 is the structural block diagram of PPT presentation file record device in one embodiment;
Fig. 6 is the internal structure chart of computer equipment in one embodiment.
Specific embodiment
It is with reference to the accompanying drawings and embodiments, right in order to which the objects, technical solutions and advantages of the application are more clearly understood The application is further elaborated.It should be appreciated that specific embodiment described herein is only used to explain the application, not For limiting the application.
PPT presentation file method for recording provided by the present application, can be applied in terminal, and terminal demonstrates the PPT of projection Document is shot, and the document picture and speech audio that shooting obtains are handled.Wherein, terminal can be, but not limited to be Various personal computers, laptop, smart phone and tablet computer.PPT presentation file method for recording provided by the present application It can also be applied in application environment as shown in Figure 1.Photographic device 102 can be communicated by network with server 104, It can also be communicated by being electrically connected with server 104.When speaker demonstrates the PPT presentation file 106 of projection, camera shooting Device 102 acquires PPT presentation materials, and presentation materials are uploaded to server and are handled.Wherein, server 104 can be used The server cluster of independent server either multiple servers composition is realized.
In one embodiment, as shown in Fig. 2, providing a kind of PPT presentation file method for recording, it is applied in this way It is illustrated for photographic device 102 in Fig. 1, comprising the following steps:
Step 202, the PPT presentation file in demonstration is shot, obtains document picture and the speaker of PPT presentation file Speech audio.
Before carrying out method of the invention, user photographic device can be placed in the PPT presentation file of projection into The region that do not blocked when row shooting, and the camera of photographic device is directed at PPT presentation file, adjustment photographic device to take the photograph As device can take complete clearly PPT presentation file.When speaker carries out the demonstration of PPT presentation file, Yong Huqi Dynamic photographic device shoots PPT speech document.Photographic device takes pictures to the PPT presentation file of projection, while to drilling Speaker records, and obtains the document picture of multiple shootings and the speech audio of speaker.Wherein, the document picture of acquisition is shot It can store with speech audio in the local storage of photographic device.
In one of the embodiments, to the PPT presentation file in demonstration, select frame to PPT text in camera lens using quadrangle Shelves being customized shooting;It is continuously taken pictures according to preset time interval, while being recorded to speaker.
User can operate the time interval that photographic device presets an automatic camera.For example, preset time interval can To be 2s.When speaker starts to carry out PPT demonstration, user starts photographic device to the PPT presentation file of projection according to default Time interval automatically continuously taken pictures, obtain multiple document pictures;It takes pictures in photographic device to PPT presentation file It records simultaneously to speaker, obtains the speech audio of speaker.At the end of speech, user closes the camera shooting of photographic device Operation.In the present embodiment, continuous automatically continuously shooting is carried out to PPT presentation file by setting interval, has liberated user Both hands, so that user shoots without manual multi-pass operation photographic device, it is very convenient.
Before the step of document picture and speech audio are uploaded to server in one of the embodiments, further include: Duplicate removal processing is carried out to document picture.
When the document picture that photographic device shooting obtains is multiple, photographic device carries out at duplicate removal multiple document pictures Reason.Specifically, all document pictures are carried out feature degree similarity-rough set by photographic device, delete the document that similarity is greater than threshold value Picture.Photographic device stabs the document picture for being greater than threshold value to similarity according to shooting time and deletes, big to multiple similarities In the document picture of threshold value, retains the earliest document picture of shooting time, delete other document pictures.Photographic device can use Perceptual hash algorithm carries out feature degree similarity-rough set to document picture.Wherein, threshold value can be by user setting, such as can be 95%.In the present embodiment, by the duplicate removal to document picture, so that same PPT presentation file of photographic device shooting obtained Multiple document pictures can eliminate repetition, so that document picture and PPT presentation file correspond.
PPT presentation file method for recording further includes using image processing algorithm to document map in one of the embodiments, Piece carries out figure adjustment.
Fig. 3 shows photographic device and customizes the photo that shooting obtains, due to the shooting angle problem of photographic device, The PPT301 for including in the document picture of shooting may need pair with original PPT presentation file there are certain deformation problems It, which is corrected, keeps it consistent with the figure of PPT presentation file.In the present embodiment, photographic device is using image processing algorithm to text Shelves picture carries out figure adjustment, keeps it consistent with the figure of PPT presentation file.Document picture after correction is as shown in figure 4,401 For the PPT for including in the document picture after correction.
In the present embodiment, by carrying out figure adjustment to document picture, so that the document picture of deformation restores normal figure, It is more convenient subsequent Text region and the normal browsing to formula and illustration etc. in document picture.
Step 204, document picture and speech audio are uploaded to server.
Document picture and speech audio are uploaded to server by photographic device.Specifically, photographic device is by all document maps Piece is packaged as the document of PPT format, and the document of PPT format and speech audio are uploaded to server.
Step 206, control server carries out Text region to document picture, generates target PPT document.
Photographic device control server carries out Text region to document picture, obtains document text, raw according to document text At target PPT document.Wherein, the text in target PPT document is independent editable text.
Control server carries out Text region to document picture in one of the embodiments, generates target PPT document Step includes: that control server utilizes OCR(optical character recognition, optical character identification) identification calculation Method carries out Text region to each document picture, obtains document text and layout information;According to document text and layout information Generate target PPT document.
Photographic device control server carries out Text region to each document picture using OCR recognizer.Server Document picture is scanned, document picture is analyzed and processed, obtains document text and layout information.Wherein, document text Word includes the attribute information of document text, and attribute information includes format, size, color and the font space bit of document text It sets.Server generates corresponding target PPT document according to document text and its attribute information.Further, server is to document Formula and illustration in picture etc. carry out screenshot, by formula and illustration according to the corresponding insertion target in position in document picture PPT document.In the present embodiment, Text region is carried out to document picture using OCR recognizer, the document text obtained according to identification Word and layout information regenerate target PPT document, can the secondary text edited in PPT according to the PPT document that the method obtains Shelves text, it is very convenient.
Step 208, control server carries out speech recognition to speech audio, and speech audio is converted into speech text, is obtained Must give a lecture document.
Photographic device control server carries out speech recognition to speech audio, and speech audio is converted into speech text, is obtained Must give a lecture document.Specifically, photographic device control server is filtered speech audio, to filtered speech audio Speech recognition is carried out, the speech text of identification is stored as speech document.Wherein, give a lecture document format can for txt format, Doc format, docx format and ppt format etc..
In the present embodiment, photographic device obtains document picture and speech audio by shooting PPT presentation file, and to document Picture carries out Text region and obtains target PPT document, carries out speech recognition to speech audio, speech document is obtained, by demonstrator Speech content be automatically converted into the form of text and record, be not necessarily to later period manual sorting, it is very convenient.
PPT presentation file method for recording in one of the embodiments, further include: according to the timestamp of target PPT document With the timeline information of speech text, in the corresponding target PPT document of the Characters that will give a lecture.
Wherein, the timestamp of target PPT document is generated according to camera time.Camera time can be specifically i.e. constantly Between, such as can be instant Beijing time.Camera time can also be that relative time, such as photographic device grasp starting camera shooting The initial time of work is set as 0, according to preset time interval to the document picture entry time of shooting.Such as by the preset time Interval is recorded as t, and time of first document picture on timestamp is 0, second document picture on timestamp when Between be then t, third time of the document picture on timestamp is then 2t ... ..., and n-th document picture is on timestamp Time be then (n-1) t.
In the present embodiment, photographic device is after control server carries out speech recognition to speech audio, according to speech audio Speech Characters timeline information of the recording time to acquisition.Further, photographic device control server is according to target PPT The timestamp of document and the timeline information of speech text are matched, by speech text according to corresponding typing target PPT text In shelves.It specifically, will be in the remarks for words input target PPT document of giving a lecture.
In the present embodiment, by the way that the speech text matches of speaker are recorded such as target PPT document, so that speaker PPT presentation file content and speech content are combined into one, and are illustrated so that the target PPT document obtained is relatively sharp, easy-to-read Browsing.
It should be understood that although each step in the flow chart of Fig. 2 is successively shown according to the instruction of arrow, this A little steps are not that the inevitable sequence according to arrow instruction successively executes.Unless expressly state otherwise herein, these steps It executes there is no the limitation of stringent sequence, these steps can execute in other order.Moreover, at least part in Fig. 2 Step may include that perhaps these sub-steps of multiple stages or stage are executed in synchronization to multiple sub-steps It completes, but can execute at different times, the execution sequence in these sub-steps or stage, which is also not necessarily, successively to be carried out, But it can be executed in turn or alternately at least part of the sub-step or stage of other steps or other steps.
In one embodiment, as shown in figure 5, providing a kind of PPT presentation file record device, comprising: shooting module 510, uploading module 510, Text region module 530 and speech recognition module 540, in which:
Shooting module 510, for being shot to the PPT presentation file in demonstration, obtain PPT presentation file document picture and The speech audio of speaker;
Uploading module 510, for document picture and speech audio to be uploaded to server;
Text region module 530 carries out Text region to document picture for control server, generates target PPT document;
Speech recognition module 540 carries out speech recognition to speech audio for control server, speech audio is converted into giving a lecture Text obtains speech document.
In one embodiment, shooting module is specifically used for: to the PPT presentation file in demonstration, selecting frame using quadrangle PPT being customized of document in camera lens is shot;It is continuously taken pictures according to preset time interval, while speaker is carried out Recording.
In one embodiment, PPT presentation file record device further includes deduplication module, for document picture and will drill It says and duplicate removal processing is carried out to document picture before audio is uploaded to server.
In one embodiment, PPT presentation file record device further includes correction module, for utilizing image processing algorithm Figure adjustment is carried out to document picture.
In one embodiment, Text region module 530 is also used to control server using OCR recognizer to each Document picture carries out Text region, obtains document text and layout information;Target PPT is generated according to document text and layout information Document.
In one embodiment, PPT presentation file record device further includes merging module, for according to target PPT document Timestamp and speech text timeline information, in the corresponding target PPT document of the Characters that will give a lecture.
Specific restriction about PPT presentation file record device may refer to above for PPT presentation file recording side The restriction of method, details are not described herein.Modules in above-mentioned PPT presentation file record device can be fully or partially through soft Part, hardware and combinations thereof are realized.Above-mentioned each module can be embedded in the form of hardware or independently of the processing in computer equipment It in device, can also be stored in a software form in the memory in computer equipment, in order to which processor calls execution above each The corresponding operation of a module.
In one embodiment, a kind of computer equipment is provided, which can be photographic device, inside Structure chart can be as shown in Figure 6.The computer equipment include by system bus connect processor, memory, network interface, Display screen and input unit.Wherein, the processor of the computer equipment is for providing calculating and control ability.The computer equipment Memory include non-volatile memory medium, built-in storage.The non-volatile memory medium is stored with operating system and calculating Machine program.The built-in storage provides environment for the operation of operating system and computer program in non-volatile memory medium.It should The network interface of computer equipment is used to communicate with external photographic device or server by network connection.The computer program To realize a kind of PPT presentation file method for recording when being executed by processor.The display screen of the computer equipment can be liquid crystal Display screen or electric ink display screen, the input unit of the computer equipment can be the touch layer covered on display screen, can also To be the key being arranged on computer equipment shell, trace ball or Trackpad, external keyboard, Trackpad or mouse can also be Deng.
It will be understood by those skilled in the art that structure shown in Fig. 6, only part relevant to application scheme is tied The block diagram of structure does not constitute the restriction for the computer equipment being applied thereon to application scheme, specific computer equipment It may include perhaps combining certain components or with different component layouts than more or fewer components as shown in the figure.
In one embodiment, a kind of computer equipment, including memory and processor are provided, is stored in memory Computer program, the processor are performed the steps of when executing computer program and are clapped the PPT presentation file in demonstration It takes the photograph, obtains the document picture of PPT presentation file and the speech audio of speaker;Document picture and speech audio are uploaded to service Device;Control server carries out Text region to document picture, generates target PPT document;Control server carries out speech audio Speech audio is converted into speech text, obtains speech document by speech recognition.
In one embodiment, the step of shooting to the PPT presentation file in demonstration specifically includes: in demonstration PPT presentation file selects frame to shoot PPT being customized of document in camera lens using quadrangle;According to preset time interval into Row is continuously taken pictures, while being recorded to speaker.
In one embodiment, before the step of document picture and speech audio being uploaded to server further include: to text Shelves picture carries out duplicate removal processing.
In one embodiment, it is also performed the steps of when processor executes computer program and utilizes image processing algorithm Figure adjustment is carried out to document picture.
In one embodiment, the step of control server carries out Text region to document picture, generates target PPT document Including: control server carries out Text region to each document picture using OCR recognizer, obtains document text and the space of a whole page Information;Target PPT document is generated according to document text and layout information.
In one embodiment, it also performs the steps of when processor executes computer program according to target PPT document The timeline information of timestamp and speech text, in the corresponding target PPT document of the Characters that will give a lecture.
In one embodiment, a kind of computer readable storage medium is provided, computer program is stored thereon with, is calculated Machine program performs the steps of when being executed by processor shoots the PPT presentation file in demonstration, obtains PPT demonstration text The document picture of shelves and the speech audio of speaker;Document picture and speech audio are uploaded to server;Control server pair Document picture carries out Text region, generates target PPT document;Control server carries out speech recognition to speech audio, will give a lecture Audio is converted into speech text, obtains speech document.
The step of shooting to the PPT presentation file in demonstration specifically includes: to the PPT presentation file in demonstration, benefit Frame is selected to shoot PPT being customized of document in camera lens with quadrangle;It is continuously taken pictures according to preset time interval, simultaneously It records to speaker.
In one embodiment, before the step of document picture and speech audio being uploaded to server further include: to text Shelves picture carries out duplicate removal processing.
In one embodiment, it also performs the steps of when computer program is executed by processor and is calculated using image procossing Method carries out figure adjustment to document picture.
In one embodiment, the step of control server carries out Text region to document picture, generates target PPT document Including: control server carries out Text region to each document picture using OCR recognizer, obtains document text and the space of a whole page Information;Target PPT document is generated according to document text and layout information.
In one embodiment, it is also performed the steps of when computer program is executed by processor according to target PPT document Timestamp and speech text timeline information, in the corresponding target PPT document of the Characters that will give a lecture.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with Relevant hardware is instructed to complete by computer program, the computer program can be stored in a non-volatile computer In read/write memory medium, the computer program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein, To any reference of memory, storage, database or other media used in each embodiment provided herein, Including non-volatile and/or volatile memory.Nonvolatile memory may include read-only memory (ROM), programming ROM (PROM), electrically programmable ROM(EPROM), electrically erasable ROM(EEPROM) or flash memory.Volatile memory may include Random-access memory (ram) or external cache.By way of illustration and not limitation, RAM is available in many forms, Such as static state RAM(SRAM), dynamic ram (DRAM), synchronous dram (SDRAM), double data rate sdram (DDRSDRAM), enhancing Type SDRAM(ESDRAM), synchronization link (Synchlink) DRAM(SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic ram (DRDRAM) and memory bus dynamic ram (RDRAM) etc..
Each technical characteristic of above embodiments can be combined arbitrarily, for simplicity of description, not to above-described embodiment In each technical characteristic it is all possible combination be all described, as long as however, the combination of these technical characteristics be not present lance Shield all should be considered as described in this specification.
The several embodiments of the application above described embodiment only expresses, the description thereof is more specific and detailed, but simultaneously It cannot therefore be construed as limiting the scope of the patent.It should be pointed out that coming for those of ordinary skill in the art It says, without departing from the concept of this application, various modifications and improvements can be made, these belong to the protection of the application Range.Therefore, the scope of protection shall be subject to the appended claims for the application patent.

Claims (10)

1. a kind of PPT presentation file method for recording, which comprises
To PPT being customized of the presentation file shooting in demonstration, the document picture of PPT presentation file and drilling for speaker are obtained Say audio;
Document picture and speech audio are uploaded to server;
Control server carries out Text region to document picture, generates target PPT document;
Control server carries out speech recognition to speech audio, and speech audio is converted into speech text, obtains speech document.
2. the method according to claim 1, wherein what the PPT presentation file in described pair of demonstration was shot Step specifically includes: to the PPT presentation file in demonstration, selecting frame to clap PPT being customized of document in camera lens using quadrangle It takes the photograph;It is continuously taken pictures according to preset time interval, while being recorded to speaker.
3. the method according to claim 1, wherein described be uploaded to server for document picture and speech audio The step of before further include: to document picture carry out duplicate removal processing.
4. the method according to claim 1, wherein the method also includes utilizing image processing algorithm to document Picture is corrected.
5. the method according to claim 1, wherein the control server carries out text knowledge to document picture Not, the step of generation target PPT document includes:
Control server carries out Text region to each document picture using OCR recognizer, obtains document text and the space of a whole page Information;
Target PPT document is generated according to document text and layout information.
6. the method according to claim 1, wherein the method also includes the times according to target PPT document The timeline information of stamp and speech text, in the corresponding target PPT document of the Characters that will give a lecture.
7. a kind of PPT presentation file record device, which is characterized in that described device includes:
Shooting module obtains the document map of PPT presentation file for shooting to PPT being customized of presentation file in demonstration The speech audio of piece and speaker;
Uploading module, for document picture and speech audio to be uploaded to server;
Text region module carries out Text region to document picture for control server, generates target PPT document;
Speech recognition module carries out speech recognition to speech audio for control server, speech audio is converted into speech text This, obtains speech document.
8. a kind of PPT presentation file record device, which is characterized in that the shooting module is specifically used for: drilling the PPT in demonstration Show document, selects frame to shoot PPT being customized of document in camera lens using quadrangle;It is carried out according to preset time interval continuous It takes pictures, while recording to speaker.
9. a kind of computer equipment, including memory and processor, the memory are stored with computer program, feature exists In the step of processor realizes any one of claims 1 to 5 the method when executing the computer program.
10. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the computer program The step of method described in any one of claims 1 to 5 is realized when being executed by processor.
CN201811177454.0A 2018-10-10 2018-10-10 PPT presentation file method for recording, device, computer equipment and storage medium Pending CN109492206A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811177454.0A CN109492206A (en) 2018-10-10 2018-10-10 PPT presentation file method for recording, device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811177454.0A CN109492206A (en) 2018-10-10 2018-10-10 PPT presentation file method for recording, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN109492206A true CN109492206A (en) 2019-03-19

Family

ID=65690218

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811177454.0A Pending CN109492206A (en) 2018-10-10 2018-10-10 PPT presentation file method for recording, device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN109492206A (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110060033A (en) * 2019-04-25 2019-07-26 大连海事大学 Lecture full information acquires and is intelligently embedded in the on-demand distribution management system of audio
CN110070084A (en) * 2019-04-25 2019-07-30 大连海事大学 Lecture PPT intellectual analysis, storage and on-demand dissemination system
CN110347848A (en) * 2019-07-11 2019-10-18 深圳云智教育科技有限公司 A kind of PowerPoint management method and device
CN110493640A (en) * 2019-08-01 2019-11-22 东莞理工学院 A kind of system and method that the Video Quality Metric based on video processing is PPT
CN110909737A (en) * 2019-11-14 2020-03-24 武汉虹旭信息技术有限责任公司 Picture character recognition method and system
CN111275048A (en) * 2020-01-15 2020-06-12 济南浪潮高新科技投资发展有限公司 PPT reproduction method based on OCR character recognition technology
CN111832455A (en) * 2020-06-30 2020-10-27 北京小米松果电子有限公司 Method, device, storage medium and electronic equipment for acquiring content image
CN112581965A (en) * 2020-12-11 2021-03-30 天津讯飞极智科技有限公司 Transcription method, device, recording pen and storage medium
CN112784085A (en) * 2021-01-19 2021-05-11 杭州睿胜软件有限公司 Method for generating file by using shared picture, server side and readable storage medium
CN112784106A (en) * 2019-11-04 2021-05-11 阿里巴巴集团控股有限公司 Content data processing method, report data processing method, computer device, and storage medium
CN114077823A (en) * 2020-08-05 2022-02-22 Oppo广东移动通信有限公司 Image processing method, terminal and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140282030A1 (en) * 2013-03-14 2014-09-18 Prateek Bhatnagar Method and system for outputting information
CN105120195A (en) * 2015-09-18 2015-12-02 谷鸿林 Content recording and reproducing system and method
CN107920280A (en) * 2017-03-23 2018-04-17 广州思涵信息科技有限公司 The accurate matched method and system of video, teaching materials PPT and voice content

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140282030A1 (en) * 2013-03-14 2014-09-18 Prateek Bhatnagar Method and system for outputting information
CN105120195A (en) * 2015-09-18 2015-12-02 谷鸿林 Content recording and reproducing system and method
CN107920280A (en) * 2017-03-23 2018-04-17 广州思涵信息科技有限公司 The accurate matched method and system of video, teaching materials PPT and voice content

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
赵怡 等: "《融媒时代艺术类院校教学模式创新》", 31 October 2016 *
高凡: "《图书馆变革与发展 四川省文化厅图书情报学与文献学规划项目论文集》", 30 September 2013, 西南交通大学出版社 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110060033A (en) * 2019-04-25 2019-07-26 大连海事大学 Lecture full information acquires and is intelligently embedded in the on-demand distribution management system of audio
CN110070084A (en) * 2019-04-25 2019-07-30 大连海事大学 Lecture PPT intellectual analysis, storage and on-demand dissemination system
CN110347848A (en) * 2019-07-11 2019-10-18 深圳云智教育科技有限公司 A kind of PowerPoint management method and device
CN110493640A (en) * 2019-08-01 2019-11-22 东莞理工学院 A kind of system and method that the Video Quality Metric based on video processing is PPT
CN112784106A (en) * 2019-11-04 2021-05-11 阿里巴巴集团控股有限公司 Content data processing method, report data processing method, computer device, and storage medium
CN112784106B (en) * 2019-11-04 2024-05-14 阿里巴巴集团控股有限公司 Content data processing method, report data processing method, computer device, and storage medium
CN110909737A (en) * 2019-11-14 2020-03-24 武汉虹旭信息技术有限责任公司 Picture character recognition method and system
CN111275048A (en) * 2020-01-15 2020-06-12 济南浪潮高新科技投资发展有限公司 PPT reproduction method based on OCR character recognition technology
CN111275048B (en) * 2020-01-15 2023-04-18 山东浪潮科学研究院有限公司 PPT reproduction method based on OCR character recognition technology
CN111832455A (en) * 2020-06-30 2020-10-27 北京小米松果电子有限公司 Method, device, storage medium and electronic equipment for acquiring content image
CN114077823A (en) * 2020-08-05 2022-02-22 Oppo广东移动通信有限公司 Image processing method, terminal and storage medium
CN112581965A (en) * 2020-12-11 2021-03-30 天津讯飞极智科技有限公司 Transcription method, device, recording pen and storage medium
CN112784085A (en) * 2021-01-19 2021-05-11 杭州睿胜软件有限公司 Method for generating file by using shared picture, server side and readable storage medium

Similar Documents

Publication Publication Date Title
CN109492206A (en) PPT presentation file method for recording, device, computer equipment and storage medium
US9799375B2 (en) Method and device for adjusting playback progress of video file
US7779355B1 (en) Techniques for using paper documents as media templates
CN111523293A (en) Method and device for assisting user in information input in live broadcast teaching
CN110781328A (en) Video generation method, system, device and storage medium based on voice recognition
JP2007004784A (en) Method, system, and device for digital information processing
US20220028424A1 (en) Systems for optimized presentation capture
DE112018006727B4 (en) ELECTRONIC DEVICE FOR COMBINING MUSIC WITH PHOTOGRAPHY AND CONTROL METHODS THEREFOR
US20140156651A1 (en) Automatic summarizing of media content
US10339204B2 (en) Converting electronic documents having visible objects
US20140364982A1 (en) Methods and systems for media file management
CN111881904A (en) Blackboard writing recording method and system
US11355155B1 (en) System and method to summarize one or more videos based on user priorities
US20150111189A1 (en) System and method for browsing multimedia file
CN110992960A (en) Control method, control device, electronic equipment and storage medium
CN104243886A (en) High-speed image analyzing and video generating technology based on plug-in technology
US20160335500A1 (en) Method of and system for generating metadata
CN117171369A (en) Content generation method, device, computer equipment and storage medium
US10902047B2 (en) Information processing method for displaying a plurality of images extracted from a moving image
CN104092553A (en) Data processing method and device and conference system
CN114341866A (en) Simultaneous interpretation method, device, server and storage medium
CN110110103A (en) Acquisition methods, device, computer equipment and the storage medium of media resource
EP4099711A1 (en) Method and apparatus and storage medium for processing video and timing of subtitles
KR101783872B1 (en) Video Search System and Method thereof
CN113709521B (en) System for automatically matching background according to video content

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190319