CN109033394B - Client for picture video annotation data - Google Patents
- Publication number: CN109033394B
- Application number: CN201810862044.3A
- Authority: CN (China)
- Prior art keywords: video, module, picture, local, client
- Legal status: Active (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abstract
The invention discloses a client for picture and video annotation data, comprising a local picture calling module, a local video calling module, an online video calling module, a picture playing module, a video playing module, a label setting module and a label annotation module. The picture playing module plays the pictures to be annotated at a preset picture switching interval, and the video playing module plays the video to be annotated. The label annotation module annotates the picture or video to be annotated. The client can add annotations to a video or picture automatically, for example by setting a video frame number, and the labels to be applied can be prepared in advance before annotation begins. This reduces workload and labor intensity and overcomes the heavy workload, high repetition rate and long time consumption of the conventional annotation method.
Description
Technical Field
The invention belongs to the technical field of computer vision based on deep learning, and particularly relates to a client for picture and video annotation data.
Background
In the field of computer vision based on deep learning, a large amount of picture and video data is required to train a model for a deep learning algorithm. How to collect training data reasonably and efficiently is therefore one of the important topics in deep learning.
In labeled data, each picture corresponds to a label, which is the "explanation" of the picture. Fig. 1 shows the two items used in labeling: on the left a picture folder, on the right a txt file. Fig. 2 shows the contents of the txt file in Fig. 1; each line of the file corresponds one-to-one to a picture in the folder on the left.
In the conventional labeling method, pictures are labeled manually. All pictures are put in a folder, and each picture is then manually matched to a label in the txt file. Because the correspondence must be one-to-one, the workload is huge. Video is more troublesome: a screenshot tool is first needed to split the video into pictures, possibly about twenty frames (that is, twenty pictures) per second, after which the picture labeling procedure is repeated for every frame.
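The conventional layout described above can be sketched as follows. This is a minimal illustration, not part of the patent: the function names are hypothetical, and pairing pictures with label lines by sorted file name is an assumption about how the one-to-one correspondence is kept.

```python
def pair_pictures_with_labels(picture_names, label_lines):
    """Pair each picture with one line of the label file (one-to-one).

    Assumption: the n-th line of the txt file belongs to the n-th picture
    when the picture file names are sorted.
    """
    pictures = sorted(picture_names)
    labels = [line.strip() for line in label_lines if line.strip()]
    if len(pictures) != len(labels):
        raise ValueError(f"{len(pictures)} pictures but {len(labels)} labels")
    return list(zip(pictures, labels))


def pictures_from_video(duration_seconds, frames_per_second=20):
    """Number of pictures produced when a video is split frame by frame."""
    return duration_seconds * frames_per_second


pairs = pair_pictures_with_labels(["b.jpg", "a.jpg"], ["cat\n", "dog\n"])
manual_labels_per_minute = pictures_from_video(60)
```

At about twenty frames per second, one minute of video already yields 1200 pictures to label by hand, which is the workload the client is meant to reduce.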
Disclosure of Invention
In view of the state of the art, the present invention overcomes the above drawbacks and provides a client for picture and video annotation data.
The invention adopts the following technical solution. The client for picture and video annotation data comprises:
a local picture calling module and a local video calling module, wherein the local picture calling module reads pictures to be annotated that are stored in local storage, and the local video calling module reads videos to be annotated that are stored in local storage;
an online video calling module, which acquires a video to be annotated from an online live broadcast source;
a picture playing module and a video playing module, wherein the picture playing module plays the pictures to be annotated at a preset picture switching interval, and the video playing module plays the video to be annotated;
a label setting module, in which a plurality of annotation labels are set, each annotation label representing one scene;
and a label annotation module, which annotates the picture or video to be annotated according to the annotation labels.
According to the above technical solution, the client further comprises a keyboard input module. The keyboard input module provides as many keyboard keys as there are annotation labels in the label setting module, and each key corresponds to exactly one annotation label.
According to the above technical solution, the label annotation module matches the picture or video to be annotated with the annotation label corresponding to the key received from the keyboard input module.
According to the above technical solution, the client further comprises a local picture input module, which imports pictures captured by an acquisition device into local storage.
According to the above technical solution, the client further comprises a local video input module, which imports videos captured by an acquisition device into local storage.
According to the above technical solution, the client further comprises a local picture conversion module, which converts pictures temporarily stored in local storage into pictures to be annotated in a uniform picture format.
According to the above technical solution, the client further comprises a local video conversion module, which converts videos temporarily stored in local storage into videos to be annotated in a uniform video format.
According to the above technical solution, the client further comprises an online video recording module, which records the video to be annotated from an online live broadcast source and temporarily stores it in local storage.
According to the above technical solution, the client further comprises a multi-thread, multi-task concurrent processing module, which allows the label annotation module, the picture playing module and the video playing module to run simultaneously.
The beneficial effects of the disclosed client are that annotations can be added to a video or picture automatically, for example by setting a video frame number, and that the labels to be applied can be prepared in advance before annotation begins. This reduces workload and labor intensity and overcomes the heavy workload, high repetition rate and long time consumption of the conventional annotation method.
Drawings
Fig. 1 is a schematic diagram of a conventional labeling method.
Fig. 2 is another schematic diagram of a conventional labeling method.
Fig. 3 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 4 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 5 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 6 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 7 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 8 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 9 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 10 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 11 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 12 is a schematic diagram of a preferred embodiment of the present invention.
Detailed Description
The invention discloses a client for picture and video annotation data. Its implementation is further described below with reference to the preferred embodiment.
Figs. 3 to 12 of the drawings are schematic diagrams of the client for picture and video annotation data.
Preferably, the client for picture and video annotation data disclosed in this patent application comprises:
a local picture calling module and a local video calling module, wherein the local picture calling module reads pictures to be annotated that are stored in local storage, and the local video calling module reads videos to be annotated that are stored in local storage;
an online video calling module, which acquires a video to be annotated from an online live broadcast source;
a picture playing module and a video playing module, wherein the picture playing module plays the pictures to be annotated at a preset picture switching interval, and the video playing module plays the video to be annotated;
a label setting module, in which a plurality of annotation labels are set, each annotation label representing one scene;
and a label annotation module, which annotates the picture or video to be annotated according to the annotation labels.
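As a rough illustration of how a picture playing module with a preset switching interval might decide which picture is on screen, consider the sketch below. The function name and the use of elapsed seconds are assumptions made for this example; the patent does not specify an implementation.

```python
def picture_shown_at(pictures, switch_seconds, elapsed_seconds):
    """Return the picture displayed after `elapsed_seconds` of playback.

    Each picture stays on screen for `switch_seconds`; None is returned
    once every picture has been shown (or for a negative time).
    """
    if switch_seconds <= 0:
        raise ValueError("switch_seconds must be positive")
    if elapsed_seconds < 0:
        return None
    index = int(elapsed_seconds // switch_seconds)
    return pictures[index] if index < len(pictures) else None


pictures = ["a.jpg", "b.jpg", "c.jpg"]
current = picture_shown_at(pictures, 2.0, 3.9)  # second picture is on screen
```

Knowing which picture is currently displayed is what lets a label chosen by the operator be attached to the right file without a manual lookup.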
Further, the client comprises a keyboard input module. The keyboard input module provides as many keyboard keys as there are annotation labels in the label setting module, and each key corresponds to exactly one annotation label.
Further, when a user presses any of these keys, the label annotation module matches the picture or video to be annotated with the annotation label corresponding to the key received from the keyboard input module.
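The one-key-per-label correspondence can be sketched as a simple lookup table. The names `build_key_map` and `annotate`, the digit keys, and the example scene labels are all hypothetical choices for this illustration.

```python
def build_key_map(labels, keys="1234567890"):
    """Assign exactly one key to each annotation label, in order."""
    if len(labels) > len(keys):
        raise ValueError("not enough keys for the configured labels")
    # zip stops at the shorter input, so the number of keys in the map
    # always equals the number of labels.
    return dict(zip(keys, labels))


def annotate(item, key, key_map, annotations):
    """Record the label bound to `key` for `item`; ignore unbound keys."""
    label = key_map.get(key)
    if label is None:
        return False
    annotations[item] = label
    return True


key_map = build_key_map(["highway", "tunnel"])
annotations = {}
annotate("frame_0001.jpg", "2", key_map, annotations)
```

Because the labels are configured before annotation starts, the operator only presses one key per item while the picture or video plays.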
Further, the client comprises a local picture input module, which imports pictures captured by an acquisition device such as a camera into local storage.
Further, the client comprises a local video input module, which imports videos captured by an acquisition device such as a camera into local storage.
Further, the client comprises a local picture conversion module, which converts pictures captured by an acquisition device such as a camera and temporarily stored in local storage into pictures to be annotated in a uniform picture format.
Further, the client comprises a local video conversion module, which converts videos captured by an acquisition device such as a camera and temporarily stored in local storage into videos to be annotated in a uniform video format.
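One way to picture the conversion step is a helper that decides whether a file already has the uniform format and, for video, prepares a conversion command. The patent names neither the target formats nor a tool, so `.mp4` and the ffmpeg command line below are assumptions for illustration only; the command is built but not executed.

```python
from pathlib import Path

UNIFORM_VIDEO_SUFFIX = ".mp4"  # assumed target format, not from the patent


def conversion_target(source, uniform_suffix):
    """Return the converted file path, or None if no conversion is needed."""
    src = Path(source)
    if src.suffix.lower() == uniform_suffix:
        return None
    return src.with_suffix(uniform_suffix)


def ffmpeg_video_command(source, target):
    """Build (but do not run) an ffmpeg command that re-encodes a video.

    Assumption: ffmpeg is installed; H.264 video and AAC audio are used.
    """
    return ["ffmpeg", "-y", "-i", str(source),
            "-c:v", "libx264", "-c:a", "aac", str(target)]


target = conversion_target("capture.avi", UNIFORM_VIDEO_SUFFIX)
command = ffmpeg_video_command("capture.avi", target) if target else None
```

The resulting list could be passed to `subprocess.run` on a machine where ffmpeg is available.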
Further, the client comprises an online video recording module, which records the video to be annotated from an online live broadcast source and temporarily stores it in local storage.
Further, the client comprises a multi-thread, multi-task concurrent processing module, which allows the label annotation module, the picture playing module and the video playing module to run simultaneously.
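The idea of running playback and labeling at the same time can be illustrated with two threads connected by a queue. This is a sketch under stated assumptions: the function names, the use of integer frame numbers, and the `label_for` callback are hypothetical and stand in for the real playing and annotation modules.

```python
import queue
import threading


def play(frames, out_queue):
    """Playback thread: hand each frame to the labeler as it is shown."""
    for frame in frames:
        out_queue.put(frame)
    out_queue.put(None)  # sentinel: playback finished


def label(in_queue, label_for, results):
    """Labeling thread: attach a label to every frame it receives."""
    while True:
        frame = in_queue.get()
        if frame is None:
            break
        results.append((frame, label_for(frame)))


def run_concurrently(frames, label_for):
    """Run playback and labeling on separate threads and collect results."""
    frame_queue = queue.Queue()
    results = []
    workers = [
        threading.Thread(target=play, args=(frames, frame_queue)),
        threading.Thread(target=label, args=(frame_queue, label_for, results)),
    ]
    for worker in workers:
        worker.start()
    for worker in workers:
        worker.join()
    return results


labeled = run_concurrently([0, 1, 2], lambda f: "day" if f < 2 else "night")
```

A single consumer thread reading from a FIFO queue keeps the labeled frames in playback order even though the two threads run independently.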
The online video calling module, the online video recording module and the other functional modules of the client preferably adopt a streaming media server framework (Red5, EasyDarwin).
According to the preferred embodiment, and referring to Fig. 3 of the drawings, the client disclosed in this patent application can add annotations to videos or pictures automatically, for example by setting the video frame number, and the labels to be applied can be prepared in advance before annotation begins. This reduces workload and labor intensity and overcomes the heavy workload, high repetition rate and long time consumption of the conventional annotation method.
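The patent does not detail how setting a video frame number leads to automatic annotation. One plausible reading, sketched below purely as an assumption, is that the operator marks the frame at which a scene label starts, and every sampled frame up to the next mark inherits that label. The function name and parameters are hypothetical.

```python
from bisect import bisect_right


def label_sampled_frames(total_frames, frame_step, label_changes):
    """Label every `frame_step`-th frame from a few operator marks.

    `label_changes` maps a starting frame number to the label that applies
    from that frame until the next mark. Frames before the first mark are
    left unlabeled.
    """
    if frame_step <= 0:
        raise ValueError("frame_step must be positive")
    starts = sorted(label_changes)
    labeled = []
    for frame in range(0, total_frames, frame_step):
        position = bisect_right(starts, frame) - 1  # latest mark <= frame
        if position >= 0:
            labeled.append((frame, label_changes[starts[position]]))
    return labeled


result = label_sampled_frames(100, 25, {0: "highway", 50: "tunnel"})
```

Under this reading, two key presses are enough to label all four sampled frames of the example, instead of one manual entry per frame.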
It will be apparent to those skilled in the art that modifications and equivalents may be made in the embodiments and/or portions thereof without departing from the spirit and scope of the present invention.
Claims (7)
1. A client for picture and video annotation data, comprising:
a local picture calling module and a local video calling module, wherein the local picture calling module is used for reading a picture to be annotated stored in a local storage, and the local video calling module is used for reading a video to be annotated stored in the local storage;
an online video calling module, used for acquiring a video to be annotated from an online live broadcast source;
a picture playing module and a video playing module, wherein the picture playing module plays the picture to be annotated according to a preset picture switching time, and the video playing module is used for playing the video to be annotated;
a label setting module, wherein a plurality of annotation labels are set in the label setting module, and each annotation label represents one scene;
a label annotation module, used for annotating the picture to be annotated or the video to be annotated according to the annotation labels;
wherein the client further comprises a keyboard input module, the keyboard input module is provided with keyboard keys whose number is consistent with the number of annotation labels of the label setting module, and each keyboard key uniquely corresponds to one annotation label;
and the label annotation module matches the picture to be annotated or the video to be annotated with the annotation label corresponding to the keyboard key, according to the keyboard key input through the keyboard input module.
2. The client according to claim 1, wherein the client further comprises a local picture input module, and the local picture input module is configured to import the picture captured by the capturing device into a local storage.
3. The client according to claim 2, wherein the client further comprises a local video input module, and the local video input module is configured to import the video captured by the capturing device into the local storage.
4. The client according to claim 3, wherein the client further comprises a local picture conversion module, and the local picture conversion module is configured to convert the picture temporarily stored in the local storage into the picture to be annotated in the unified picture format.
5. The client according to claim 4, wherein the client further comprises a local video conversion module, and the local video conversion module is configured to convert the video temporarily stored in the local storage into the video to be annotated in the unified video format.
6. The client according to claim 1, further comprising an online video recording module, wherein the online video recording module is configured to record a video to be annotated of an online live source, and temporarily store the video to be annotated in a local storage.
7. The client for the picture video annotation data according to any one of claims 1 to 6, wherein the client for the picture video annotation data further comprises a multi-thread multi-task concurrent processing module, and the multi-thread multi-task concurrent processing module is configured to support the tag annotation module, the picture playing module, and the video playing module to run simultaneously.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810862044.3A CN109033394B (en) | 2018-08-01 | 2018-08-01 | Client for picture video annotation data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810862044.3A CN109033394B (en) | 2018-08-01 | 2018-08-01 | Client for picture video annotation data |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109033394A (en) | 2018-12-18 |
CN109033394B (en) | 2022-02-11 |
Family
ID=64647336
Family Applications (1)
Application Number | Status | Priority Date | Filing Date | Title |
---|---|---|---|---|
CN201810862044.3A CN109033394B (en) | Active | 2018-08-01 | 2018-08-01 | Client for picture video annotation data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109033394B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110991260B (en) * | 2019-11-12 | 2024-01-19 | 苏州智加科技有限公司 | Scene marking method, device, equipment and storage medium |
CN110865756B (en) * | 2019-11-12 | 2022-07-15 | 苏州智加科技有限公司 | Image labeling method, device, equipment and storage medium |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1809148B (en) * | 2002-12-12 | 2013-03-13 | 夏普株式会社 | Multi-medium data processing device capable of easily creating multi-medium content |
KR100539537B1 (en) * | 2003-12-31 | 2005-12-29 | 엘지전자 주식회사 | TV with input Labeling and Labeling Method the Same |
CN100493161C (en) * | 2005-07-15 | 2009-05-27 | 深圳创维-Rgb电子有限公司 | Intelligent channel scanning method |
US7616131B2 (en) * | 2006-10-16 | 2009-11-10 | Motorola, Inc. | Method and apparatus for allowing runtime creation of a user experience for a wireless device |
US7559017B2 (en) * | 2006-12-22 | 2009-07-07 | Google Inc. | Annotation framework for video |
CN101650754A (en) * | 2008-08-11 | 2010-02-17 | 鸿富锦精密工业(深圳)有限公司 | System and method for marking dimension of dynamically previewable graphics |
CN102623034B (en) * | 2012-03-28 | 2015-05-13 | 北京华夏电通科技股份有限公司 | Method and device for realizing mutual positioning and character fast recording of video data and text data |
CN104065979A (en) * | 2013-03-22 | 2014-09-24 | 北京中传数广技术有限公司 | Method for dynamically displaying information related with video content and system thereof |
CN106210900B (en) * | 2015-05-06 | 2020-05-05 | Tcl科技集团股份有限公司 | Method and system for realizing multi-section repeated reading mark of Web audio and video player |
KR101769071B1 (en) * | 2016-05-10 | 2017-08-18 | 네이버 주식회사 | Method and system for manufacturing and using video tag |
CN108090915A (en) * | 2017-12-08 | 2018-05-29 | 四川金英科技有限责任公司 | A kind of video object intelligence method for tracing and system |
CN108255961A (en) * | 2017-12-21 | 2018-07-06 | 深圳市数字城市工程研究中心 | Image annotation method of street view video, terminal device and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN109033394A (en) | 2018-12-18 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| GR01 | Patent grant | |
| PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: Client for image and video annotation data; Effective date of registration: 20230311; Granted publication date: 20220211; Pledgee: Hangzhou High-tech Financing Guarantee Co.,Ltd.; Pledgor: ZHEJIANG SHENMOU TECHNOLOGY Co.,Ltd.; Registration number: Y2023330000491 |