CN109033394B - Client for picture video annotation data - Google Patents

Client for picture video annotation data Download PDF

Info

Publication number
CN109033394B
CN109033394B CN201810862044.3A CN201810862044A CN109033394B CN 109033394 B CN109033394 B CN 109033394B CN 201810862044 A CN201810862044 A CN 201810862044A CN 109033394 B CN109033394 B CN 109033394B
Authority
CN
China
Prior art keywords
video
module
picture
local
client
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810862044.3A
Other languages
Chinese (zh)
Other versions
CN109033394A (en
Inventor
钟博煊
周礼
许淞斐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Shenmou Technology Co ltd
Original Assignee
Zhejiang Shenmou Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Shenmou Technology Co ltd filed Critical Zhejiang Shenmou Technology Co ltd
Priority to CN201810862044.3A priority Critical patent/CN109033394B/en
Publication of CN109033394A publication Critical patent/CN109033394A/en
Application granted granted Critical
Publication of CN109033394B publication Critical patent/CN109033394B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses a client for picture video annotation data, which comprises a local picture calling module, a local video calling module, an online video calling module, a picture playing module, a video playing module, a label setting module and a label annotation module. The picture playing module plays the picture to be marked according to the preset picture switching time, and the video playing module is used for playing the video to be marked. And the label labeling module labels the picture to be labeled or the video to be labeled. The client for the image video annotation data disclosed by the invention has the beneficial effects that the annotation can be automatically added to the video or the image by setting the video frame number and the like, and a label to be annotated can be prepared in advance before the annotation is added, so that the workload and the working strength are reduced, and the defects of large workload, high repetition rate, long time consumption and the like of the traditional annotation method are effectively overcome.

Description

Client for picture video annotation data
Technical Field
The invention belongs to the technical field of computer vision based on deep learning, and particularly relates to a client for image video annotation data.
Background
In the field of computer vision technology based on deep learning, a large amount of picture/video data is required to train a training model in order to serve a deep learning algorithm. Objectively, the amount of data required to train a model is large. Therefore, how to reasonably and efficiently collect training data is one of the important topics for deep learning.
When labeled, each picture corresponds to a label, which is the "explanation" for the picture. Fig. 1 shows two labeled folders, the left folder is a picture folder, the right folder is a txt file, fig. 2 shows the txt file in fig. 1, and each line of data in the file corresponds to each picture on the left of the picture one by one.
In the conventional labeling method, the pictures need to be manually labeled. In other words, all pictures are put in a folder, and then each picture is manually corresponding to a label in the txt file, so that the one-to-one correspondence is needed, and the workload is huge. The video is more troublesome, a screenshot tool is also needed to link the video into pictures, twenty or so frames of pictures are possibly needed every second, namely twenty pictures every second, and then the method of marking the pictures is circulated.
Disclosure of Invention
The present invention overcomes the above-mentioned drawbacks and provides a client for picture video annotation data, which is directed to the state of the art.
The invention adopts the following technical scheme that the client for the image video annotation data comprises:
the system comprises a local picture calling module and a local video calling module, wherein the local picture calling module is used for reading a picture to be labeled stored in a local memory, and the local video calling module is used for reading a video to be labeled stored in the local memory;
the online picture calling module is used for acquiring a video to be annotated of an online live broadcast source;
the image playing module plays the image to be marked according to preset image switching time, and the video playing module is used for playing the video to be marked;
the system comprises a label setting module, a label setting module and a display module, wherein a plurality of label labels are arranged in the label setting module, and each label represents one scene;
and the label labeling module is used for labeling the picture to be labeled or the video to be labeled according to the label.
According to the technical scheme, the client for the image video annotation data further comprises a keyboard input module, the keyboard input module is provided with keyboard keys, the number of the keyboard keys is consistent with that of the annotation labels of the label setting module, and each keyboard key corresponds to each annotation label uniquely.
According to the technical scheme, the label labeling module matches the picture to be labeled or the video to be labeled with the label corresponding to the keyboard key according to the keyboard key input by the keyboard input module.
According to the technical scheme, the client for the image video annotation data further comprises a local image input module, and the local image input module is used for importing the image acquired by the acquisition equipment into a local memory.
According to the technical scheme, the client for the image video annotation data further comprises a local video input module, and the local video input module is used for importing the video acquired by the acquisition equipment into a local memory.
According to the technical scheme, the client for the image video annotation data further comprises a local image conversion module, and the local image conversion module is used for converting the image temporarily stored in the local memory into the image to be annotated in the uniform image format.
According to the technical scheme, the client for the image video annotation data further comprises a local video conversion module, and the local video conversion module is used for converting the video temporarily stored in the local memory into the video to be annotated with the uniform video format.
According to the technical scheme, the client for the image video annotation data further comprises an online video recording module, wherein the online video recording module is used for recording a video to be annotated of an online live broadcast source and temporarily storing the video to be annotated in a local storage.
According to the technical scheme, the client for the image video annotation data further comprises a multi-thread multi-task concurrent processing module, and the multi-thread multi-task concurrent processing module is used for supporting the label annotation module, the image playing module and the video playing module to run simultaneously.
The client for the image video annotation data disclosed by the invention has the beneficial effects that the annotation can be automatically added to the video or the image by setting the video frame number and the like, and a label to be annotated can be prepared in advance before the annotation is added, so that the workload and the working strength are reduced, and the defects of large workload, high repetition rate, long time consumption and the like of the traditional annotation method are effectively overcome.
Drawings
Fig. 1 is a schematic diagram of a conventional labeling method.
Fig. 2 is another schematic diagram of a conventional labeling method.
Fig. 3 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 4 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 5 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 6 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 7 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 8 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 9 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 10 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 11 is a schematic diagram of a preferred embodiment of the present invention.
Fig. 12 is a schematic view of a preferred embodiment of the present invention.
Detailed Description
The invention discloses a client for image video annotation data, and the specific implementation of the invention is further described below with reference to the preferred embodiment.
Referring to fig. 3 to 12 of the drawings, fig. 3 to 12 respectively show schematic diagrams of the client for the picture video annotation data.
Preferably, the client for the video annotation data of pictures disclosed in the present patent application includes:
the system comprises a local picture calling module and a local video calling module, wherein the local picture calling module is used for reading a picture to be labeled stored in a local memory, and the local video calling module is used for reading a video to be labeled stored in the local memory;
the online picture calling module is used for acquiring a video to be annotated of an online live broadcast source;
the image playing module plays the image to be marked according to preset image switching time, and the video playing module is used for playing the video to be marked;
the system comprises a label setting module, a label setting module and a display module, wherein a plurality of label labels are arranged in the label setting module, and each label represents one scene;
and the label labeling module is used for labeling the picture to be labeled or the video to be labeled according to the label.
Furthermore, the client for the image video annotation data further comprises a keyboard input module, the keyboard input module is provided with keyboard keys with the same number as the annotation labels of the label setting module, and each keyboard key is uniquely corresponding to each annotation label.
Further, when a user triggers any keyboard key, the label labeling module matches the picture to be labeled or the video to be labeled with the label corresponding to the keyboard key according to the keyboard key input by the keyboard input module.
Furthermore, the client for the image video annotation data further comprises a local image input module, and the local image input module is used for importing images acquired by acquisition equipment such as a camera into a local memory.
Furthermore, the client for the picture video annotation data further comprises a local video input module, and the local video input module is used for importing videos collected by collecting equipment such as a camera into a local memory.
Further, the client for the image video annotation data further comprises a local image conversion module, and the local image conversion module is used for converting images acquired by the acquisition equipment such as the camera temporarily stored in the local memory into the images to be annotated in the uniform image format.
Further, the client for the image video annotation data further comprises a local video conversion module, and the local video conversion module is used for converting videos acquired by the acquisition devices such as the camera temporarily stored in the local memory into videos to be annotated in a uniform video format.
Further, the client for the picture video annotation data further comprises an online video recording module, wherein the online video recording module is used for recording a video to be annotated of an online live broadcast source and temporarily storing the video to be annotated in a local storage.
Furthermore, the client for the image video annotation data further comprises a multithreading multitask concurrent processing module, and the multithreading multitask concurrent processing module is used for supporting the label annotation module, the image playing module and the video playing module to run simultaneously.
The online video calling module, the online video recording module and other functional modules of the client for the picture video annotation data preferably adopt a streaming media server Red5 framework easyDarwin.
According to the preferred embodiment, referring to fig. 3 of the drawings, the client for tagging data of picture videos disclosed in the patent application of the present invention can automatically add tags to videos or pictures by setting the number of video frames, and the like, and can prepare tags to be tagged in advance before adding tags, which is helpful for reducing workload and working strength, and effectively solves the defects of large workload, high repetition rate, long time consumption, and the like of the conventional tagging method.
It will be apparent to those skilled in the art that modifications and equivalents may be made in the embodiments and/or portions thereof without departing from the spirit and scope of the present invention.

Claims (7)

1. A client for video annotation data of pictures, comprising:
the system comprises a local picture calling module and a local video calling module, wherein the local picture calling module is used for reading a picture to be labeled stored in a local memory, and the local video calling module is used for reading a video to be labeled stored in the local memory;
the online video calling module is used for acquiring a video to be annotated of an online live broadcast source;
the image playing module plays the image to be marked according to preset image switching time, and the video playing module is used for playing the video to be marked;
the system comprises a label setting module, a label setting module and a display module, wherein a plurality of label labels are arranged in the label setting module, and each label represents one scene;
the label labeling module is used for labeling the picture to be labeled or the video to be labeled according to the label;
the client for the image video annotation data further comprises a keyboard input module, the keyboard input module is provided with keyboard keys, the number of the keyboard keys is consistent with that of the annotation labels of the label setting module, and each keyboard key corresponds to each annotation label uniquely;
and the label labeling module matches the picture to be labeled or the video to be labeled with the label corresponding to the keyboard key according to the keyboard key input by the keyboard input module.
2. The client according to claim 1, wherein the client further comprises a local picture input module, and the local picture input module is configured to import the picture captured by the capturing device into a local storage.
3. The client according to claim 2, wherein the client further comprises a local video input module, and the local video input module is configured to import the video captured by the capturing device into the local storage.
4. The client according to claim 3, wherein the client further comprises a local picture conversion module, and the local picture conversion module is configured to convert the picture temporarily stored in the local storage into the picture to be annotated in the unified picture format.
5. The client according to claim 4, wherein the client further comprises a local video conversion module, and the local video conversion module is configured to convert the video temporarily stored in the local storage into the video to be annotated in the unified video format.
6. The client according to claim 1, further comprising an online video recording module, wherein the online video recording module is configured to record a video to be annotated of an online live source, and temporarily store the video to be annotated in a local storage.
7. The client for the picture video annotation data according to any one of claims 1 to 6, wherein the client for the picture video annotation data further comprises a multi-thread multi-task concurrent processing module, and the multi-thread multi-task concurrent processing module is configured to support the tag annotation module, the picture playing module, and the video playing module to run simultaneously.
CN201810862044.3A 2018-08-01 2018-08-01 Client for picture video annotation data Active CN109033394B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810862044.3A CN109033394B (en) 2018-08-01 2018-08-01 Client for picture video annotation data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810862044.3A CN109033394B (en) 2018-08-01 2018-08-01 Client for picture video annotation data

Publications (2)

Publication Number Publication Date
CN109033394A CN109033394A (en) 2018-12-18
CN109033394B true CN109033394B (en) 2022-02-11

Family

ID=64647336

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810862044.3A Active CN109033394B (en) 2018-08-01 2018-08-01 Client for picture video annotation data

Country Status (1)

Country Link
CN (1) CN109033394B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110991260B (en) * 2019-11-12 2024-01-19 苏州智加科技有限公司 Scene marking method, device, equipment and storage medium
CN110865756B (en) * 2019-11-12 2022-07-15 苏州智加科技有限公司 Image labeling method, device, equipment and storage medium

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1809148B (en) * 2002-12-12 2013-03-13 夏普株式会社 Multi-medium data processing device capable of easily creating multi-medium content
KR100539537B1 (en) * 2003-12-31 2005-12-29 엘지전자 주식회사 TV with input Labeling and Labeling Method the Same
CN100493161C (en) * 2005-07-15 2009-05-27 深圳创维-Rgb电子有限公司 Intelligent channel scanning method
US7616131B2 (en) * 2006-10-16 2009-11-10 Motorola, Inc. Method and apparatus for allowing runtime creation of a user experience for a wireless device
US7559017B2 (en) * 2006-12-22 2009-07-07 Google Inc. Annotation framework for video
CN101650754A (en) * 2008-08-11 2010-02-17 鸿富锦精密工业(深圳)有限公司 System and method for marking dimension of dynamically previewable graphics
CN102623034B (en) * 2012-03-28 2015-05-13 北京华夏电通科技股份有限公司 Method and device for realizing mutual positioning and character fast recording of video data and text data
CN104065979A (en) * 2013-03-22 2014-09-24 北京中传数广技术有限公司 Method for dynamically displaying information related with video content and system thereof
CN106210900B (en) * 2015-05-06 2020-05-05 Tcl科技集团股份有限公司 Method and system for realizing multi-section repeated reading mark of Web audio and video player
KR101769071B1 (en) * 2016-05-10 2017-08-18 네이버 주식회사 Method and system for manufacturing and using video tag
CN108090915A (en) * 2017-12-08 2018-05-29 四川金英科技有限责任公司 A kind of video object intelligence method for tracing and system
CN108255961A (en) * 2017-12-21 2018-07-06 深圳市数字城市工程研究中心 Image annotation method of street view video, terminal device and storage medium

Also Published As

Publication number Publication date
CN109033394A (en) 2018-12-18

Similar Documents

Publication Publication Date Title
CN109275046B (en) Teaching data labeling method based on double video acquisition
CN105446946B (en) Rearrangement method, system and the electronic reading terminal of format document
CN109033394B (en) Client for picture video annotation data
CN103905734A (en) Method and device for intelligent tracking and photographing
CN111276018A (en) Network course recording method and device and terminal
CN115547367A (en) Audio and video buffer area reading processing method and device
CN108647710B (en) Video processing method and device, computer and storage medium
CN105745921A (en) Conference recording method and system for video network conference
CN111339357A (en) Recommendation method and device based on live user behaviors
US20210279372A1 (en) Fabric detecting and recording method and apparatus
CN114419502A (en) Data analysis method and device and storage medium
US11265594B2 (en) Data processing method and device, storage medium, electronic device
KR20150112113A (en) Method for managing online lecture contents based on event processing
CN111368140A (en) Video tag generation method and system
CN110362806A (en) Personalized operation arrangement system based on big data
CN104125492A (en) Video playing method and device
CN110909726B (en) Written document interaction system and method based on image recognition
CN105303904A (en) Moving learning method and system
CN110275860B (en) System and method for recording teaching process
UA114587C2 (en) THE SYSTEM (OPTIONS) OF SELECTION, OPENING AND ROTATION OF PHOTO AND / OR VIDEO CONTENT, AND ALSO MONITORING THE QUANTITY OF THE GOODS AND THE METHOD (OPTIONS) FOR ITS
CN203632805U (en) High-definition interrogation system based on blue-ray double CD-ROM drive and network storage
CN112580696A (en) Advertisement label classification method, system and equipment based on video understanding
CN112580644A (en) Testing method and device based on video stream cutout time and readable storage medium
CN112347406A (en) Business association interactive service and display system based on smart large screen
CN107807950B (en) Online friendly reading picture segmentation method for Internet station group magazine files

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Client for image and video annotation data

Effective date of registration: 20230311

Granted publication date: 20220211

Pledgee: Hangzhou High-tech Financing Guarantee Co.,Ltd.

Pledgor: ZHEJIANG SHENMOU TECHNOLOGY Co.,Ltd.

Registration number: Y2023330000491