WO2023231159A1 - Image retrieval method and apparatus for realizing achieving ia by combining rpa and ai, and electronic device - Google Patents

Image retrieval method and apparatus for realizing achieving ia by combining rpa and ai, and electronic device Download PDF

Info

Publication number
WO2023231159A1
WO2023231159A1 PCT/CN2022/106339 CN2022106339W WO2023231159A1 WO 2023231159 A1 WO2023231159 A1 WO 2023231159A1 CN 2022106339 W CN2022106339 W CN 2022106339W WO 2023231159 A1 WO2023231159 A1 WO 2023231159A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
information
initial
pixel
area image
Prior art date
Application number
PCT/CN2022/106339
Other languages
French (fr)
Chinese (zh)
Inventor
谭繁华
Original Assignee
来也科技(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 来也科技(北京)有限公司 filed Critical 来也科技(北京)有限公司
Publication of WO2023231159A1 publication Critical patent/WO2023231159A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/53Querying
    • G06F16/532Query formulation, e.g. graphical querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content

Definitions

  • the present disclosure relates to the field of computer technology, and in particular to an image retrieval method, device and electronic equipment that combines RPA (Robotic Process Automation, Robotic Process Automation) and AI (Artificial Intelligence, Artificial Intelligence) to realize IA (Intelligent Automation, Intelligent Automation).
  • RPA Robot Process Automation, Robotic Process Automation
  • AI Artificial Intelligence, Artificial Intelligence
  • Robotic Process Automation uses specific "robot software” to simulate human operations on a computer and automatically execute process tasks according to rules.
  • AI Artificial Intelligence
  • Intelligent Automation is a general term for a series of technologies from robotic process automation to artificial intelligence. It combines RPA with Optical Character Recognition (OCR), Intelligent Character Recognition (ICR), process mining ( Process Mining), Deep Learning (Deep Learning, DL), Machine Learning (ML), Natural Language Processing (NLP), Speech Recognition (Automatic Speech Recognition, ASR), Speech Synthesis (Text To Speech, TTS), Computer Vision (CV) and other AI technologies are combined to create end-to-end business processes that can think, learn and adapt, covering process discovery, process automation, and automatic and continuous data collection Collect and understand the meaning of data, and use data to manage and optimize the entire process of business processes.
  • OCR Optical Character Recognition
  • ICR Intelligent Character Recognition
  • process mining Process Mining
  • Deep Learning Deep Learning
  • ML Machine Learning
  • NLP Natural Language Processing
  • Speech Recognition Automatic Speech Recognition, ASR
  • Speech Synthesis Text To Speech, TTS
  • Computer Vision Computer Vision
  • the present disclosure aims to solve one of the technical problems in the related art, at least to a certain extent.
  • the purpose of this disclosure is to propose an image retrieval method, device, electronic device and storage medium that combines RPA and AI to realize IA, which can use RPA combined with artificial intelligence AI to realize intelligent automated IA for image retrieval, and can achieve image retrieval before image retrieval.
  • Preprocess the image in a timely manner to remove interference information in the image effectively improve the pertinence of the obtained target area image in the retrieval process, effectively reduce the impact of interference information on the retrieval process, thereby effectively improving image retrieval efficiency and image retrieval results. accuracy.
  • the image retrieval method that combines RPA and AI to implement IA proposed by the embodiment of the first aspect of the present disclosure includes: obtaining an initial image based on robotic process automation RPA technology, where the initial image has image description information; and obtaining an initial image from the initial image based on artificial intelligence AI technology. Intercept the initial area image; process the initial area image according to the image description information to obtain the target area image; retrieve the target content based on the target area image.
  • intercepting the initial area image from the initial image based on artificial intelligence AI technology includes: calling a natural language processing NLP service to identify the subject information in the initial image; and determining, based on the subject information, that the subject corresponds to the subject in the initial image.
  • the location description information intercept the area image corresponding to the location description information from the initial image as the initial area image.
  • the method further includes: determining the image scale information of the initial image; and/or determining the pixel feature information of the initial image; and/or determining the initial image for the initial image. Processing parameter information specified by the image; use image scale information, and/or pixel feature information, and/or processing parameter information as image description information.
  • RPA Robotic Process Automation
  • the image description information includes: image scale information; wherein, processing the initial area image according to the image description information to obtain the target area image includes: enlarging the initial area image according to the image scale information; The image is used as the target area image.
  • the image description information includes: pixel feature information; wherein, processing the initial area image according to the image description information to obtain the target area image includes: obtaining the area pixels in the initial image area; parsing the area from the pixel feature information Regional pixel features of pixels; enhance the regional pixel features of each regional pixel in the initial image area to obtain the target area image.
  • the image description information includes: image scale information and pixel feature information; wherein, processing the initial area image according to the image description information to obtain the target area image includes: enlarging the initial area image according to the image scale information, Obtain an image of the area to be filled, where the image of the area to be filled includes: pixels to be filled; parsing the first pixel features of the pixels to be filled from the pixel feature information; parsing the second pixel features of the regional pixels in other area images from the pixel feature information , wherein the initial area image and other area images together constitute the initial image; a filling pixel feature is generated according to the first pixel feature and the second pixel feature; the to-be-filled area image is filled according to the filling pixel feature to obtain the target area image.
  • the image description information includes: processing parameter information; wherein processing the initial area image according to the image description information to obtain the target area image includes: processing the initial area image according to the processing parameter information to obtain the target area image.
  • retrieving target content based on the target area image includes: determining semantic representation information of the target area image; retrieving the target content based on the semantic representation information.
  • determining the semantic representation information of the target area image includes: identifying the target object outline from the target area image; determining the object outline information according to the target object outline; processing the object outline information to obtain the outline vector representation; converting the outline Vector representation as semantic representation of information.
  • retrieving the target content according to the semantic representation information includes: determining a candidate similarity level corresponding to the semantic representation information, wherein the candidate similarity level belongs to a pre-constructed graph data structure, the candidate similarity level, It is the level to which the similarity between the corresponding represented content and the initial image belongs; the content represented by the candidate similarity level in the graph data structure is used as the target content.
  • the image retrieval device that combines RPA and AI to implement IA proposed by the embodiment of the second aspect of the present disclosure includes: an acquisition module for acquiring an initial image based on robotic process automation RPA technology, where the initial image has image description information; a first processing module , used to intercept the initial area image from the initial image based on artificial intelligence AI technology; the second processing module is used to process the initial area image according to the image description information to obtain the target area image; the retrieval module is used to retrieve the target according to the target area image content.
  • the first processing module is specifically configured to: call a natural language processing NLP service to identify subject information in the initial image; determine based on the subject information that the subject corresponds to the position description information in the initial image; from the initial The area image corresponding to the position description information is intercepted from the image as the initial area image.
  • the device further includes: a determining module, configured to determine image scale information of the initial image; and/or determine pixel feature information of the initial image; and/or determine processing parameter information specified for the initial image; and Image scale information, and/or pixel feature information, and/or processing parameter information are used as image description information.
  • a determining module configured to determine image scale information of the initial image; and/or determine pixel feature information of the initial image; and/or determine processing parameter information specified for the initial image
  • Image scale information, and/or pixel feature information, and/or processing parameter information are used as image description information.
  • the image description information includes: image scale information; wherein, the second processing module is specifically configured to: expand the initial region image according to the image scale information; and use the expanded image as the target region image.
  • the image description information includes: pixel feature information; wherein, the second processing module is further configured to: obtain the regional pixels in the initial image region; parse the regional pixel features of the regional pixels from the pixel feature information; The regional pixel features of each area pixel in the initial image area are enhanced to obtain the target area image.
  • the image description information includes: image scale information and pixel feature information; wherein, the second processing module is further configured to: expand the initial region image according to the image scale information to obtain the region image to be filled, where , the region image to be filled includes: pixels to be filled; parsing the first pixel features of the pixels to be filled from the pixel feature information; parsing the second pixel features of the region pixels in other region images from the pixel feature information, where the initial region image and Other area images together constitute the initial image; a filling pixel feature is generated based on the first pixel feature and the second pixel feature; the to-be-filled area image is filled according to the filling pixel feature to obtain the target area image.
  • the image description information includes: processing parameter information; wherein the second processing module is further configured to: process the initial area image according to the processing parameter information to obtain the target area image.
  • the retrieval module includes: a determination sub-module, used to determine the semantic representation information of the target area image; a retrieval sub-module, used to retrieve the target content according to the semantic representation information.
  • the determination sub-module is specifically used to: identify the target object contour from the target area image; determine the object contour information according to the target object contour; process the object contour information to obtain the contour vector representation; use the contour vector representation as semantic meaning representation information.
  • the retrieval sub-module is specifically used to: determine candidate similarity levels corresponding to semantic representation information, where the candidate similarity levels belong to a pre-constructed graph data structure, and the candidate similarity levels are their corresponding The level that represents the similarity between the content and the initial image; use the content represented by the candidate similarity level in the graph data structure as the target content.
  • the electronic device provided by the embodiment of the third aspect of the present disclosure includes: at least one processor and a memory; the memory stores computer execution instructions; and at least one processor executes the computer execution instructions stored in the memory, so that the at least one processor executes the first aspect of the disclosure.
  • the embodiment proposes an image retrieval method that combines RPA and AI to implement IA.
  • the computer-readable storage medium proposed in the embodiment of the fourth aspect of the disclosure has computer-executable instructions stored in the computer-readable storage medium.
  • the processor executes the computer-executed instructions, the combination of RPA and AI proposed in the embodiment of the first aspect of the disclosure is realized.
  • Implement the image retrieval method of IA is realized.
  • the advantages or beneficial effects of the above technical solutions at least include: obtaining the initial image based on robotic process automation RPA technology, intercepting the initial area image from the initial image based on artificial intelligence AI technology, processing the initial area image according to the image description information, and obtaining the target area image. , retrieve the target content based on the target area image, and can use RPA combined with artificial intelligence AI to realize intelligent automation IA of image retrieval. It can preprocess the image in time before image retrieval to remove interference information in the image and effectively improve the obtained target. The pertinence of regional images in the retrieval process can effectively reduce the impact of interference information on the retrieval process, thereby effectively improving the image retrieval efficiency and the accuracy of image retrieval results.
  • Figure 1 is a schematic flowchart of an image retrieval method that combines RPA and AI to implement IA proposed by an embodiment of the present disclosure
  • Figure 2 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure
  • Figure 3 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure
  • Figure 4 is a schematic flow chart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure
  • Figure 5 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure
  • Figure 6 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure
  • Figure 7 is a schematic structural diagram of an image retrieval device that combines RPA and AI to implement IA proposed by an embodiment of the present disclosure
  • Figure 8 is a schematic structural diagram of an image retrieval device that combines RPA and AI to implement IA proposed by another embodiment of the present disclosure
  • FIG. 9 shows a structural block diagram of an electronic device according to an embodiment of the present disclosure.
  • Robot Process Automation refers to the automatic execution of process tasks according to rules on a computer through robot application software.
  • AI Artificial Intelligence
  • hardware-level technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, and big data processing
  • artificial intelligence software technology mainly includes computer vision technology, speech recognition technology, natural language processing technology, and machine learning, Deep learning, big data processing technology, knowledge graph technology and other major directions.
  • IA Intelligent Automation
  • ICR Optical Character Recognition
  • ICR Intelligent character Recognition
  • ICR Intelligent Character Recognition
  • ICR Process Mining
  • DL Deep Learning
  • ML Machine Learning
  • NLP Natural Language Processing
  • Speech Recognition Automatic Speech Recognition
  • ASR Automatic Speech Recognition
  • TTS Transmission To Speech
  • CV computer vision
  • other AI technologies are combined to create an end-to-end business process that can think, learn and adapt, covering everything from processes to The entire process from discovery, process automation, to using data to manage and optimize business processes through automatic and continuous data collection, understanding the meaning of data, and using data.
  • the term "initial image” refers to an image to be retrieved.
  • the initial image can be, for example, a car image captured by a traffic monitoring device, or it can be any kind of image containing a retrieval object. There are no restrictions on this.
  • image description information refers to data information that describes one or more relevant features of the initial image.
  • the image description information can be used to describe the multi-dimensional features of the image, and Features include, for example, scale features, pixel features, etc.
  • the image description information may specifically include, for example, image scale information of the initial image, and/or pixel feature information, and/or processing parameter information, etc., without limitation.
  • the term "initial area image” refers to a partial image intercepted from the initial image using artificial intelligence-based AI technology.
  • the initial area image may include a target area image.
  • target area image refers to an image obtained by processing the initial area image using the image description information as a reference basis.
  • target content refers to the content obtained by retrieving the target area image as the retrieval reference basis during the image retrieval process.
  • the target content can be, for example, a retrieved picture, There are no restrictions on the text, audio and video, etc. that describe the retrieved images.
  • the term "subject" refers to the main description object contained in the initial image, such as a person in a portrait picture, a car in a car display picture, etc., without limitation.
  • subject information refers to information related to the retrieval object contained in the initial image, such as the position information and area information of the retrieval object in the initial image, and is not limited to this.
  • position description information can be used to describe information related to the location of the subject in the initial image, such as the distribution and proportion of the subject in the initial image. This is not done limit.
  • image scale information may refer to relevant information used to describe the scale of the initial image, such as the size, area, etc. of the initial image, without limitation.
  • pixel feature information can be used to describe the feature information of pixels contained in the initial image, such as the number of pixels, color, etc., without limitation.
  • processing parameter information may refer to the expansion factor, fill color, brightness, hue, saturation, sharpening degree, etc. specified in advance for the initial image, without limitation.
  • area pixel refers to a pixel corresponding to one or more image areas in the initial image area.
  • regional pixel features refers to the relevant features of regional pixels obtained based on pixel feature information, such as the number, color, etc. of regional pixels.
  • region image to be filled refers to an image obtained by enlarging the initial region image based on image scale information.
  • pixels to be filled refers to pixels in the image of the area to be filled that need to be filled.
  • first pixel feature refers to the relevant feature of the pixel to be filled that is obtained based on the pixel feature information.
  • second pixel feature refers to the relevant features of regional pixels in other regional images obtained based on pixel feature information.
  • the term "filling pixel feature" refers to the pixel feature obtained based on the first pixel feature and the second pixel feature.
  • the filling pixel feature can be used to perform filling processing as an image of the area to be filled. Reference.
  • semantic representation information may be used for information that characterizes image-related features of a target area.
  • the target area image can have image features such as color features, contour features, linear features, center features, diagonal features, texture features, local features, and shape features, and there are no restrictions on this.
  • the semantic representation information may refer to information that characterizes one or more of the above image features.
  • target object outline refers to the outline of the retrieval target object contained in the target area image, such as the outline of the human body in the person image, the outline of the car light in the car image, and is not limited to this .
  • object contour information refers to relevant information obtained based on the contour of the target object.
  • the term "contour vector representation” refers to a vector representation that can represent the contour information of an object and is mapped in a vector space.
  • the vector representation can be a feature obtained by mapping features to a vector space. , such as contour features.
  • the term “candidate similarity level” refers to the level in the graph data structure to which the degree of similarity between the content represented by the semantic representation information and the initial image belongs.
  • graph data structure refers to a data structure established in advance in the vector retrieval library using vector distance as the basis for division.
  • the graph data structure can be used to perform image retrieval in the image retrieval process.
  • Vector distance is used as a reference to find candidate similarity levels to narrow the search scope.
  • the intelligent automation platform can realize the seamless integration of RPA, Intelligent Document Processing (IDP), Conversational AI (CoAI), Process Mining and other capabilities, and has the capabilities of "business understanding", “The five major categories of functions, “Process Creation”, “Run Anywhere”, “Centralized Management and Control”, and “Human-Machine Collaboration”, enable enterprises to realize end-to-end intelligent automation of business processes, replace manual operations, further improve business efficiency, and accelerate digital transformation.
  • Intelligent document processing is one of the core capabilities of the intelligent automation platform.
  • Intelligent Document Processing is based on Optical Character Recognition (OCR), Computer Vision (CV), Natural Language Processing (NLP), Knowledge Graph (KG) ) and other AI technologies, it can identify, classify, extract elements, verify, compare, and correct errors of various types of documents, and is a new generation of automation technology that helps enterprises realize the intelligence and automation of document processing.
  • OCR Optical Character Recognition
  • CV Computer Vision
  • NLP Natural Language Processing
  • KG Knowledge Graph
  • Figure 1 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by an embodiment of the present disclosure.
  • an image retrieval method that combines RPA and AI to implement IA is configured as an image retrieval device that combines RPA and AI to implement IA.
  • the image retrieval method that combines RPA and AI to implement IA can be configured in Among the image retrieval devices that combine RPA and AI to implement IA, the image retrieval device that combines RPA and AI to implement IA can be installed in a server or in an electronic device. This is not limited in the embodiments of the present disclosure.
  • an image retrieval method that combines RPA and AI to implement IA is configured in an electronic device as an example.
  • electronic devices such as smartphones, tablets, personal digital assistants, e-books and other hardware devices with various operating systems.
  • execution subject of the embodiments of the present disclosure may be, in terms of hardware, a central processing unit (CPU) in a server or an electronic device, and in terms of software, it may be, for example, a server or a related processor in an electronic device. Background service, there is no restriction on this.
  • CPU central processing unit
  • software it may be, for example, a server or a related processor in an electronic device. Background service, there is no restriction on this.
  • the "retrieval” in the embodiment of the present disclosure refers to the image retrieval process of realizing intelligent automation IA by combining robotic process automation RPA and artificial intelligence AI.
  • the image retrieval process is a fully automated image retrieval process.
  • the retrieval process, and the image retrieval process is also combined with artificial intelligence AI to realize automated image retrieval in the field of Natural Language Processing (NLP).
  • NLP Natural Language Processing
  • NLP natural language processing
  • Natural language processing that is, computer science, artificial intelligence, and linguistics focus on computer and human (natural) language. areas of interaction.
  • the full-process automation can be implemented to obtain the initial image based on the robotic process automation RPA technology, and the initial area image can be intercepted from the initial image based on the artificial intelligence AI technology.
  • the initial area image is processed according to the image description information to obtain the target area image, and the target content is retrieved based on the target area image.
  • the image retrieval method that combines RPA and AI to implement IA includes:
  • S101 Obtain an initial image based on Robotic Process Automation (RPA) technology, where the initial image has image description information.
  • RPA Robotic Process Automation
  • Robotic Process Automation refers to the automatic execution of process tasks according to rules on the computer through robot application software.
  • the initial image refers to an image to be retrieved.
  • the initial image may be, for example, a car image captured by a traffic monitoring device, or may be any type of image containing a retrieval object, and is not limited to this.
  • an application scenario in the embodiment of the present disclosure may be, for example, using Robotic Process Automation (RPA) to obtain the car image captured by the traffic monitoring device, using the car image as the initial image, and then the acquired initial image can be Perform image retrieval that combines RPA and AI to implement IA to determine the car information in the initial image.
  • RPA Robotic Process Automation
  • the image retrieval method that combines RPA and AI to implement IA described in the embodiments of the present disclosure can also be applied to any other possible image retrieval. In the scene, there is no restriction on this.
  • the image description information refers to the data information that describes one or more relevant features of the initial image.
  • the image description information can be used to describe the multi-dimensional features of the image, and the features are, for example, scale features, pixel features etc.
  • the image description information may specifically be, for example, image scale information of the initial image, and/or pixel feature information, and/or processing parameter information, etc., without limitation.
  • the application data interface when acquiring the initial image based on Robotic Process Automation (RPA) technology, can be pre-configured, and the RPA robot will receive the user-robot interaction via the application data interface according to the preset software operation process. Interaction image information, and use the obtained image information of the interaction between the user and the robot as the initial image.
  • RPA Robotic Process Automation
  • a third-party image collection device can also be used, and a communication link between the execution subject of the embodiment of the present disclosure and the third-party image collection device can be established in advance, and the data collected by the third-party image collection device can be obtained based on Robotic Process Automation (RPA).
  • RPA Robotic Process Automation
  • Image use it as the initial image, or you can use any other possible method based on Robotic Process Automation (RPA) to obtain the initial image, and there is no limit to this.
  • S102 Intercept the initial area image from the initial image based on artificial intelligence AI technology.
  • Artificial Intelligence refers to the study of using computers to simulate certain human thinking processes and intelligent behaviors (such as learning, reasoning, thinking, planning, etc.). It has both hardware-level technology and software-level technology.
  • Artificial intelligence hardware technology generally includes technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, and big data processing; artificial intelligence software technology mainly includes computer vision technology, speech recognition technology, natural language processing technology, and machine learning, Deep learning, big data processing technology, knowledge graph technology and other major directions.
  • the initial area image refers to a partial image intercepted from the initial image based on artificial intelligence AI technology, and the initial area image may include a target area image.
  • the initial area image can be intercepted from the initial image based on artificial intelligence (AI) technology.
  • AI artificial intelligence
  • artificial intelligence AI when intercepting the initial area image from the initial image based on artificial intelligence AI technology, artificial intelligence AI may be used to identify the boundary information of the subject image in the initial image, and then intercept the initial area from the initial image based on the boundary information. image.
  • a pre-trained matting model can also be used to intercept the initial image to obtain the initial region image, or any other possible method based on artificial intelligence (AI) can be used to intercept the initial region from the initial image. images, no restrictions are placed on this.
  • AI artificial intelligence
  • S103 Process the initial area image according to the image description information to obtain the target area image.
  • the target area image refers to the image obtained by processing the initial area image using the image description information as a reference basis.
  • the image description information and the initial area image can be input into the pre-trained image processing model to obtain the target area image and transmit it to
  • the execution subject of the embodiment of the present disclosure may also use any other possible methods, such as mathematical and engineering methods, to process the initial area image according to the image description information to obtain the target area image, which is not limited.
  • S104 retrieve target content based on the target area image.
  • embodiments of the present disclosure can effectively combine RPA and AI to realize intelligent automation (IA) of the image retrieval process, thereby effectively improving the automation of image retrieval and reducing labor costs.
  • the target content refers to the content obtained by retrieving the target area image as the retrieval reference basis during the image retrieval process.
  • the target content can be, for example, the retrieved picture or the text describing the retrieved picture. , audio and video, etc., there are no restrictions on this.
  • an image retrieval database can be obtained in advance, and the image retrieval database can contain the above target content, so as to achieve retrieval of similar images in the image retrieval database based on the target area image.
  • the target content can be retrieved based on the target area image.
  • the classification feature information of the target area image can be determined, and then the target content can be retrieved based on the classification feature information.
  • a retrieval learning model can be pre-trained.
  • the retrieval learning model can perform feature analysis of the target area image, and perform retrieval operations on the image retrieval library based on the obtained feature analysis results, or any other possibility can be used.
  • the method retrieves the target content based on the target area image, without any restrictions.
  • the initial image is obtained based on Robotic Process Automation RPA technology
  • the initial area image is intercepted from the initial image based on artificial intelligence AI technology
  • the initial area image is processed according to the image description information
  • the target area image is obtained
  • the target area image is retrieved based on the target area image.
  • Target content can use RPA combined with artificial intelligence AI to realize intelligent automation IA of image retrieval. It can preprocess images in time before image retrieval to remove interference information in the image and effectively improve the performance of the obtained target area image in the retrieval process. Targeted, effectively reduce the impact of interference information on the retrieval process, thereby effectively improving image retrieval efficiency and the accuracy of image retrieval results.
  • Figure 2 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure.
  • the image retrieval method that combines RPA and AI to implement IA includes:
  • S202 Determine the image scale information of the initial image.
  • the image scale information may refer to relevant information used to describe the scale of the initial image, such as the size, area, etc. of the initial image, which is not limited.
  • the image scale information of the initial image can be determined, and the pre-trained image scale algorithm model can be used to perform algorithm analysis on the initial image to obtain the pre-trained image scale algorithm model.
  • the output scale information is used as the image scale information of the initial image.
  • the obtained image scale information can effectively characterize the feature information of the initial image from the scale dimension.
  • the pixel feature information can be used to describe the feature information of the pixels contained in the initial image, such as the number of pixels, hue, etc., and there is no limit to this.
  • the pixel feature information of the initial image can also be determined, and a convolutional neural network can be used to perform feature analysis and processing on the initial image to obtain the pixel features output by the convolutional neural network.
  • the information is used as the pixel feature information of the initial image, and the obtained pixel feature information can effectively characterize the feature information of the initial image from the dimension of pixel features.
  • the processing parameter information may refer to information such as expansion factor, fill color, brightness, hue, saturation, and sharpening degree specified in advance for the initial image, and there is no limit to this.
  • the processing parameter information specified for the initial image can also be determined.
  • the processing parameter information can be configured based on the user configuration instructions, or it can also be determined in advance.
  • the template information of the target area image is then analyzed and compared with the initial image based on the template information, and the processing parameter information applicable to the initial image is determined based on the obtained comparison results. There is no limit to this.
  • S205 Use image scale information, and/or pixel feature information, and/or processing parameter information as image description information.
  • the image scale information, and/or pixel feature information, and/or processing parameter information of the initial image can be acquired, and one or more of them can be used as the image Description information, and the obtained image description information can be used as a reference for subsequent initial region image processing.
  • the image scale information of the initial image by determining the image scale information of the initial image, and/or determining the pixel feature information of the initial image, and/or determining the processing parameter information specified for the initial image, and using the image scale information, and/or the pixel feature information , and/or process parameter information as image description information, thereby enabling the obtained image description information to characterize the relevant information of the initial image from multiple feature dimensions, making the image description information applicable to different image preprocessing scenarios.
  • the image description information processes the initial area image, it can achieve multi-dimensional effective processing, effectively improve the reference value of the obtained target area image in the image retrieval process, thereby effectively improving the flexibility of the image retrieval process.
  • S206 Call the natural language processing NLP service to identify the subject information in the initial image.
  • Natural Language Processing (NLP) services take language as the object and use computer technology to analyze, understand and process natural language. That is, the computer is used as a powerful tool for language research, and language information is processed with the support of the computer. Conduct quantitative research and provide language description services that can be used between humans and computers.
  • NLP Natural Language Processing
  • the subject refers to the main description object contained in the initial image, such as the person in the portrait picture, the car in the car display picture, etc., and there is no limit to this.
  • the main information refers to the relevant information of the retrieval object contained in the initial image, such as the position information and area information of the retrieval object in the initial image, and there is no limit to this.
  • the natural language processing NLP service can be called to identify the subject information in the initial image.
  • the image can be used
  • the subject detection algorithm detects the subject information in the initial image, or it can also annotate the subject of the image through human-computer collaboration to obtain the subject information in the initial image, thereby reducing the cost of annotation, and there is no restriction on this.
  • S207 According to the subject information, determine that the subject corresponds to the position description information in the initial image.
  • the position description information refers to relevant information that can be used to describe the location of the subject in the initial image, such as the distribution and proportion of the subject in the initial image, etc., and there is no limit to this.
  • the natural language processing NLP service after calling the natural language processing NLP service to identify the subject information in the initial image, it can be determined based on the subject information that the subject corresponds to the position description information in the initial image.
  • the obtained position description information can be used for subsequent processing from the initial image.
  • the regional image corresponding to the location description information is intercepted from the image to provide a reliable reference basis.
  • S208 Intercept the area image corresponding to the position description information from the initial image as the initial area image.
  • the initial area image refers to a partial image intercepted from the initial image using artificial intelligence-based AI technology.
  • the initial area image may include a target area image.
  • the region image corresponding to the position description information can be intercepted from the initial image as the initial region image, and the region image corresponding to the position description information can be intercepted in the initial image according to the position description information. Annotate and then crop based on the annotation information to obtain the initial region image.
  • the natural language processing NLP service is called to identify the subject information in the initial image. Based on the subject information, it is determined that the subject corresponds to the position description information in the initial image, and the area corresponding to the position description information is intercepted from the initial image. The image is used as the initial area image.
  • the initial image may contain interference information other than the subject information, and this interference information may affect the efficiency and accuracy of the retrieval process, when calling the natural language processing NLP service to identify the subject in the initial image information, and determine that the subject corresponds to the position description information in the initial image based on the subject information, so that the obtained position description information can effectively represent the position information of the subject in the initial image, and then intercept the area image corresponding to the position description information from the initial image As an initial area image, the interference information in the obtained initial area image can be effectively reduced, thereby improving the accuracy of the obtained initial area image in representing subject information.
  • S209 Process the initial area image according to the image description information to obtain the target area image.
  • S210 Determine the semantic representation information of the target area image.
  • semantic representation information refers to information that can be used to characterize the image-related features of the target area.
  • the target area image can have image features such as color features, contour features, linear features, center features, diagonal features, texture features, local features, and shape features, and there are no restrictions on this.
  • the semantic representation information may refer to information that characterizes one or more of the above image features.
  • the semantic representation information of the target area image is determined by pre-training a feature extractor for the target area image, and then inputting the target area image into the feature extractor to obtain a feature vector of one or more dimensions.
  • Representation information, and the obtained feature vector representation information of one or more dimensions is used as the semantic representation information of the target area image, or any other possible method can be used to determine the semantic representation information of the target area image, without limitation.
  • determining the semantic representation information of the target area image may include identifying the target object outline from the target area image, determining the object outline information according to the target object outline, processing the object outline information, and obtaining the outline vector representation, Contour vector representation is used as semantic representation information. Since the object contour can effectively represent the characteristic information of the object, when the object contour information is determined based on the target object contour and the object contour information is processed to obtain the contour vector representation, the contour vector representation can be effectively improved. Representation effect, and then using the contour vector representation as semantic representation information can effectively improve the applicability of the obtained semantic representation information in the image retrieval process.
  • the target object outline refers to the outline of the retrieval target object contained in the target area image, such as the human body outline in the person image and the car outline lights in the car image. There is no limit to this.
  • the object contour information refers to the relevant information obtained based on the contour of the target object.
  • the contour vector representation refers to a vector representation that can represent the contour information of an object and is mapped in a vector space.
  • the vector representation can be a feature obtained by mapping features to a vector space, such as a contour feature.
  • obtaining the contour vector representation may include performing operations such as dimensionality reduction, whitening, and pooling on the target area image, extracting the contour features of the subject in the target area image, and mapping them into the vector space to obtain the contour.
  • Operations such as dimensionality reduction, whitening, and pooling on the target area image, extracting the contour features of the subject in the target area image, and mapping them into the vector space to obtain the contour.
  • Vector representation may include performing operations such as dimensionality reduction, whitening, and pooling on the target area image, extracting the contour features of the subject in the target area image, and mapping them into the vector space to obtain the contour.
  • the visual neural network when determining the semantic representation information of the target area image, can be used to identify the target object outline from the target area image, determine the object outline information according to the target object outline, and process the object outline information to obtain Contour vector representation, using contour vector representation as semantic representation information.
  • the target content can be retrieved according to the semantic representation information, and the semantic representation information can be used as a reference basis to retrieve pictures that meet the above semantic representation information in the image retrieval database. to get the target content.
  • retrieving the target content according to the semantic representation information may be to determine the candidate similarity level corresponding to the semantic representation information, where the candidate similarity level belongs to a pre-built graph data structure, and the candidate similarity level The level is the level to which the similarity between the corresponding represented content and the initial image belongs. Then the content represented by the candidate similarity level in the graph data structure is used as the target content. Since there may be a large amount of data in the retrieval database, when determining the The candidate similarity level corresponding to the semantic representation information, and using the content represented by the candidate similarity level in the graph data structure as the target content, can greatly reduce the computational cost of the retrieval process and effectively improve the retrieval efficiency.
  • the candidate similarity level refers to the level of similarity between the content represented by the semantic representation information and the initial image in the graph data structure.
  • the graph data structure refers to the data structure established in advance in the vector retrieval library using vector distance as the basis for division.
  • the graph data structure can be used to use vector distance as the reference basis to find candidate similarities during the image retrieval process. hierarchies to narrow the search scope.
  • candidate similarity levels corresponding to the semantic representation information when retrieving target content according to the semantic representation information, candidate similarity levels corresponding to the semantic representation information can be determined, where the candidate similarity levels belong to a pre-constructed graph data structure, and the candidate similarity levels, is the level to which the similarity between the corresponding represented content and the initial image belongs, and then the content represented by the candidate similarity level in the graph data structure is used as the target content.
  • the semantic representation information of the target area image is determined, and the target content is retrieved based on the semantic representation information. Since the semantic representation information can effectively characterize the relevant features of the target area image, when the target content is retrieved based on the semantic representation information, the target content can be effectively retrieved. Improving the pertinence and purpose of the search process can effectively improve the reliability of search results.
  • the image scale information of the initial image by determining the image scale information of the initial image, and/or determining the pixel feature information of the initial image, and/or determining the processing parameter information specified for the initial image, and using the image scale information, and/or the pixel feature information , and/or process parameter information as image description information, thereby enabling the obtained image description information to characterize the relevant information of the initial image from multiple feature dimensions, making the image description information applicable to different image preprocessing scenarios.
  • the image description information processes the initial area image, it can achieve multi-dimensional effective processing, effectively improve the reference value of the obtained target area image in the image retrieval process, thereby effectively improving the flexibility of the image retrieval process.
  • the natural language processing NLP service By calling the natural language processing NLP service to identify the subject information in the initial image, based on the subject information, determine that the subject corresponds to the location description information in the initial image, and intercept the area image corresponding to the location description information from the initial image as the initial area image , since the initial image may contain interference information other than the subject information, and this interference information may affect the efficiency and accuracy of the retrieval process, when calling the natural language processing NLP service to identify the subject information in the initial image, and based on the subject
  • the information determines that the subject corresponds to the position description information in the initial image, so that the obtained position description information can effectively represent the position information of the subject in the initial image, and then intercepts the area image corresponding to the position description information from the initial image as the initial area image, It can effectively reduce the interference information in the obtained initial area image, thereby improving the accuracy of the obtained initial area image to represent the subject information.
  • the retrieval process can be effectively improved. sex and purpose, which can effectively improve the reliability of search results.
  • the object contour can effectively represent the characteristic information of the object
  • the representation effect of the contour vector representation can be effectively improved, and then the contour vector representation is used as Semantic representation information can effectively improve the applicability of the obtained semantic representation information in the image retrieval process.
  • FIG. 3 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure.
  • the image retrieval method that combines RPA and AI to implement IA includes:
  • S304 Intercept the initial area image from the initial image based on artificial intelligence AI technology.
  • the initial area image can be enlarged according to the image scale information, so that the image obtained after processing
  • the scale of is equal to the scale of the initial image or any other scale suitable for the image retrieval process, and there is no restriction on this.
  • the enlarged image can be used as the target area image.
  • the image scale information is used as image description information
  • the initial area image is expanded according to the image scale information
  • the expanded image is used as the target area image. Since the image is intercepted from the initial image based on artificial intelligence AI technology The scale of the obtained initial area image may be low.
  • the initial area image is expanded according to the image scale information, and the enlarged image is used as the target area image, it can effectively avoid the initial area image being too low and affecting the retrieval effect. It can effectively improve the reliability of the obtained target area image as a retrieval basis.
  • S307 retrieve target content based on the target area image.
  • the initial area image is enlarged according to the image scale information, and the enlarged image is used as the target area image. Since the scale of the initial area image intercepted from the initial image based on artificial intelligence AI technology may be relatively large, Low, when the initial area image is enlarged according to the image scale information, and the enlarged image is used as the target area image, it can effectively avoid the initial area image scale being too low and affect the retrieval effect, and can effectively improve the resulting target area image as a retrieval The reliability of the basis.
  • FIG. 4 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure.
  • the image retrieval method that combines RPA and AI to implement IA includes:
  • S402 Determine the pixel feature information of the initial image.
  • S404 Intercept the initial area image from the initial image based on artificial intelligence AI technology.
  • regional pixels refer to pixels corresponding to one or more image regions in the initial image.
  • the area pixels in the initial image area can be obtained.
  • regional pixel features refer to the relevant features of regional pixels obtained based on pixel feature information, such as the number and color of regional pixels.
  • matching processing can be performed based on the above-mentioned pixel feature information and the regional pixels to analyze and obtain the regional pixel features of the regional pixels.
  • S407 Enhance the regional pixel features of each area pixel in the initial image area to obtain the target area image.
  • the regional pixel features of each regional pixel in the initial image area can be enhanced to improve the recognition of the regional pixel features of each regional pixel. , get the target area image.
  • the pixel feature information is used as the image description information.
  • the regional pixel features of the regional pixels are analyzed from the pixel feature information, and the regional pixel features of each regional pixel in the initial image region are analyzed. Enhancement processing is performed to obtain the target area image. Since the intensity of regional pixel features may affect the image retrieval effect, when the regional pixel features of each regional pixel in the initial image area are enhanced, the representation of the subject image of the obtained target area image can be effectively improved. capabilities, thereby improving the pertinence and accuracy of the image retrieval process.
  • S408 retrieve the target content according to the target area image.
  • the target area image is obtained. Since The strength of regional pixel features may affect the image retrieval effect. When the regional pixel features of each regional pixel in the initial image area are enhanced, the representation ability of the obtained target area image to the subject image can be effectively improved, thereby improving the targeting of the image retrieval process. sex and accuracy.
  • FIG. 5 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure.
  • the image retrieval method that combines RPA and AI to implement IA includes:
  • S501 Obtain the initial image based on Robotic Process Automation (RPA) technology.
  • RPA Robotic Process Automation
  • S502 Determine the image scale information of the initial image.
  • S503 Determine the pixel feature information of the initial image.
  • S504 Use image scale information and pixel feature information as image description information.
  • S505 Intercept the initial area image from the initial image based on artificial intelligence AI technology.
  • S506 Expand the initial area image according to the image scale information to obtain an image of the area to be filled, where the image of the area to be filled includes: pixels to be filled.
  • the image of the area to be filled refers to the image obtained by enlarging the initial area image based on the image scale information.
  • the pixels to be filled refer to the pixels in the image of the area to be filled that need to be filled.
  • the initial area image after using the image scale information and pixel feature information as image description information, and intercepting the initial area image from the initial image based on artificial intelligence AI technology, the initial area image can be enlarged according to the image scale information to obtain For the area image to be filled, the initial area image can be enlarged according to the image scale information, and the scale of the initial area image is adjusted to the scale of the initial image or any other scale value suitable for the image retrieval process, and the initial area image after the enlargement process is The area image is used as the area image to be filled.
  • S507 Parse the first pixel feature of the pixel to be filled from the pixel feature information.
  • the first pixel feature refers to the relevant feature of the pixel to be filled that is obtained based on the pixel feature information.
  • the first pixel feature of the pixel to be filled can be analyzed from the pixel feature information, and the first pixel feature of the pixel to be filled can be combined based on the pixel feature information.
  • the pixels are matched, and the feature information in the pixel feature information that matches the pixel to be filled is used as the first pixel feature.
  • S508 Parse the second pixel features of the regional pixels in other regional images from the pixel feature information, where the initial regional image and other regional images together constitute the initial image.
  • the second pixel feature refers to the relevant features of regional pixels in other regional images obtained based on pixel feature information.
  • the second pixel feature of the regional pixels in the image of other regions can be parsed from the pixel feature information, and other regions can be combined based on the pixel feature information.
  • the regional pixels in the image are matched, and the characteristic information in the pixel feature information that matches the regional pixels in other regional images is used as the second pixel feature.
  • S509 Generate filling pixel features based on the first pixel feature and the second pixel feature.
  • the filling pixel feature refers to the pixel feature obtained based on the first pixel feature and the second pixel feature.
  • the filling pixel feature can be used as a reference for filling the image of the area to be filled.
  • the method can be based on the first pixel feature and the second Pixel features are used to generate filled pixel features.
  • a pre-trained machine learning model can be used to analyze the first pixel feature and the second pixel feature to generate filled pixel features.
  • S510 Perform filling processing on the image of the area to be filled according to the characteristics of the filling pixels to obtain the target area image.
  • the to-be-filled area image can be filled according to the filling pixel characteristics to obtain the target area image, and the pixels to be filled can be determined based on the filling pixel characteristics. Then, the area image to be filled is filled based on the pixels to be filled, so as to obtain the target area image.
  • the image scale information and pixel feature information are used as image description information.
  • the initial area image is expanded according to the image scale information to obtain the area image to be filled.
  • the first pixel of the pixel to be filled is parsed from the pixel feature information.
  • Features, analyze the second pixel features of area pixels in other area images from the pixel feature information generate filling pixel features based on the first pixel features and second pixel features, fill the area image to be filled according to the filling pixel features, and obtain the target Therefore, while ensuring that the size of the obtained area image to be filled conforms to the normal image size, it can be filled by combining the first pixel feature and the second pixel feature to avoid affecting the representation of the image due to the enlargement process. , which can greatly improve the representation effect of the obtained target area image.
  • S511 retrieve target content based on the target area image.
  • the initial region image is expanded according to the image scale information to obtain the region image to be filled
  • the first pixel feature of the pixel to be filled is parsed from the pixel feature information
  • the regions in other region images are parsed from the pixel feature information.
  • the second pixel feature of the pixel is used to generate a filling pixel feature based on the first pixel feature and the second pixel feature.
  • the to-be-filled area image is filled according to the filling pixel feature to obtain the target area image.
  • the obtained area to be filled can be guaranteed. While the size of the image conforms to the normal image size, the first pixel feature and the second pixel feature are combined to fill it to avoid affecting the representation of the image due to the enlargement process, thereby greatly improving the quality of the obtained target area image. representation effect.
  • FIG. 6 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure.
  • the image retrieval method that combines RPA and AI to implement IA includes:
  • S601 Obtain the initial image based on Robotic Process Automation (RPA) technology.
  • RPA Robotic Process Automation
  • S602 Determine the processing parameter information specified for the initial image.
  • S604 Intercept the initial area image from the initial image based on artificial intelligence AI technology.
  • S605 Process the initial area image according to the processing parameter information to obtain the target area image.
  • the processing parameter information refers to the expansion factor, fill color, brightness, hue, saturation, sharpening degree, etc. specified in advance for the initial image, and there is no limit to this.
  • the initial area image can be processed according to the processing parameter information to obtain the target area image, which can be predetermined Based on the processing parameter information of the initial image (such as the specified expansion factor, fill color, brightness, hue, saturation, sharpening degree, etc.), and then adjust the corresponding parameters of the initial area image based on the processing parameter information to obtain Target area image.
  • the processing parameter information of the initial image such as the specified expansion factor, fill color, brightness, hue, saturation, sharpening degree, etc.
  • the processing parameter information is used as the image description information, and the target area image is obtained by processing the initial area image according to the processing parameter information. Since the processing parameter information can be correspondingly configured according to the user configuration instructions, when the initial area image is processed based on the processing parameter information, The regional image can flexibly process the initial regional image according to the application scenario to obtain the target region image suitable for the image retrieval process, thereby effectively improving the flexibility of the initial regional image processing process.
  • S606 retrieve target content based on the target area image.
  • the target area image is obtained by processing the initial area image according to the processing parameter information. Since the processing parameter information can be correspondingly configured according to the user configuration instructions, when the initial area image is processed based on the processing parameter information, the initial area image can be processed according to the application scenario.
  • the regional image is flexibly processed to obtain a target region image suitable for the image retrieval process, thereby effectively improving the flexibility of the initial region image processing process.
  • Figure 7 is a schematic structural diagram of an image retrieval device that combines RPA and AI to implement IA proposed by an embodiment of the present disclosure.
  • the image retrieval device 70 that combines RPA and AI to implement IA is applied in the field of natural language processing NLP, including:
  • the acquisition module 701 is used to acquire an initial image based on robotic process automation RPA technology, where the initial image has image description information;
  • the first processing module 702 is used to intercept the initial area image from the initial image based on artificial intelligence AI technology
  • the second processing module 703 is used to process the initial area image according to the image description information to obtain the target area image;
  • the retrieval module 704 is used to retrieve target content according to the target area image.
  • the first processing module 702 is specifically used to:
  • the subject information determine that the subject corresponds to the position description information in the initial image
  • a region image corresponding to the position description information is intercepted from the initial image as the initial region image.
  • Figure 8 is a schematic structural diagram of an image retrieval device that combines RPA and AI to implement IA proposed by another embodiment of the present disclosure.
  • the image retrieval device also includes:
  • Determining module 705 used to determine the image scale information of the initial image; and/or determine the pixel feature information of the initial image; and/or determine the processing parameter information specified for the initial image; and combine the image scale information, and/or the pixel feature information , and/or process parameter information as image description information.
  • the image description information includes: image scale information
  • the second processing module 703 is specifically used for:
  • the enlarged image is used as the target area image.
  • the image description information includes: pixel feature information
  • the second processing module 703 is also used for:
  • the regional pixel features of each area pixel in the initial image area are enhanced to obtain the target area image.
  • the image description information includes: image scale information and pixel feature information
  • the second processing module 703 is also used for:
  • the initial area image is expanded according to the image scale information to obtain the area image to be filled, where the area image to be filled includes: pixels to be filled;
  • the image of the area to be filled is filled according to the characteristics of the filled pixels to obtain the target area image.
  • the image description information includes: processing parameter information;
  • the second processing module 703 is also used for:
  • the initial area image is processed according to the processing parameter information to obtain the target area image.
  • the retrieval module 704 includes:
  • Determination sub-module 7041 used to determine the semantic representation information of the target area image
  • the retrieval sub-module 7042 is used to retrieve target content based on semantic representation information.
  • the determination sub-module 7041 is specifically used for:
  • the search sub-module 7042 is specifically used for:
  • the content represented by the candidate similarity level in the graph data structure is used as the target content.
  • the present disclosure also provides an image retrieval device that combines RPA and AI to implement IA. Since the embodiment of the disclosure provides an image retrieval method that combines RPA with AI The image retrieval device that implements IA with AI corresponds to the image retrieval method that combines RPA and AI to implement IA provided in the above embodiments of Figures 1 to 6. Therefore, the implementation of the image retrieval method that combines RPA and AI to implement IA is also applicable to The image retrieval device that combines RPA and AI to implement IA provided by the embodiment of the present disclosure will not be described in detail in the embodiment of the present disclosure.
  • the initial image is obtained based on Robotic Process Automation RPA technology
  • the initial area image is intercepted from the initial image based on artificial intelligence AI technology
  • the initial area image is processed according to the image description information
  • the target area image is obtained
  • the target area image is retrieved based on the target area image.
  • Target content can use RPA combined with artificial intelligence AI to realize intelligent automation IA of image retrieval. It can preprocess images in time before image retrieval to remove interference information in the image and effectively improve the performance of the obtained target area image in the retrieval process. Targeted, effectively reduce the impact of interference information on the retrieval process, thereby effectively improving image retrieval efficiency and the accuracy of image retrieval results.
  • the present disclosure also proposes an electronic device, including: a memory, a processor, and a computer program stored in the memory and executable on the processor.
  • the processor executes the program, the aforementioned embodiments of the present disclosure are implemented.
  • the proposed image retrieval method combines RPA and AI to realize IA.
  • FIG. 9 shows a structural block diagram of an electronic device according to an embodiment of the present disclosure.
  • the electronic device 90 includes: a memory 910 and a processor 920 .
  • the memory 910 stores a computer program that can run on the processor 920 .
  • the processor 920 executes the computer program, it implements the image retrieval method for implementing IA by combining RPA and AI in the above embodiment.
  • the number of memory 910 and processor 920 may be one or more.
  • the electronic device 90 also includes:
  • the communication interface 930 is used to communicate with external devices and perform data interactive transmission.
  • the bus can be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus, etc.
  • ISA Industry Standard Architecture
  • PCI Peripheral Component Interconnect
  • EISA Extended Industry Standard Architecture
  • the bus can be divided into address bus, data bus, control bus, etc. For ease of presentation, only one thick line is used in Figure 9, but it does not mean that there is only one bus or one type of bus.
  • the memory 910, the processor 920 and the communication interface 930 are integrated on one chip, the memory 910, the processor 920 and the communication interface 930 can communicate with each other through the internal interface.
  • Embodiments of the present disclosure provide a computer-readable storage medium, which stores a computer program. When the program is executed by a processor, the method provided in the embodiment of the present disclosure is implemented.
  • An embodiment of the present disclosure also provides a chip, which includes a processor for calling and running instructions stored in the memory, so that the communication device installed with the chip executes the method provided by the embodiment of the present disclosure.
  • Embodiments of the present disclosure also provide a chip, including: an input interface, an output interface, a processor, and a memory.
  • the input interface, the output interface, the processor, and the memory are connected through an internal connection path.
  • the processor is used to execute the code in the memory. , when the code is executed, the processor is used to execute the method provided by the application embodiment.
  • processor can be a central processing unit (Central Processing Unit, CPU), or other general-purpose processor, digital signal processor (Digital Signal Processing, DSP), application specific integrated circuit (Application Specific Integrated Circuit, ASIC), Field Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • CPU Central Processing Unit
  • DSP Digital Signal Processing
  • ASIC Application Specific Integrated Circuit
  • FPGA Field Programmable Gate Array
  • a general-purpose processor can be a microprocessor or any conventional processor, etc. It is worth noting that the processor may be a processor that supports Advanced RISC Machines (ARM) architecture.
  • ARM Advanced RISC Machines
  • the above-mentioned memory may include read-only memory and random access memory, and may also include non-volatile random access memory.
  • the memory may be volatile memory or non-volatile memory, or may include both volatile and non-volatile memory.
  • non-volatile memory can include read-only memory (Read-Only Memory, ROM), programmable read-only memory (Programmable ROM, PROM), erasable programmable read-only memory (Erasable PROM, EPROM), electrically removable memory. Erase programmable read-only memory (Electrically EPROM, EEPROM) or flash memory.
  • Volatile memory may include Random Access Memory (RAM), which acts as an external cache.
  • RAM static random access memory
  • DRAM dynamic random access memory
  • DRAM synchronous dynamic random access memory
  • SDRAM double data rate synchronous dynamic random access Memory
  • Double Data Date SDRAM, DDR SDRAM enhanced synchronous dynamic random access memory
  • Enhanced SDRAM, ESDRAM synchronous link dynamic random access memory
  • Direct Rambus RAM Direct Rambus RAM
  • a computer program product includes one or more computer instructions.
  • Computer program instructions When computer program instructions are loaded and executed on a computer, processes or functions in accordance with the present disclosure are produced, in whole or in part.
  • the computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable device.
  • Computer instructions may be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium.
  • references to the terms “one embodiment,” “some embodiments,” “an example,” “specific examples,” or “some examples” or the like means that specific features are described in connection with the embodiment or example.
  • structures, materials, or features are included in at least one embodiment or example of the present disclosure.
  • the specific features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
  • those skilled in the art may combine and combine different embodiments or examples and features of different embodiments or examples described in this specification unless they are inconsistent with each other.
  • first and second are used for descriptive purposes only and cannot be understood as indicating or implying relative importance or implicitly indicating the quantity of indicated technical features. Thus, features defined as “first” and “second” may explicitly or implicitly include at least one of these features.
  • “plurality” means two or more than two, unless otherwise expressly and specifically limited.
  • the logic and/or steps represented in the flowcharts or otherwise described herein, for example, may be considered a sequenced list of executable instructions for implementing the logical functions, and may be embodied in any computer-readable medium,
  • instruction execution systems, devices or equipment such as computer-based systems, systems including processors or other systems that can fetch instructions from and execute instructions from the instruction execution system, device or equipment), or in combination with these instruction execution systems, devices or equipment.
  • various parts of the present disclosure may be implemented in hardware, software, firmware, or combinations thereof.
  • various steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution system. All or part of the steps of the method in the above embodiment can be completed by instructing relevant hardware through a program.
  • the program can be stored in a computer-readable storage medium. When executed, the program includes one of the steps of the method embodiment or other steps. combination.
  • each functional unit in various embodiments of the present disclosure may be integrated into one processing module, each unit may exist physically alone, or two or more units may be integrated into one module.
  • the above integrated modules can be implemented in the form of hardware or software function modules. If the above integrated modules are implemented in the form of software function modules and sold or used as independent products, they can also be stored in a computer-readable storage medium.
  • the storage medium can be a read-only memory, a magnetic disk or an optical disk, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Mathematical Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present disclosure provides an image retrieval method and apparatus for achieving intelligent automation (IA) by combining robotic process automation (RPA) and artificial intelligence (AI), and an electronic device. The method comprises: obtaining an initial image on the basis of RPA technology, wherein the initial image has image description information; intercepting an initial area image from the initial image on the basis of AI technology; processing the initial area image according to the image description information to obtain a target area image; and retrieving target content according to the target area image. According to the present disclosure, IA of image retrieval can be realized by combining RPA and AI, the image can be preprocessed in time before image retrieval, interference information in the image is removed, the pertinence of the obtained target area image in the retrieval process is effectively improved, and the influence of interference information on the retrieval process is effectively reduced, thereby effectively improving the image retrieval efficiency and the accuracy of the image retrieval result.

Description

结合RPA和AI实现IA的图像检索方法、装置及电子设备Image retrieval methods, devices and electronic equipment that combine RPA and AI to realize IA
相关申请的交叉引用Cross-references to related applications
本申请基于申请号为202210600925.4、申请日为2022年05月30日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本申请作为参考。This application is filed based on a Chinese patent application with application number 202210600925.4 and a filing date of May 30, 2022, and claims the priority of the Chinese patent application. The entire content of the Chinese patent application is hereby incorporated by reference into this application.
技术领域Technical field
本公开涉及计算机技术领域,尤其涉及一种结合RPA(Robotic Process Automation,机器人流程自动化)和AI(Artificial Intelligence,人工智能)实现IA(Intelligent Automation,智能自动化)的图像检索方法、装置及电子设备。The present disclosure relates to the field of computer technology, and in particular to an image retrieval method, device and electronic equipment that combines RPA (Robotic Process Automation, Robotic Process Automation) and AI (Artificial Intelligence, Artificial Intelligence) to realize IA (Intelligent Automation, Intelligent Automation).
背景技术Background technique
机器人流程自动化(Robotic Process Automation)简称RPA,是通过特定的“机器人软件”,模拟人在计算机上的操作,按规则自动执行流程任务。Robotic Process Automation, referred to as RPA, uses specific "robot software" to simulate human operations on a computer and automatically execute process tasks according to rules.
人工智能(Artificial Intelligence,AI)是研究、开发用于模拟、延伸和扩展人的智能的理论、方法、技术及应用系统的一门技术科学。Artificial Intelligence (AI) is a technical science that studies and develops theories, methods, technologies and application systems for simulating, extending and expanding human intelligence.
智能自动化(Intelligent Automation,IA)是一系列从机器人流程自动化到人工智能的技术总称,将RPA与光学字符识别(Optical Character Recognition,OCR)、智能字符识别(Intelligent Character Recognition,ICR)、流程挖掘(Process Mining)、深度学习(Deep Learning,DL)、机器学习(Machine Learning,ML)、自然语言处理(Natural Language Processing,NLP)、语音识别(Automatic Speech Recognition,ASR)、语音合成(Text To Speech,TTS)、计算机视觉(Computer Vision,CV)等多种AI技术相结合,以创建能够思考、学习及自适应的端到端的业务流程,涵盖从流程发现、流程自动化,到通过自动而持续的数据收集、理解数据的含义,使用数据来管理和优化业务流程的整个历程。Intelligent Automation (IA) is a general term for a series of technologies from robotic process automation to artificial intelligence. It combines RPA with Optical Character Recognition (OCR), Intelligent Character Recognition (ICR), process mining ( Process Mining), Deep Learning (Deep Learning, DL), Machine Learning (ML), Natural Language Processing (NLP), Speech Recognition (Automatic Speech Recognition, ASR), Speech Synthesis (Text To Speech, TTS), Computer Vision (CV) and other AI technologies are combined to create end-to-end business processes that can think, learn and adapt, covering process discovery, process automation, and automatic and continuous data collection Collect and understand the meaning of data, and use data to manage and optimize the entire process of business processes.
相关技术中,当原始图像中主体部分占比较小时,采用人工标注对图像向量算法进行训练,或者,采用图像元信息的混合检索模式,以进行图像检索,导致图像检索的处理方式较为复杂,需要耗费较高的人工成本,且无法有效地保障图像检索效果。In related technologies, when the main part of the original image is relatively small, manual annotation is used to train the image vector algorithm, or a hybrid retrieval mode of image meta-information is used to perform image retrieval, resulting in a more complex image retrieval processing method that requires It consumes high labor costs and cannot effectively guarantee the image retrieval effect.
发明内容Contents of the invention
本公开旨在至少在一定程度上解决相关技术中的技术问题之一。The present disclosure aims to solve one of the technical problems in the related art, at least to a certain extent.
为此,本公开的目的在于提出一种结合RPA和AI实现IA的图像检索方法、装置、电子设备及存储介质,能够利用RPA结合人工智能AI实现图像检索的智能自动化IA,能够在图像检索之前及时地对图像进行预处理,以去除图像中的干扰信息,有效提升所得目标区域图像在检索过程中的针对性,有效降低干扰信息对检索过程的影响,从而有效提升图像检索效率和图像检索结果的准确性。To this end, the purpose of this disclosure is to propose an image retrieval method, device, electronic device and storage medium that combines RPA and AI to realize IA, which can use RPA combined with artificial intelligence AI to realize intelligent automated IA for image retrieval, and can achieve image retrieval before image retrieval. Preprocess the image in a timely manner to remove interference information in the image, effectively improve the pertinence of the obtained target area image in the retrieval process, effectively reduce the impact of interference information on the retrieval process, thereby effectively improving image retrieval efficiency and image retrieval results. accuracy.
本公开第一方面实施例提出的结合RPA和AI实现IA的图像检索方法,包括:基于机器人流程自动化RPA技术获取初始图像,其中,初始图像具有图像描述信息;基于人工智能AI技术从初始图像中截取初始区域图像;根据图像描述信息处理初始区域图像,得到目标区域图像;根据目标区域图像,检索目标内容。The image retrieval method that combines RPA and AI to implement IA proposed by the embodiment of the first aspect of the present disclosure includes: obtaining an initial image based on robotic process automation RPA technology, where the initial image has image description information; and obtaining an initial image from the initial image based on artificial intelligence AI technology. Intercept the initial area image; process the initial area image according to the image description information to obtain the target area image; retrieve the target content based on the target area image.
在一种实施方式中,基于人工智能AI技术从初始图像中截取初始区域图像,包括:调用自然语言处理NLP服务,以识别初始图像中的主体信息;根据主体信息,确定主体对应于初始图像中的位置描述信息;从初始图像中截取与位置描述信息对应的区域图像作为初始区域图像。In one implementation, intercepting the initial area image from the initial image based on artificial intelligence AI technology includes: calling a natural language processing NLP service to identify the subject information in the initial image; and determining, based on the subject information, that the subject corresponds to the subject in the initial image. The location description information; intercept the area image corresponding to the location description information from the initial image as the initial area image.
在一种实施方式中,在基于机器人流程自动化RPA技术获取初始图像之后,所述方法还包括:确定初始图像的图像尺度信息;和/或确定初始图像的像素特征信息;和/或确定针对初始图像指定的处理参数信息;将图像尺度信息、和/或像素特征信息、和/或处理参数信 息作为图像描述信息。In one embodiment, after acquiring the initial image based on Robotic Process Automation (RPA) technology, the method further includes: determining the image scale information of the initial image; and/or determining the pixel feature information of the initial image; and/or determining the initial image for the initial image. Processing parameter information specified by the image; use image scale information, and/or pixel feature information, and/or processing parameter information as image description information.
在一种实施方式中,图像描述信息包括:图像尺度信息;其中,根据图像描述信息处理初始区域图像,得到目标区域图像,包括:根据图像尺度信息对初始区域图像进行扩大处理;将扩大处理后的图像作为目标区域图像。In one embodiment, the image description information includes: image scale information; wherein, processing the initial area image according to the image description information to obtain the target area image includes: enlarging the initial area image according to the image scale information; The image is used as the target area image.
在一种实施方式中,图像描述信息包括:像素特征信息;其中,根据图像描述信息处理初始区域图像,得到目标区域图像,包括:获取初始图像区域中的区域像素;从像素特征信息中解析区域像素的区域像素特征;对初始图像区域中各个区域像素的区域像素特征进行增强处理,得到目标区域图像。In one embodiment, the image description information includes: pixel feature information; wherein, processing the initial area image according to the image description information to obtain the target area image includes: obtaining the area pixels in the initial image area; parsing the area from the pixel feature information Regional pixel features of pixels; enhance the regional pixel features of each regional pixel in the initial image area to obtain the target area image.
在一种实施方式中,图像描述信息包括:图像尺度信息和像素特征信息;其中,根据图像描述信息处理初始区域图像,得到目标区域图像,包括:根据图像尺度信息对初始区域图像进行扩大处理,得到待填充区域图像,其中,待填充区域图像包括:待填充像素;从像素特征信息中解析待填充像素的第一像素特征;从像素特征信息中解析其他区域图像中区域像素的第二像素特征,其中,初始区域图像和其他区域图像共同构成初始图像;根据第一像素特征和第二像素特征,生成填充像素特征;根据填充像素特征对待填充区域图像进行填充处理,得到目标区域图像。In one embodiment, the image description information includes: image scale information and pixel feature information; wherein, processing the initial area image according to the image description information to obtain the target area image includes: enlarging the initial area image according to the image scale information, Obtain an image of the area to be filled, where the image of the area to be filled includes: pixels to be filled; parsing the first pixel features of the pixels to be filled from the pixel feature information; parsing the second pixel features of the regional pixels in other area images from the pixel feature information , wherein the initial area image and other area images together constitute the initial image; a filling pixel feature is generated according to the first pixel feature and the second pixel feature; the to-be-filled area image is filled according to the filling pixel feature to obtain the target area image.
在一种实施方式中,图像描述信息包括:处理参数信息;其中,根据图像描述信息处理初始区域图像,得到目标区域图像,包括:根据处理参数信息处理初始区域图像,得到目标区域图像。In one embodiment, the image description information includes: processing parameter information; wherein processing the initial area image according to the image description information to obtain the target area image includes: processing the initial area image according to the processing parameter information to obtain the target area image.
在一种实施方式中,根据目标区域图像,检索目标内容,包括:确定目标区域图像的语义表征信息;根据语义表征信息,检索目标内容。In one implementation, retrieving target content based on the target area image includes: determining semantic representation information of the target area image; retrieving the target content based on the semantic representation information.
在一种实施方式中,确定目标区域图像的语义表征信息,包括:从目标区域图像识别出目标物体轮廓;根据目标物体轮廓,确定物体轮廓信息;处理物体轮廓信息,得到轮廓向量表征;将轮廓向量表征作为语义表征信息。In one implementation, determining the semantic representation information of the target area image includes: identifying the target object outline from the target area image; determining the object outline information according to the target object outline; processing the object outline information to obtain the outline vector representation; converting the outline Vector representation as semantic representation of information.
在一种实施方式中,根据语义表征信息,检索目标内容,包括:确定与语义表征信息对应的候选相似度层级,其中,候选相似度层级属于预先构建的图状数据结构,候选相似度层级,是其相应所表征内容与初始图像之间的相似程度所属的层级;将图状数据结构中候选相似度层级所表征内容作为目标内容。In one implementation, retrieving the target content according to the semantic representation information includes: determining a candidate similarity level corresponding to the semantic representation information, wherein the candidate similarity level belongs to a pre-constructed graph data structure, the candidate similarity level, It is the level to which the similarity between the corresponding represented content and the initial image belongs; the content represented by the candidate similarity level in the graph data structure is used as the target content.
本公开第二方面实施例提出的结合RPA和AI实现IA的图像检索装置,包括:获取模块,用于基于机器人流程自动化RPA技术获取初始图像,其中,初始图像具有图像描述信息;第一处理模块,用于基于人工智能AI技术从初始图像中截取初始区域图像;第二处理模块,用于根据图像描述信息处理初始区域图像,得到目标区域图像;检索模块,用于根据目标区域图像,检索目标内容。The image retrieval device that combines RPA and AI to implement IA proposed by the embodiment of the second aspect of the present disclosure includes: an acquisition module for acquiring an initial image based on robotic process automation RPA technology, where the initial image has image description information; a first processing module , used to intercept the initial area image from the initial image based on artificial intelligence AI technology; the second processing module is used to process the initial area image according to the image description information to obtain the target area image; the retrieval module is used to retrieve the target according to the target area image content.
在一种实施方式中,第一处理模块,具体用于:调用自然语言处理NLP服务,以识别初始图像中的主体信息;根据主体信息,确定主体对应于初始图像中的位置描述信息;从初始图像中截取与位置描述信息对应的区域图像作为初始区域图像。In one implementation, the first processing module is specifically configured to: call a natural language processing NLP service to identify subject information in the initial image; determine based on the subject information that the subject corresponds to the position description information in the initial image; from the initial The area image corresponding to the position description information is intercepted from the image as the initial area image.
在一种实施方式中,装置还包括:确定模块,用于确定初始图像的图像尺度信息;和/或确定初始图像的像素特征信息;和/或确定针对初始图像指定的处理参数信息;并将图像尺度信息、和/或像素特征信息、和/或处理参数信息作为图像描述信息。In one embodiment, the device further includes: a determining module, configured to determine image scale information of the initial image; and/or determine pixel feature information of the initial image; and/or determine processing parameter information specified for the initial image; and Image scale information, and/or pixel feature information, and/or processing parameter information are used as image description information.
在一种实施方式中,图像描述信息包括:图像尺度信息;其中,第二处理模块,具体用于:根据图像尺度信息对初始区域图像进行扩大处理;将扩大处理后的图像作为目标区域图像。In one embodiment, the image description information includes: image scale information; wherein, the second processing module is specifically configured to: expand the initial region image according to the image scale information; and use the expanded image as the target region image.
在一种实施方式中,图像描述信息包括:像素特征信息;其中,第二处理模块,还用于:获取初始图像区域中的区域像素;从像素特征信息中解析区域像素的区域像素特征;对初始图像区域中各个区域像素的区域像素特征进行增强处理,得到目标区域图像。In one implementation, the image description information includes: pixel feature information; wherein, the second processing module is further configured to: obtain the regional pixels in the initial image region; parse the regional pixel features of the regional pixels from the pixel feature information; The regional pixel features of each area pixel in the initial image area are enhanced to obtain the target area image.
在一种实施方式中,图像描述信息包括:图像尺度信息和像素特征信息;其中,第二处理模块,还用于:根据图像尺度信息对初始区域图像进行扩大处理,得到待填充区域图像, 其中,待填充区域图像包括:待填充像素;从像素特征信息中解析待填充像素的第一像素特征;从像素特征信息中解析其他区域图像中区域像素的第二像素特征,其中,初始区域图像和其他区域图像共同构成初始图像;根据第一像素特征和第二像素特征,生成填充像素特征;根据填充像素特征对待填充区域图像进行填充处理,得到目标区域图像。In one embodiment, the image description information includes: image scale information and pixel feature information; wherein, the second processing module is further configured to: expand the initial region image according to the image scale information to obtain the region image to be filled, where , the region image to be filled includes: pixels to be filled; parsing the first pixel features of the pixels to be filled from the pixel feature information; parsing the second pixel features of the region pixels in other region images from the pixel feature information, where the initial region image and Other area images together constitute the initial image; a filling pixel feature is generated based on the first pixel feature and the second pixel feature; the to-be-filled area image is filled according to the filling pixel feature to obtain the target area image.
在一种实施方式中,图像描述信息包括:处理参数信息;其中,第二处理模块,还用于:根据处理参数信息处理初始区域图像,得到目标区域图像。In one implementation, the image description information includes: processing parameter information; wherein the second processing module is further configured to: process the initial area image according to the processing parameter information to obtain the target area image.
在一种实施方式中,检索模块包括:确定子模块,用于确定目标区域图像的语义表征信息;检索子模块,用于根据语义表征信息,检索目标内容。In one implementation, the retrieval module includes: a determination sub-module, used to determine the semantic representation information of the target area image; a retrieval sub-module, used to retrieve the target content according to the semantic representation information.
在一种实施方式中,确定子模块具体用于:从目标区域图像识别出目标物体轮廓;根据目标物体轮廓,确定物体轮廓信息;处理物体轮廓信息,得到轮廓向量表征;将轮廓向量表征作为语义表征信息。In one implementation, the determination sub-module is specifically used to: identify the target object contour from the target area image; determine the object contour information according to the target object contour; process the object contour information to obtain the contour vector representation; use the contour vector representation as semantic meaning representation information.
在一种实施方式中,检索子模块具体用于:确定与语义表征信息对应的候选相似度层级,其中,候选相似度层级属于预先构建的图状数据结构,候选相似度层级,是其相应所表征内容与初始图像之间的相似程度所属的层级;将图状数据结构中候选相似度层级所表征内容作为目标内容。In one implementation, the retrieval sub-module is specifically used to: determine candidate similarity levels corresponding to semantic representation information, where the candidate similarity levels belong to a pre-constructed graph data structure, and the candidate similarity levels are their corresponding The level that represents the similarity between the content and the initial image; use the content represented by the candidate similarity level in the graph data structure as the target content.
本公开第三方面实施例提出的电子设备,包括:至少一个处理器和存储器;存储器存储计算机执行指令;至少一个处理器执行存储器存储的计算机执行指令,使得至少一个处理器执行本公开第一方面实施例提出的结合RPA和AI实现IA的图像检索方法。The electronic device provided by the embodiment of the third aspect of the present disclosure includes: at least one processor and a memory; the memory stores computer execution instructions; and at least one processor executes the computer execution instructions stored in the memory, so that the at least one processor executes the first aspect of the disclosure. The embodiment proposes an image retrieval method that combines RPA and AI to implement IA.
本公开第四方面实施例提出的计算机可读存储介质,计算机可读存储介质中存储有计算机执行指令,当处理器执行计算机执行指令时,实现本公开第一方面实施例提出的结合RPA和AI实现IA的图像检索方法。The computer-readable storage medium proposed in the embodiment of the fourth aspect of the disclosure has computer-executable instructions stored in the computer-readable storage medium. When the processor executes the computer-executed instructions, the combination of RPA and AI proposed in the embodiment of the first aspect of the disclosure is realized. Implement the image retrieval method of IA.
上述技术方案中的优点或有益效果至少包括:通过基于机器人流程自动化RPA技术获取初始图像,基于人工智能AI技术从初始图像中截取初始区域图像,根据图像描述信息处理初始区域图像,得到目标区域图像,根据目标区域图像,检索目标内容,能够利用RPA结合人工智能AI实现图像检索的智能自动化IA,能够在图像检索之前及时地对图像进行预处理,以去除图像中的干扰信息,有效提升所得目标区域图像在检索过程中的针对性,有效降低干扰信息对检索过程的影响,从而有效提升图像检索效率和图像检索结果的准确性。The advantages or beneficial effects of the above technical solutions at least include: obtaining the initial image based on robotic process automation RPA technology, intercepting the initial area image from the initial image based on artificial intelligence AI technology, processing the initial area image according to the image description information, and obtaining the target area image. , retrieve the target content based on the target area image, and can use RPA combined with artificial intelligence AI to realize intelligent automation IA of image retrieval. It can preprocess the image in time before image retrieval to remove interference information in the image and effectively improve the obtained target. The pertinence of regional images in the retrieval process can effectively reduce the impact of interference information on the retrieval process, thereby effectively improving the image retrieval efficiency and the accuracy of image retrieval results.
上述概述仅仅是为了说明书的目的,并不意图以任何方式进行限制。除上述描述的示意性的方面、实施方式和特征之外,通过参考附图和以下的详细描述,本公开进一步的方面、实施方式和特征将会是容易明白的。The above summary is for illustration purposes only and is not intended to be limiting in any way. In addition to the illustrative aspects, embodiments and features described above, further aspects, embodiments and features of the present disclosure will be readily apparent by reference to the drawings and the following detailed description.
附图说明Description of the drawings
在附图中,除非另外规定,否则贯穿多个附图相同的附图标记表示相同或相似的部件或元素。这些附图不一定是按照比例绘制的。应该理解,这些附图仅描绘了根据本公开的一些实施方式,而不应将其视为是对本公开范围的限制。In the drawings, unless otherwise specified, the same reference numbers refer to the same or similar parts or elements throughout the several figures. The drawings are not necessarily to scale. It should be understood that these drawings depict only some embodiments in accordance with the disclosure and are not to be considered limiting of the scope of the disclosure.
图1是本公开一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图;Figure 1 is a schematic flowchart of an image retrieval method that combines RPA and AI to implement IA proposed by an embodiment of the present disclosure;
图2是本公开另一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图;Figure 2 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure;
图3是本公开另一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图;Figure 3 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure;
图4是本公开另一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图;Figure 4 is a schematic flow chart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure;
图5是本公开另一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图;Figure 5 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure;
图6是本公开另一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图;Figure 6 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure;
图7是本公开一实施例提出的结合RPA和AI实现IA的图像检索装置的结构示意图;Figure 7 is a schematic structural diagram of an image retrieval device that combines RPA and AI to implement IA proposed by an embodiment of the present disclosure;
图8是本公开另一实施例提出的结合RPA和AI实现IA的图像检索装置的结构示意图;Figure 8 is a schematic structural diagram of an image retrieval device that combines RPA and AI to implement IA proposed by another embodiment of the present disclosure;
图9示出根据本公开一实施例的电子设备的结构框图。FIG. 9 shows a structural block diagram of an electronic device according to an embodiment of the present disclosure.
具体实施方式Detailed ways
下面详细描述本公开的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,仅用于解释本公开,而不能理解为对本公开的限制。Embodiments of the present disclosure are described in detail below, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals throughout represent the same or similar elements or elements having the same or similar functions. The embodiments described below with reference to the drawings are exemplary and are only used to explain the present disclosure and are not to be construed as limitations of the present disclosure.
在本公开的描述中,术语“多个”指两个或两个以上。In the description of the present disclosure, the term "plurality" means two or more.
在本公开的描述中,术语“机器人流程自动化(Robotic Process Automation,RPA)”,是指通过机器人应用软件在计算机上按照规则自动执行流程任务。In the description of this disclosure, the term "Robotic Process Automation (RPA)" refers to the automatic execution of process tasks according to rules on a computer through robot application software.
在本公开的描述中,术语“人工智能(Artificial Intelligence,AI)”,是指研究使计算机来模拟人的某些思维过程和智能行为(如学习、推理、思考、规划等)的学科,既有硬件层面的技术也有软件层面的技术。人工智能硬件技术一般包括如传感器、专用人工智能芯片、云计算、分布式存储、大数据处理等技术;人工智能软件技术主要包括计算机视觉技术、语音识别技术、自然语言处理技术,以及机器学习、深度学习、大数据处理技术、知识图谱技术等几大方向。In the description of this disclosure, the term "Artificial Intelligence (AI)" refers to the study of using computers to simulate certain human thinking processes and intelligent behaviors (such as learning, reasoning, thinking, planning, etc.), both There are hardware-level technologies and software-level technologies. Artificial intelligence hardware technology generally includes technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, and big data processing; artificial intelligence software technology mainly includes computer vision technology, speech recognition technology, natural language processing technology, and machine learning, Deep learning, big data processing technology, knowledge graph technology and other major directions.
在本公开的描述中,术语“智能自动化(Intelligent Automation,IA)”,是指一系列从机器人流程自动化到人工智能的技术总称,将RPA与光学字符识别(Optical Character Recognition,OCR)、智能字符识别(Intelligent Character Recognition,ICR)、流程挖掘(Process Mining)、深度学习(Deep Learning,DL)、机器学习(Machine Learning,ML)、自然语言处理(Natural Language Processing,NLP)、语音识别(Automatic Speech Recognition,ASR)、语音合成(Text To Speech,TTS)、计算机视觉(Computer Vision,CV)等多种AI技术相结合,以创建能够思考、学习及自适应的端到端的业务流程,涵盖从流程发现、流程自动化,到通过自动而持续的数据收集、理解数据的含义,使用数据来管理和优化业务流程的整个历程。In the description of this disclosure, the term "Intelligent Automation (IA)" refers to a series of technologies from robotic process automation to artificial intelligence, combining RPA with Optical Character Recognition (OCR), intelligent character Recognition (Intelligent Character Recognition, ICR), Process Mining (Process Mining), Deep Learning (DL), Machine Learning (ML), Natural Language Processing (Natural Language Processing, NLP), Speech Recognition (Automatic Speech) Recognition, ASR), speech synthesis (Text To Speech, TTS), computer vision (CV) and other AI technologies are combined to create an end-to-end business process that can think, learn and adapt, covering everything from processes to The entire process from discovery, process automation, to using data to manage and optimize business processes through automatic and continuous data collection, understanding the meaning of data, and using data.
在本公开的描述中,术语“初始图像”,是指待对其进行检索处理的图像,该初始图像例如可以为交通监控装置拍摄的汽车图像,或者可以为任意种类的包含检索对象的图像,对此不作限制。In the description of this disclosure, the term "initial image" refers to an image to be retrieved. The initial image can be, for example, a car image captured by a traffic monitoring device, or it can be any kind of image containing a retrieval object. There are no restrictions on this.
在本公开的描述中,术语“图像描述信息”,是指对初始图像的一个或多个相关特征进行描述的数据信息,该图像描述信息,可以用于对图像进行多维度的特征描述,而特征例如,尺度特征、像素特征等,该图像描述信息,可以具体例如为初始图像的图像尺度信息、和/或像素特征信息、和/或处理参数信息等,对此不做限制。In the description of this disclosure, the term "image description information" refers to data information that describes one or more relevant features of the initial image. The image description information can be used to describe the multi-dimensional features of the image, and Features include, for example, scale features, pixel features, etc. The image description information may specifically include, for example, image scale information of the initial image, and/or pixel feature information, and/or processing parameter information, etc., without limitation.
在本公开的描述中,术语“初始区域图像”,是指采用基于人工智能AI技术从初始图像中所截取得到的部分图像,该初始区域图像中可以包括目标区域图像。In the description of this disclosure, the term "initial area image" refers to a partial image intercepted from the initial image using artificial intelligence-based AI technology. The initial area image may include a target area image.
在本公开的描述中,术语“目标区域图像”,是指以图像描述信息作为参考依据,对初始区域图像进行处理,所得到的图像。In the description of this disclosure, the term "target area image" refers to an image obtained by processing the initial area image using the image description information as a reference basis.
在本公开的描述中,术语“目标内容”,是指在该图像检索过程中,以目标区域图像作为检索参考依据,对其进行检索得到的内容,该目标内容可以例如为检索得到的图片、对检索到图片进行描述的文本、音视频等,对此不做限制。In the description of the present disclosure, the term "target content" refers to the content obtained by retrieving the target area image as the retrieval reference basis during the image retrieval process. The target content can be, for example, a retrieved picture, There are no restrictions on the text, audio and video, etc. that describe the retrieved images.
在本公开的描述中,术语“主体”,是指初始图像中所包含的主要描述对象,例如人物肖像图中的人物、汽车展示图中的汽车等,对此不做限制。In the description of the present disclosure, the term "subject" refers to the main description object contained in the initial image, such as a person in a portrait picture, a car in a car display picture, etc., without limitation.
在本公开的描述中,术语“主体信息”,是指初始图像中所包含的检索对象的相关信息,例如检索对象在初始图像中的位置信息、面积信息等,对此不做限制。In the description of this disclosure, the term "subject information" refers to information related to the retrieval object contained in the initial image, such as the position information and area information of the retrieval object in the initial image, and is not limited to this.
在本公开的描述中,术语“位置描述信息”,可以被用于描述主体在初始图像中所处位置的相关信息,例如主体在初始图像中的分布情况以及所占比例等,对此不做限制。In the description of this disclosure, the term "position description information" can be used to describe information related to the location of the subject in the initial image, such as the distribution and proportion of the subject in the initial image. This is not done limit.
在本公开的描述中,术语“图像尺度信息”,可以是指被用于描述初始图像尺度的相关信息,例如初始图像的尺寸、面积等,对此不做限制。In the description of the present disclosure, the term "image scale information" may refer to relevant information used to describe the scale of the initial image, such as the size, area, etc. of the initial image, without limitation.
在本公开的描述中,术语“像素特征信息”,可以被用于描述初始图像中所包含像素的特征信息,例如像素的数量、颜色等,对此不做限制。In the description of the present disclosure, the term "pixel feature information" can be used to describe the feature information of pixels contained in the initial image, such as the number of pixels, color, etc., without limitation.
在本公开的描述中,术语“处理参数信息”,可以是指预先针对初始图像所指定的扩充 倍数、填充颜色、亮度、色调、饱和度以及锐化程度等,对此不做限制。In the description of the present disclosure, the term "processing parameter information" may refer to the expansion factor, fill color, brightness, hue, saturation, sharpening degree, etc. specified in advance for the initial image, without limitation.
在本公开的描述中,术语“区域像素”,是指初始图像区域中一个或多个图像区域对应的像素。In the description of the present disclosure, the term "area pixel" refers to a pixel corresponding to one or more image areas in the initial image area.
在本公开的描述中,术语“区域像素特征”,是指基于像素特征信息所获取的区域像素的相关特征,例如区域像素的数量、颜色等。In the description of this disclosure, the term "regional pixel features" refers to the relevant features of regional pixels obtained based on pixel feature information, such as the number, color, etc. of regional pixels.
在本公开的描述中,术语“待填充区域图像”,是指基于图像尺度信息对初始区域图像进行扩大处理所得到的图像。In the description of the present disclosure, the term "region image to be filled" refers to an image obtained by enlarging the initial region image based on image scale information.
在本公开的描述中,术语“待填充像素”,是指待填充区域图像中需要进行填充的像素。In the description of the present disclosure, the term "pixels to be filled" refers to pixels in the image of the area to be filled that need to be filled.
在本公开的描述中,术语“第一像素特征”,是指基于像素特征信息所获取的待填充像素的相关特征。In the description of the present disclosure, the term "first pixel feature" refers to the relevant feature of the pixel to be filled that is obtained based on the pixel feature information.
在本公开的描述中,术语“第二像素特征”,是指基于像素特征信息所获取的其他区域图像中区域像素的相关特征。In the description of the present disclosure, the term "second pixel feature" refers to the relevant features of regional pixels in other regional images obtained based on pixel feature information.
在本公开的描述中,术语“填充像素特征”,是指基于第一像素特征和第二像素特征所获取的像素特征,该填充像素特征,可以被用于作为待填充区域图像进行填充处理的参考依据。In the description of this disclosure, the term "filling pixel feature" refers to the pixel feature obtained based on the first pixel feature and the second pixel feature. The filling pixel feature can be used to perform filling processing as an image of the area to be filled. Reference.
在本公开的描述中,术语“语义表征信息”,可以被用于表征目标区域图像相关特征的信息。例如目标区域图像可以具备颜色特征,轮廓特征,线性特征、中心特征、对角性特征、纹理特征、局部特征以及形状特征等图像特征,对此不做限制。而语义表征信息,可以是指表征上述一个或多个图像特征的信息。In the description of the present disclosure, the term "semantic representation information" may be used for information that characterizes image-related features of a target area. For example, the target area image can have image features such as color features, contour features, linear features, center features, diagonal features, texture features, local features, and shape features, and there are no restrictions on this. The semantic representation information may refer to information that characterizes one or more of the above image features.
在本公开的描述中,术语“目标物体轮廓”,是指目标区域图像中所包含的检索目标物体的轮廓,例如人物图像中的人体轮廓、汽车图像中的汽车轮廓灯,对此不做限制。In the description of this disclosure, the term "target object outline" refers to the outline of the retrieval target object contained in the target area image, such as the outline of the human body in the person image, the outline of the car light in the car image, and is not limited to this .
在本公开的描述中,术语“物体轮廓信息”,是指基于目标物体轮廓所获取的相关信息。In the description of the present disclosure, the term "object contour information" refers to relevant information obtained based on the contour of the target object.
在本公开的描述中,术语“轮廓向量表征”,是指可以表征物体轮廓信息,且映射于向量空间中的向量表征,该向量表征,可以是采用将特征映射于向量空间的方式得到的特征,例如轮廓特征。In the description of the present disclosure, the term "contour vector representation" refers to a vector representation that can represent the contour information of an object and is mapped in a vector space. The vector representation can be a feature obtained by mapping features to a vector space. , such as contour features.
在本公开的描述中,术语“候选相似度层级”,是指语义表征信息所表征内容与初始图像之间的相似程度在图状数据结构中所属的层级。In the description of the present disclosure, the term "candidate similarity level" refers to the level in the graph data structure to which the degree of similarity between the content represented by the semantic representation information and the initial image belongs.
在本公开的描述中,术语“图状数据结构”,是指预先在向量检索库中以向量距离作为划分依据而建立的数据结构,图状数据结构,可以被用于在图像检索过程中以向量距离作为参考依据寻找候选相似度层级,以缩小检索范围。In the description of the present disclosure, the term "graph data structure" refers to a data structure established in advance in the vector retrieval library using vector distance as the basis for division. The graph data structure can be used to perform image retrieval in the image retrieval process. Vector distance is used as a reference to find candidate similarity levels to narrow the search scope.
智能自动化平台能够实现RPA、智能文档处理(Intelligent Document Processing,IDP)、对话式AI(Conversational AI,CoAI)、流程挖掘(Process Mining)等多项能力的无缝集成,具有“业务理解”、“流程创建”、“随处运行”、“集中管控”、“人机协同”这五大类功能,为企业实现业务流程端到端的智能自动化,代替人工的操作,进一步提高业务效率,加速数字化转型。The intelligent automation platform can realize the seamless integration of RPA, Intelligent Document Processing (IDP), Conversational AI (CoAI), Process Mining and other capabilities, and has the capabilities of "business understanding", " The five major categories of functions, "Process Creation", "Run Anywhere", "Centralized Management and Control", and "Human-Machine Collaboration", enable enterprises to realize end-to-end intelligent automation of business processes, replace manual operations, further improve business efficiency, and accelerate digital transformation.
智能文档处理(IDP)是智能自动化平台的核心能力之一。智能文档处理(Intelligent Document Processing,IDP)是基于光学字符识别(Optical Character Recognition,OCR)、计算机视觉(Computer Vision,CV)、自然语言处理(Natural Language Processing,NLP)、知识图谱(Knowledge Graph,KG)等AI技术,对各类文档进行识别、分类、要素提取、校验、比对、纠错等处理,帮助企业实现文档处理工作的智能化和自动化的新一代自动化技术。Intelligent document processing (IDP) is one of the core capabilities of the intelligent automation platform. Intelligent Document Processing (IDP) is based on Optical Character Recognition (OCR), Computer Vision (CV), Natural Language Processing (NLP), Knowledge Graph (KG) ) and other AI technologies, it can identify, classify, extract elements, verify, compare, and correct errors of various types of documents, and is a new generation of automation technology that helps enterprises realize the intelligence and automation of document processing.
参照下面的描述和附图,将清楚本公开的实施例的这些和其他方面。在这些描述和附图中,具体公开了本公开的实施例中的一些特定实施方式,来表示实施本公开的实施例的原理的一些方式,但是应当理解,本公开的实施例的范围不受此限制。相反,本公开的实施例包括落入所附加权利要求书的精神和内涵范围内的所有变化、修改和等同物。These and other aspects of embodiments of the present disclosure will become apparent with reference to the following description and accompanying drawings. In these descriptions and drawings, some specific implementations of the embodiments of the disclosure are specifically disclosed to represent some of the ways of implementing the principles of the embodiments of the disclosure, but it should be understood that the scope of the embodiments of the disclosure is not limited by this restriction. On the contrary, the disclosed embodiments include all changes, modifications and equivalents falling within the spirit and scope of the appended claims.
以下结合附图描述根据本公开实施例的结合RPA和AI实现IA的图像检索方法。An image retrieval method that combines RPA and AI to implement IA according to an embodiment of the present disclosure is described below with reference to the accompanying drawings.
图1是本公开一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图。Figure 1 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by an embodiment of the present disclosure.
本实施例以结合RPA和AI实现IA的图像检索方法被配置为结合RPA和AI实现IA的图像检索装置中来举例说明,本实施例中结合RPA和AI实现IA的图像检索方法可以被配置在结合RPA和AI实现IA的图像检索装置中,结合RPA和AI实现IA的图像检索装置可以设置在服务器中,或者也可以设置在电子设备中,本公开实施例对此不作限制。In this embodiment, an image retrieval method that combines RPA and AI to implement IA is configured as an image retrieval device that combines RPA and AI to implement IA. In this embodiment, the image retrieval method that combines RPA and AI to implement IA can be configured in Among the image retrieval devices that combine RPA and AI to implement IA, the image retrieval device that combines RPA and AI to implement IA can be installed in a server or in an electronic device. This is not limited in the embodiments of the present disclosure.
本实施例以结合RPA和AI实现IA的图像检索方法被配置在电子设备中为例。其中,电子设备例如智能手机、平板电脑、个人数字助理、电子书等具有各种操作系统的硬件设备。In this embodiment, an image retrieval method that combines RPA and AI to implement IA is configured in an electronic device as an example. Among them, electronic devices such as smartphones, tablets, personal digital assistants, e-books and other hardware devices with various operating systems.
需要说明的是,本公开实施例的执行主体,在硬件上可以例如为服务器或者电子设备中的中央处理器(Central Processing Unit,CPU),在软件上可以例如为服务器或者电子设备中的相关的后台服务,对此不作限制。It should be noted that the execution subject of the embodiments of the present disclosure may be, in terms of hardware, a central processing unit (CPU) in a server or an electronic device, and in terms of software, it may be, for example, a server or a related processor in an electronic device. Background service, there is no restriction on this.
另外,本公开实施例中的“检索”,是指结合机器人流程自动化RPA和人工智能AI实现智能自动化IA的图像检索的过程,也即是说,该图像检索的过程是一个全流程自动化的图像检索的过程,并且该图像检索的过程还与人工智能AI相结合,实现自动化地进行自然语言处理(Natural Language Processing,NLP)领域中的图像检索。In addition, the "retrieval" in the embodiment of the present disclosure refers to the image retrieval process of realizing intelligent automation IA by combining robotic process automation RPA and artificial intelligence AI. In other words, the image retrieval process is a fully automated image retrieval process. The retrieval process, and the image retrieval process is also combined with artificial intelligence AI to realize automated image retrieval in the field of Natural Language Processing (NLP).
本公开可以具体应用于人工智能AI的自然语言处理(Natural Language Processing,NLP)领域,自然语言处理(Natural Language Processing,NLP),即计算机科学,人工智能,语言学关注计算机和人类(自然)语言之间的相互作用的领域。The present disclosure can be specifically applied to the field of natural language processing (NLP) of artificial intelligence AI. Natural language processing (NLP), that is, computer science, artificial intelligence, and linguistics focus on computer and human (natural) language. areas of interaction.
举例而言,本公开实施例中基于该全流程自动化的图像检索过程,可以实现全流程自动化地执行基于机器人流程自动化RPA技术获取初始图像,基于人工智能AI技术从初始图像中截取初始区域图像,根据图像描述信息处理初始区域图像,得到目标区域图像,根据目标区域图像,检索目标内容。For example, in the embodiment of the present disclosure, based on the full-process automated image retrieval process, the full-process automation can be implemented to obtain the initial image based on the robotic process automation RPA technology, and the initial area image can be intercepted from the initial image based on the artificial intelligence AI technology. The initial area image is processed according to the image description information to obtain the target area image, and the target content is retrieved based on the target area image.
如图1所示,该结合RPA和AI实现IA的图像检索方法,包括:As shown in Figure 1, the image retrieval method that combines RPA and AI to implement IA includes:
S101:基于机器人流程自动化RPA技术获取初始图像,其中,初始图像具有图像描述信息。S101: Obtain an initial image based on Robotic Process Automation (RPA) technology, where the initial image has image description information.
其中,机器人流程自动化(Robotic Process Automation,RPA),是指通过机器人应用软件在计算机上按照规则自动执行流程任务。Among them, Robotic Process Automation (RPA) refers to the automatic execution of process tasks according to rules on the computer through robot application software.
其中,初始图像,是指待对其进行检索处理的图像,该初始图像例如可以为交通监控装置拍摄的汽车图像,或者可以为任意种类的包含检索对象的图像,对此不作限制。The initial image refers to an image to be retrieved. The initial image may be, for example, a car image captured by a traffic monitoring device, or may be any type of image containing a retrieval object, and is not limited to this.
也即是说,本公开实施例中的一种应用场景可以具体例如为,采用机器人流程自动化RPA获取交通监控装置所拍摄的汽车图像,将该汽车图像作为初始图像,而后可以对获取的初始图像进行结合RPA和AI实现IA的图像检索,以确定该初始图像中的汽车信息,或者,本公开实施例描述的结合RPA和AI实现IA的图像检索方法,也可以应用于其他任意可能的图像检索场景中,对此不做限制。That is to say, an application scenario in the embodiment of the present disclosure may be, for example, using Robotic Process Automation (RPA) to obtain the car image captured by the traffic monitoring device, using the car image as the initial image, and then the acquired initial image can be Perform image retrieval that combines RPA and AI to implement IA to determine the car information in the initial image. Alternatively, the image retrieval method that combines RPA and AI to implement IA described in the embodiments of the present disclosure can also be applied to any other possible image retrieval. In the scene, there is no restriction on this.
其中,图像描述信息,是指对初始图像的一个或多个相关特征进行描述的数据信息,该图像描述信息,可以用于对图像进行多维度的特征描述,而特征例如,尺度特征、像素特征等,该图像描述信息,可以具体例如为初始图像的图像尺度信息、和/或像素特征信息、和/或处理参数信息等,对此不做限制。Among them, the image description information refers to the data information that describes one or more relevant features of the initial image. The image description information can be used to describe the multi-dimensional features of the image, and the features are, for example, scale features, pixel features etc., the image description information may specifically be, for example, image scale information of the initial image, and/or pixel feature information, and/or processing parameter information, etc., without limitation.
本公开实施例中,在基于机器人流程自动化RPA技术获取初始图像时,可以预先配置应用程序数据接口,由RPA机器人按照预先设定好的软件操作流程,经由该应用程序数据接口接收用户与机器人进行交互的图像信息,并将获取的用户与机器人进行交互的图像信息作为初始图像。In the embodiment of the present disclosure, when acquiring the initial image based on Robotic Process Automation (RPA) technology, the application data interface can be pre-configured, and the RPA robot will receive the user-robot interaction via the application data interface according to the preset software operation process. Interaction image information, and use the obtained image information of the interaction between the user and the robot as the initial image.
另一些实施例中,也可以采用第三方图像收集装置,并预先建立本公开实施例的执行主体与该第三方图像收集装置的通信链接,基于机器人流程自动化RPA获取第三方图像收集装置所收集的图像,将其作为初始图像,或者,也可以采用其他任意可能的基于机器人流程自动化RPA的方式获取初始图像,对此不做限制。In other embodiments, a third-party image collection device can also be used, and a communication link between the execution subject of the embodiment of the present disclosure and the third-party image collection device can be established in advance, and the data collected by the third-party image collection device can be obtained based on Robotic Process Automation (RPA). Image, use it as the initial image, or you can use any other possible method based on Robotic Process Automation (RPA) to obtain the initial image, and there is no limit to this.
S102:基于人工智能AI技术从初始图像中截取初始区域图像。S102: Intercept the initial area image from the initial image based on artificial intelligence AI technology.
其中,人工智能(Artificial Intelligence,AI),是指研究使计算机来模拟人的某些思维过程和智能行为(如学习、推理、思考、规划等)的学科,既有硬件层面的技术也有软件层面的技术。人工智能硬件技术一般包括如传感器、专用人工智能芯片、云计算、分布式存储、大数据处理等技术;人工智能软件技术主要包括计算机视觉技术、语音识别技术、自然语言处理技术,以及机器学习、深度学习、大数据处理技术、知识图谱技术等几大方向。Among them, Artificial Intelligence (AI) refers to the study of using computers to simulate certain human thinking processes and intelligent behaviors (such as learning, reasoning, thinking, planning, etc.). It has both hardware-level technology and software-level technology. Technology. Artificial intelligence hardware technology generally includes technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, and big data processing; artificial intelligence software technology mainly includes computer vision technology, speech recognition technology, natural language processing technology, and machine learning, Deep learning, big data processing technology, knowledge graph technology and other major directions.
其中,初始区域图像,是指基于人工智能AI技术从初始图像中所截取得到的部分图像,该初始区域图像中可以包括目标区域图像。The initial area image refers to a partial image intercepted from the initial image based on artificial intelligence AI technology, and the initial area image may include a target area image.
本公开实施例在基于机器人流程自动化RPA技术获取初始图像之后,可以基于人工智能AI技术从初始图像中截取初始区域图像。In the embodiment of the present disclosure, after acquiring the initial image based on robotic process automation (RPA) technology, the initial area image can be intercepted from the initial image based on artificial intelligence (AI) technology.
本公开实施例中,在基于人工智能AI技术从初始图像中截取初始区域图像时,可以是采用人工智能AI识别初始图像中主体图像的边界信息,而后基于该边界信息从初始图像中截取初始区域图像。In the embodiment of the present disclosure, when intercepting the initial area image from the initial image based on artificial intelligence AI technology, artificial intelligence AI may be used to identify the boundary information of the subject image in the initial image, and then intercept the initial area from the initial image based on the boundary information. image.
另一些实施例中,也可以采用预训练的抠图模型对初始图像进行截取处理,以得到初始区域图像,或者,也可以采用其他任意可能的基于人工智能AI的方法从初始图像中截取初始区域图像,对此不做限制。In other embodiments, a pre-trained matting model can also be used to intercept the initial image to obtain the initial region image, or any other possible method based on artificial intelligence (AI) can be used to intercept the initial region from the initial image. images, no restrictions are placed on this.
S103:根据图像描述信息处理初始区域图像,得到目标区域图像。S103: Process the initial area image according to the image description information to obtain the target area image.
其中,目标区域图像,是指以图像描述信息作为参考依据,对初始区域图像进行处理,所得到的图像。Among them, the target area image refers to the image obtained by processing the initial area image using the image description information as a reference basis.
本公开实施例中,在根据图像描述信息处理初始区域图像,得到目标区域图像时,可以将图像描述信息和初始区域图像输入至预训练的图像处理模型中,以得到目标区域图像,并传输至本公开实施例的执行主体,或者,也可以采用其他任意可能的方法,如数学、工程学的方法,根据图像描述信息处理初始区域图像,得到目标区域图像,对此不做限制。In the embodiment of the present disclosure, when the initial area image is processed according to the image description information to obtain the target area image, the image description information and the initial area image can be input into the pre-trained image processing model to obtain the target area image and transmit it to The execution subject of the embodiment of the present disclosure may also use any other possible methods, such as mathematical and engineering methods, to process the initial area image according to the image description information to obtain the target area image, which is not limited.
S104:根据目标区域图像,检索目标内容。S104: Retrieve target content based on the target area image.
由此,本公开实施例可以有效结合RPA和AI实现图像检索过程的智能自动化(Intelligent Automation,IA),从而有效提升图像检索的自动化程度,降低人工成本。Therefore, embodiments of the present disclosure can effectively combine RPA and AI to realize intelligent automation (IA) of the image retrieval process, thereby effectively improving the automation of image retrieval and reducing labor costs.
其中,目标内容,是指在该图像检索过程中,以目标区域图像作为检索参考依据,对其进行检索得到的内容,该目标内容可以例如为检索得到的图片、对检索到图片进行描述的文本、音视频等,对此不做限制。Among them, the target content refers to the content obtained by retrieving the target area image as the retrieval reference basis during the image retrieval process. The target content can be, for example, the retrieved picture or the text describing the retrieved picture. , audio and video, etc., there are no restrictions on this.
本公开实施例中可以预先获取图像检索库,该图像检索库中可以包含上述目标内容,以实现根据目标区域图像,对图像检索库中相似图像的检索。In the embodiment of the present disclosure, an image retrieval database can be obtained in advance, and the image retrieval database can contain the above target content, so as to achieve retrieval of similar images in the image retrieval database based on the target area image.
本公开实施例在上述根据图像描述信息处理初始区域图像,得到目标区域图像之后,可以根据目标区域图像,检索目标内容。In the embodiment of the present disclosure, after the initial area image is processed according to the image description information and the target area image is obtained, the target content can be retrieved based on the target area image.
本公开实施例中,根据目标区域图像,检索目标内容时,可以确定目标区域图像的分类特征信息,而后基于该分类特征信息检索目标内容。In the embodiments of the present disclosure, when retrieving target content based on the target area image, the classification feature information of the target area image can be determined, and then the target content can be retrieved based on the classification feature information.
另一些实施例中,可以预先训练一个检索学习模型,该检索学习模型可以执行对目标区域图像的特征分析,并依据所得特征分析结果对图像检索库进行检索操作,或者,也可以采用其他任意可能方式根据目标区域图像,检索目标内容,对此不做限制。In other embodiments, a retrieval learning model can be pre-trained. The retrieval learning model can perform feature analysis of the target area image, and perform retrieval operations on the image retrieval library based on the obtained feature analysis results, or any other possibility can be used. The method retrieves the target content based on the target area image, without any restrictions.
本实施例中,通过基于机器人流程自动化RPA技术获取初始图像,基于人工智能AI技术从初始图像中截取初始区域图像,根据图像描述信息处理初始区域图像,得到目标区域图像,根据目标区域图像,检索目标内容,能够利用RPA结合人工智能AI实现图像检索的智能自动化IA,能够在图像检索之前及时地对图像进行预处理,以去除图像中的干扰信息,有效提升所得目标区域图像在检索过程中的针对性,有效降低干扰信息对检索过程的影响,从而有效提升图像检索效率和图像检索结果的准确性。In this embodiment, the initial image is obtained based on Robotic Process Automation RPA technology, the initial area image is intercepted from the initial image based on artificial intelligence AI technology, the initial area image is processed according to the image description information, and the target area image is obtained, and the target area image is retrieved based on the target area image. Target content can use RPA combined with artificial intelligence AI to realize intelligent automation IA of image retrieval. It can preprocess images in time before image retrieval to remove interference information in the image and effectively improve the performance of the obtained target area image in the retrieval process. Targeted, effectively reduce the impact of interference information on the retrieval process, thereby effectively improving image retrieval efficiency and the accuracy of image retrieval results.
图2是本公开另一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图。Figure 2 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure.
如图2所示,该结合RPA和AI实现IA的图像检索方法,包括:As shown in Figure 2, the image retrieval method that combines RPA and AI to implement IA includes:
S201:基于机器人流程自动化RPA技术获取初始图像。S201: Obtain initial images based on Robotic Process Automation (RPA) technology.
S201的描述说明可以示例参见上述实施例,在此不再赘述。For the description of S201, reference may be made to the above-mentioned embodiments and will not be described again here.
S202:确定初始图像的图像尺度信息。S202: Determine the image scale information of the initial image.
其中,图像尺度信息,可以是指被用于描述初始图像尺度的相关信息,例如初始图像的尺寸、面积等,对此不做限制。The image scale information may refer to relevant information used to describe the scale of the initial image, such as the size, area, etc. of the initial image, which is not limited.
本公开实施例在基于机器人流程自动化RPA技术获取初始图像之后,可以确定初始图像的图像尺度信息,可以利用预训练的图像尺度算法模型对初始图像进行算法分析,以得到预训练的图像尺度算法模型所输出的尺度信息,并将预训练的图像尺度算法模型所输出的尺度信息作为初始图像的图像尺度信息,所得图像尺度信息可以从尺度维度有效表征初始图像的特征信息。In embodiments of the present disclosure, after acquiring the initial image based on robotic process automation (RPA) technology, the image scale information of the initial image can be determined, and the pre-trained image scale algorithm model can be used to perform algorithm analysis on the initial image to obtain the pre-trained image scale algorithm model. The output scale information is used as the image scale information of the initial image. The obtained image scale information can effectively characterize the feature information of the initial image from the scale dimension.
S203:确定初始图像的像素特征信息。S203: Determine the pixel feature information of the initial image.
其中,像素特征信息,可以被用于描述初始图像中所包含像素的特征信息,例如像素的数量、色调等,对此不做限制。Among them, the pixel feature information can be used to describe the feature information of the pixels contained in the initial image, such as the number of pixels, hue, etc., and there is no limit to this.
本公开实施例在基于机器人流程自动化RPA技术获取初始图像之后,还可以确定初始图像的像素特征信息,可以使用卷积神经网络对初始图像进行特征分析处理,以得到卷积神经网络输出的像素特征信息作为初始图像的像素特征信息,所得像素特征信息可以从像素特征的维度有效表征初始图像的特征信息。In the embodiments of the present disclosure, after acquiring the initial image based on Robotic Process Automation (RPA) technology, the pixel feature information of the initial image can also be determined, and a convolutional neural network can be used to perform feature analysis and processing on the initial image to obtain the pixel features output by the convolutional neural network. The information is used as the pixel feature information of the initial image, and the obtained pixel feature information can effectively characterize the feature information of the initial image from the dimension of pixel features.
S204:确定针对初始图像指定的处理参数信息。S204: Determine the processing parameter information specified for the initial image.
其中,处理参数信息,可以是指预先针对初始图像所指定的扩充倍数、填充颜色、亮度、色调、饱和度以及锐化程度等信息,对此不做限制。The processing parameter information may refer to information such as expansion factor, fill color, brightness, hue, saturation, and sharpening degree specified in advance for the initial image, and there is no limit to this.
本公开实施例中在基于机器人流程自动化RPA技术获取初始图像之后,还可以确定针对初始图像指定的处理参数信息,该处理参数信息可以是依据于用户配置指令进行配置,或者,也可以是预先确定目标区域图像的模板信息,而后基于该模板信息与初始图像进行分析对比,根据所得对比结果以确定适用于该初始图像的处理参数信息,对此不做限制。In the embodiment of the present disclosure, after the initial image is obtained based on the Robotic Process Automation (RPA) technology, the processing parameter information specified for the initial image can also be determined. The processing parameter information can be configured based on the user configuration instructions, or it can also be determined in advance. The template information of the target area image is then analyzed and compared with the initial image based on the template information, and the processing parameter information applicable to the initial image is determined based on the obtained comparison results. There is no limit to this.
S205:将图像尺度信息、和/或像素特征信息、和/或处理参数信息作为图像描述信息。S205: Use image scale information, and/or pixel feature information, and/or processing parameter information as image description information.
本公开实施例中在基于机器人流程自动化RPA技术获取初始图像之后,可以获取初始图像的图像尺度信息、和/或像素特征信息、和/或处理参数信息,并将其中的一个或多个作为图像描述信息,所得图像描述信息可以作为后续初始区域图像处理过程的参考依据。In the embodiment of the present disclosure, after acquiring the initial image based on the Robotic Process Automation RPA technology, the image scale information, and/or pixel feature information, and/or processing parameter information of the initial image can be acquired, and one or more of them can be used as the image Description information, and the obtained image description information can be used as a reference for subsequent initial region image processing.
本实施例中,通过确定初始图像的图像尺度信息,和/或确定初始图像的像素特征信息,和/或确定针对初始图像指定的处理参数信息,并将图像尺度信息、和/或像素特征信息、和/或处理参数信息作为图像描述信息,由此,使所得图像描述信息可以从多个特征维度表征初始图像的相关信息,能够使该图像描述信息适用于不同的图像预处理场景,当基于图像描述信息处理初始区域图像时,能够实现多维度的有效处理,有效提升所得目标区域图像在图像检索过程的参考价值,从而有效提升该图像检索过程的灵活性。In this embodiment, by determining the image scale information of the initial image, and/or determining the pixel feature information of the initial image, and/or determining the processing parameter information specified for the initial image, and using the image scale information, and/or the pixel feature information , and/or process parameter information as image description information, thereby enabling the obtained image description information to characterize the relevant information of the initial image from multiple feature dimensions, making the image description information applicable to different image preprocessing scenarios. When based on When the image description information processes the initial area image, it can achieve multi-dimensional effective processing, effectively improve the reference value of the obtained target area image in the image retrieval process, thereby effectively improving the flexibility of the image retrieval process.
S206:调用自然语言处理NLP服务,以识别初始图像中的主体信息。S206: Call the natural language processing NLP service to identify the subject information in the initial image.
其中,自然语言处理(Natural Language Processing,NLP)服务,是以语言为对象,利用计算机技术来分析、理解和处理自然语言,即把计算机作为语言研究的强大工具,在计算机的支持下对语言信息进行定量化的研究,并提供可供人与计算机之间能共同使用的语言描写服务。Among them, Natural Language Processing (NLP) services take language as the object and use computer technology to analyze, understand and process natural language. That is, the computer is used as a powerful tool for language research, and language information is processed with the support of the computer. Conduct quantitative research and provide language description services that can be used between humans and computers.
其中,主体,是指初始图像中所包含的主要描述对象,例如人物肖像图中的人物、汽车展示图中的汽车等,对此不做限制。Among them, the subject refers to the main description object contained in the initial image, such as the person in the portrait picture, the car in the car display picture, etc., and there is no limit to this.
其中,主体信息,是指初始图像中所包含的检索对象的相关信息,例如检索对象在初始图像中的位置信息、面积信息等,对此不做限制。Among them, the main information refers to the relevant information of the retrieval object contained in the initial image, such as the position information and area information of the retrieval object in the initial image, and there is no limit to this.
本公开实施例中在将图像尺度信息、和/或像素特征信息、和/或处理参数信息作为图像描述信息之后,可以调用自然语言处理NLP服务,以识别初始图像中的主体信息,可以采用图像主体检测算法检测初始图像中的主体信息,或者,也可以通过人机协同的方式对图像主体进行标注,以得到初始图像中的主体信息,从而减少标注成本,对此不做限制。In the embodiment of the present disclosure, after using the image scale information, and/or pixel feature information, and/or processing parameter information as image description information, the natural language processing NLP service can be called to identify the subject information in the initial image. The image can be used The subject detection algorithm detects the subject information in the initial image, or it can also annotate the subject of the image through human-computer collaboration to obtain the subject information in the initial image, thereby reducing the cost of annotation, and there is no restriction on this.
S207:根据主体信息,确定主体对应于初始图像中的位置描述信息。S207: According to the subject information, determine that the subject corresponds to the position description information in the initial image.
其中,位置描述信息,是指可以被用于描述主体在初始图像中所处位置的相关信息,例如主体在初始图像中的分布情况以及所占比例等,对此不做限制。The position description information refers to relevant information that can be used to describe the location of the subject in the initial image, such as the distribution and proportion of the subject in the initial image, etc., and there is no limit to this.
本公开实施例中在调用自然语言处理NLP服务,以识别初始图像中的主体信息之后,可以根据该主体信息,确定主体对应于初始图像中的位置描述信息,所得位置描述信息可以为后续从初始图像中截取与位置描述信息对应的区域图像提供可靠的参考依据。In the embodiment of the present disclosure, after calling the natural language processing NLP service to identify the subject information in the initial image, it can be determined based on the subject information that the subject corresponds to the position description information in the initial image. The obtained position description information can be used for subsequent processing from the initial image. The regional image corresponding to the location description information is intercepted from the image to provide a reliable reference basis.
S208:从初始图像中截取与位置描述信息对应的区域图像作为初始区域图像。S208: Intercept the area image corresponding to the position description information from the initial image as the initial area image.
其中,初始区域图像,是指采用基于人工智能AI技术从初始图像中所截取得到的部分图像,该初始区域图像中可以包括目标区域图像。The initial area image refers to a partial image intercepted from the initial image using artificial intelligence-based AI technology. The initial area image may include a target area image.
本公开实施例在根据主体信息,确定主体对应于初始图像中的位置描述信息之后,可以从初始图像中截取与位置描述信息对应的区域图像作为初始区域图像,可以根据位置描述信息在初始图像中进行标注,而后根据标注信息进行裁剪,以得到初始区域图像。In the embodiment of the present disclosure, after determining that the subject corresponds to the position description information in the initial image based on the subject information, the region image corresponding to the position description information can be intercepted from the initial image as the initial region image, and the region image corresponding to the position description information can be intercepted in the initial image according to the position description information. Annotate and then crop based on the annotation information to obtain the initial region image.
本实施例中,通过调用自然语言处理NLP服务,以识别初始图像中的主体信息,根据主体信息,确定主体对应于初始图像中的位置描述信息,从初始图像中截取与位置描述信息对应的区域图像作为初始区域图像,由于初始图像中可能包含主体信息之外的干扰信息,且该干扰信息可能会影响检索过程的效率和准确性,当调用自然语言处理NLP服务,以识别初始图像中的主体信息,并根据主体信息确定主体对应于初始图像中的位置描述信息,可以使所得位置描述信息能够有效表征主体在初始图像中的位置信息,而后从初始图像中截取与位置描述信息对应的区域图像作为初始区域图像,可以有效减少所得初始区域图像中的干扰信息,从而提升所得初始区域图像对主体信息的表征准确性。In this embodiment, the natural language processing NLP service is called to identify the subject information in the initial image. Based on the subject information, it is determined that the subject corresponds to the position description information in the initial image, and the area corresponding to the position description information is intercepted from the initial image. The image is used as the initial area image. Since the initial image may contain interference information other than the subject information, and this interference information may affect the efficiency and accuracy of the retrieval process, when calling the natural language processing NLP service to identify the subject in the initial image information, and determine that the subject corresponds to the position description information in the initial image based on the subject information, so that the obtained position description information can effectively represent the position information of the subject in the initial image, and then intercept the area image corresponding to the position description information from the initial image As an initial area image, the interference information in the obtained initial area image can be effectively reduced, thereby improving the accuracy of the obtained initial area image in representing subject information.
S209:根据图像描述信息处理初始区域图像,得到目标区域图像。S209: Process the initial area image according to the image description information to obtain the target area image.
S209的描述说明可以示例参见上述实施例,在此不再赘述。For the description of S209, reference may be made to the above-mentioned embodiment, and details will not be described again here.
S210:确定目标区域图像的语义表征信息。S210: Determine the semantic representation information of the target area image.
其中,语义表征信息,是指可以被用于表征目标区域图像相关特征的信息。例如目标区域图像可以具备颜色特征、轮廓特征、线性特征、中心特征、对角性特征、纹理特征、局部特征以及形状特征等图像特征,对此不做限制。而语义表征信息,可以是指表征上述一个或多个图像特征的信息。Among them, semantic representation information refers to information that can be used to characterize the image-related features of the target area. For example, the target area image can have image features such as color features, contour features, linear features, center features, diagonal features, texture features, local features, and shape features, and there are no restrictions on this. The semantic representation information may refer to information that characterizes one or more of the above image features.
本公开实施例中确定目标区域图像的语义表征信息,可以是预先训练针对目标区域图像的特征提取器,而后将目标区域图像输入至该特征提取器中,以得到一个或多个维度的特征向量表征信息,并将所得一个或多个维度的特征向量表征信息作为目标区域图像的语义表征信息,或者,也可以采用其他任意可能的方法确定目标区域图像的语义表征信息,对此不做限制。In the embodiment of the present disclosure, the semantic representation information of the target area image is determined by pre-training a feature extractor for the target area image, and then inputting the target area image into the feature extractor to obtain a feature vector of one or more dimensions. Representation information, and the obtained feature vector representation information of one or more dimensions is used as the semantic representation information of the target area image, or any other possible method can be used to determine the semantic representation information of the target area image, without limitation.
可选地,一些实施例中,确定目标区域图像的语义表征信息,可以是从目标区域图像识别出目标物体轮廓,根据目标物体轮廓,确定物体轮廓信息,处理物体轮廓信息,得到轮廓向量表征,将轮廓向量表征作为语义表征信息,由于物体轮廓可以有效表征物体的特征信息,当基于目标物体轮廓确定物体轮廓信息,并对物体轮廓信息进行处理以得到轮廓向量表征,可以有效提升轮廓向量表征的表征效果,而后将轮廓向量表征作为语义表征信息,可以有效提升所得语义表征信息在图像检索过程中的适用性。Optionally, in some embodiments, determining the semantic representation information of the target area image may include identifying the target object outline from the target area image, determining the object outline information according to the target object outline, processing the object outline information, and obtaining the outline vector representation, Contour vector representation is used as semantic representation information. Since the object contour can effectively represent the characteristic information of the object, when the object contour information is determined based on the target object contour and the object contour information is processed to obtain the contour vector representation, the contour vector representation can be effectively improved. Representation effect, and then using the contour vector representation as semantic representation information can effectively improve the applicability of the obtained semantic representation information in the image retrieval process.
其中,目标物体轮廓,是指目标区域图像中所包含的检索目标物体的轮廓,例如人物图像中的人体轮廓、汽车图像中的汽车轮廓灯,对此不做限制。Among them, the target object outline refers to the outline of the retrieval target object contained in the target area image, such as the human body outline in the person image and the car outline lights in the car image. There is no limit to this.
其中,物体轮廓信息,是指基于目标物体轮廓所获取的相关信息。Among them, the object contour information refers to the relevant information obtained based on the contour of the target object.
其中,轮廓向量表征,是指可以表征物体轮廓信息,且映射于向量空间中的向量表征,该向量表征,可以是采用将特征映射于向量空间的方式得到的特征,例如轮廓特征。Among them, the contour vector representation refers to a vector representation that can represent the contour information of an object and is mapped in a vector space. The vector representation can be a feature obtained by mapping features to a vector space, such as a contour feature.
本公开实施例中,获取轮廓向量表征,可以是对目标区域图像采取降维、白化、池化等操作,提取目标区域图像中主体的轮廓特征,并将其映射于向量空间中,以得到轮廓向量表征。In the embodiments of the present disclosure, obtaining the contour vector representation may include performing operations such as dimensionality reduction, whitening, and pooling on the target area image, extracting the contour features of the subject in the target area image, and mapping them into the vector space to obtain the contour. Vector representation.
本公开实施例中,在确定目标区域图像的语义表征信息时,可以采用视觉神经网络从目标区域图像识别出目标物体轮廓,根据目标物体轮廓,确定物体轮廓信息,经由处理物体轮廓信息,以得到轮廓向量表征,将轮廓向量表征作为语义表征信息。In the embodiment of the present disclosure, when determining the semantic representation information of the target area image, the visual neural network can be used to identify the target object outline from the target area image, determine the object outline information according to the target object outline, and process the object outline information to obtain Contour vector representation, using contour vector representation as semantic representation information.
S211:根据语义表征信息,检索目标内容。S211: Retrieve target content based on semantic representation information.
本公开实施例中,在上述确定目标区域图像的语义表征信息之后,可以根据语义表征信息,检索目标内容,可以语义表征信息作为参考依据,在图像检索库中检索符合上述语义表征信息的图片,以得到目标内容。In the embodiment of the present disclosure, after the semantic representation information of the target area image is determined above, the target content can be retrieved according to the semantic representation information, and the semantic representation information can be used as a reference basis to retrieve pictures that meet the above semantic representation information in the image retrieval database. to get the target content.
可选地,一些实施例中,根据语义表征信息,检索目标内容,可以是确定与语义表征信息对应的候选相似度层级,其中,候选相似度层级属于预先构建的图状数据结构,候选相似度层级,是其相应所表征内容与初始图像之间的相似程度所属的层级,而后将图状数据结构中候选相似度层级所表征内容作为目标内容,由于检索库中可能存在大量数据,当确定与语义表征信息对应的候选相似度层级,并将图状数据结构中候选相似度层级所表征内容作为目标内容,可以较大程度地降低检索过程的计算成本,并有效提升检索效率。Optionally, in some embodiments, retrieving the target content according to the semantic representation information may be to determine the candidate similarity level corresponding to the semantic representation information, where the candidate similarity level belongs to a pre-built graph data structure, and the candidate similarity level The level is the level to which the similarity between the corresponding represented content and the initial image belongs. Then the content represented by the candidate similarity level in the graph data structure is used as the target content. Since there may be a large amount of data in the retrieval database, when determining the The candidate similarity level corresponding to the semantic representation information, and using the content represented by the candidate similarity level in the graph data structure as the target content, can greatly reduce the computational cost of the retrieval process and effectively improve the retrieval efficiency.
其中,候选相似度层级,是指语义表征信息所表征内容与初始图像之间的相似程度在图状数据结构中所属的层级。Among them, the candidate similarity level refers to the level of similarity between the content represented by the semantic representation information and the initial image in the graph data structure.
其中,图状数据结构,是指预先在向量检索库中以向量距离作为划分依据而建立的数据结构,图状数据结构,可以被用于在图像检索过程中以向量距离作为参考依据寻找候选相似度层级,以缩小检索范围。Among them, the graph data structure refers to the data structure established in advance in the vector retrieval library using vector distance as the basis for division. The graph data structure can be used to use vector distance as the reference basis to find candidate similarities during the image retrieval process. hierarchies to narrow the search scope.
本公开实施例中,在根据语义表征信息,检索目标内容时,可以确定与语义表征信息对应的候选相似度层级,其中,候选相似度层级属于预先构建的图状数据结构,候选相似度层级,是其相应所表征内容与初始图像之间的相似程度所属的层级,而后将图状数据结构中候选相似度层级所表征内容作为目标内容。In the embodiments of the present disclosure, when retrieving target content according to the semantic representation information, candidate similarity levels corresponding to the semantic representation information can be determined, where the candidate similarity levels belong to a pre-constructed graph data structure, and the candidate similarity levels, is the level to which the similarity between the corresponding represented content and the initial image belongs, and then the content represented by the candidate similarity level in the graph data structure is used as the target content.
本实施例中,通过确定目标区域图像的语义表征信息,并根据语义表征信息,检索目标内容,由于语义表征信息可以有效表征目标区域图像的相关特征,当基于语义表征信息检索目标内容,可以有效提升检索过程的针对性和目标性,能够有效提升检索结果的可靠性。In this embodiment, the semantic representation information of the target area image is determined, and the target content is retrieved based on the semantic representation information. Since the semantic representation information can effectively characterize the relevant features of the target area image, when the target content is retrieved based on the semantic representation information, the target content can be effectively retrieved. Improving the pertinence and purpose of the search process can effectively improve the reliability of search results.
本实施例中,通过确定初始图像的图像尺度信息,和/或确定初始图像的像素特征信息,和/或确定针对初始图像指定的处理参数信息,并将图像尺度信息、和/或像素特征信息、和/或处理参数信息作为图像描述信息,由此,使所得图像描述信息可以从多个特征维度表征初始图像的相关信息,能够使该图像描述信息适用于不同的图像预处理场景,当基于图像描述信息处理初始区域图像时,能够实现多维度的有效处理,有效提升所得目标区域图像在图像检索过程的参考价值,从而有效提升该图像检索过程的灵活性。通过调用自然语言处理NLP服务,以识别初始图像中的主体信息,根据主体信息,确定主体对应于初始图像中的位置描述信息,从初始图像中截取与位置描述信息对应的区域图像作为初始区域图像,由于初始图像中可能包含主体信息之外的干扰信息,且该干扰信息可能会影响检索过程的效率和准确性,当调用自然语言处理NLP服务,以识别初始图像中的主体信息,并根据主体信息确定主体对应于初始图像中的位置描述信息,可以使所得位置描述信息能够有效表征主体在初始图像中的位置信息,而后从初始图像中截取与位置描述信息对应的区域图像作为初始区域图像,可以有效减少所得初始区域图像中的干扰信息,从而提升所得初始区域图像对主体信息的表征准确性。通过确定目标区域图像的语义表征信息,并根据语义表征信息,检索目标内容,由于语义表征信息可以有效表征目标区域图像的相关特征,当基于语义表征信息检索目标内容,可以有效提升检索过程的针对性和目标性,能够有效提升检索结果的可靠性。由于物体轮廓可以有效表征物体的特征信息,当基于目标物体轮廓确定物体轮廓信息,并对物体轮廓信息进行处理以得到轮廓向量表征,可以有效提升轮廓向量表征的表征效果,而后将轮廓向量表征作为语义表征信息,可以有效提升所得语义表征信息在图像检索过程中的适用性。由于检索库中可能存在大量数据,当确定与语义表征信息对应的候选相似度层级,并将图状数据结构中候选相似度层级所表征内容作为目标内容,可以较大程度地降低检索过程的 计算成本,并有效提升检索效率。In this embodiment, by determining the image scale information of the initial image, and/or determining the pixel feature information of the initial image, and/or determining the processing parameter information specified for the initial image, and using the image scale information, and/or the pixel feature information , and/or process parameter information as image description information, thereby enabling the obtained image description information to characterize the relevant information of the initial image from multiple feature dimensions, making the image description information applicable to different image preprocessing scenarios. When based on When the image description information processes the initial area image, it can achieve multi-dimensional effective processing, effectively improve the reference value of the obtained target area image in the image retrieval process, thereby effectively improving the flexibility of the image retrieval process. By calling the natural language processing NLP service to identify the subject information in the initial image, based on the subject information, determine that the subject corresponds to the location description information in the initial image, and intercept the area image corresponding to the location description information from the initial image as the initial area image , since the initial image may contain interference information other than the subject information, and this interference information may affect the efficiency and accuracy of the retrieval process, when calling the natural language processing NLP service to identify the subject information in the initial image, and based on the subject The information determines that the subject corresponds to the position description information in the initial image, so that the obtained position description information can effectively represent the position information of the subject in the initial image, and then intercepts the area image corresponding to the position description information from the initial image as the initial area image, It can effectively reduce the interference information in the obtained initial area image, thereby improving the accuracy of the obtained initial area image to represent the subject information. By determining the semantic representation information of the target area image and retrieving the target content based on the semantic representation information, since the semantic representation information can effectively represent the relevant features of the target area image, when the target content is retrieved based on the semantic representation information, the retrieval process can be effectively improved. sex and purpose, which can effectively improve the reliability of search results. Since the object contour can effectively represent the characteristic information of the object, when the object contour information is determined based on the target object contour and the object contour information is processed to obtain the contour vector representation, the representation effect of the contour vector representation can be effectively improved, and then the contour vector representation is used as Semantic representation information can effectively improve the applicability of the obtained semantic representation information in the image retrieval process. Since there may be a large amount of data in the retrieval database, when the candidate similarity levels corresponding to the semantic representation information are determined and the content represented by the candidate similarity levels in the graph data structure is used as the target content, the calculation of the retrieval process can be greatly reduced. cost, and effectively improve retrieval efficiency.
图3是本公开另一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图。FIG. 3 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure.
如图3所示,该结合RPA和AI实现IA的图像检索方法,包括:As shown in Figure 3, the image retrieval method that combines RPA and AI to implement IA includes:
S301:基于机器人流程自动化RPA技术获取初始图像。S301: Obtain the initial image based on Robotic Process Automation (RPA) technology.
S302:确定初始图像的图像尺度信息。S302: Determine the image scale information of the initial image.
S303:将图像尺度信息作为图像描述信息。S303: Use image scale information as image description information.
S304:基于人工智能AI技术从初始图像中截取初始区域图像。S304: Intercept the initial area image from the initial image based on artificial intelligence AI technology.
S301-S304的描述说明可以示例参见上述实施例,在此不再赘述。For descriptions of S301-S304, reference may be made to the above-mentioned embodiments and will not be described again here.
S305:根据图像尺度信息对初始区域图像进行扩大处理。S305: Expand the initial area image according to the image scale information.
本公开实施例中,在确定初始图像的图像尺度信息,并基于人工智能AI技术从初始图像中截取初始区域图像之后,可以根据图像尺度信息对初始区域图像进行扩大处理,使处理后所得到图像的尺度等于初始图像的尺度或其他任意适用于该图像检索过程的尺度,对此不做限制。In the embodiment of the present disclosure, after determining the image scale information of the initial image and intercepting the initial area image from the initial image based on artificial intelligence AI technology, the initial area image can be enlarged according to the image scale information, so that the image obtained after processing The scale of is equal to the scale of the initial image or any other scale suitable for the image retrieval process, and there is no restriction on this.
S306:将扩大处理后的图像作为目标区域图像。S306: Use the enlarged image as the target area image.
本公开实施例中,在根据图像尺度信息对初始区域图像进行扩大处理之后,可以将扩大处理后的图像作为目标区域图像。In the embodiment of the present disclosure, after the initial area image is enlarged according to the image scale information, the enlarged image can be used as the target area image.
本实施例中,将图像尺度信息作为图像描述信息,通过根据图像尺度信息对初始区域图像进行扩大处理,将扩大处理后的图像作为目标区域图像,由于基于人工智能AI技术从初始图像中所截取得到的初始区域图像的尺度可能较低,当根据图像尺度信息对初始区域图像进行扩大处理,并将扩大处理后的图像作为目标区域图像,可以有效避免初始区域图像尺度过低而影响检索效果,能够有效提升所得目标区域图像作为检索依据的可靠性。In this embodiment, the image scale information is used as image description information, the initial area image is expanded according to the image scale information, and the expanded image is used as the target area image. Since the image is intercepted from the initial image based on artificial intelligence AI technology The scale of the obtained initial area image may be low. When the initial area image is expanded according to the image scale information, and the enlarged image is used as the target area image, it can effectively avoid the initial area image being too low and affecting the retrieval effect. It can effectively improve the reliability of the obtained target area image as a retrieval basis.
S307:根据目标区域图像,检索目标内容。S307: Retrieve target content based on the target area image.
S307的描述说明可以示例参见上述实施例,在此不再赘述。For the description of S307, reference may be made to the above-mentioned embodiment, and details will not be described again here.
本实施例中,通过根据图像尺度信息对初始区域图像进行扩大处理,将扩大处理后的图像作为目标区域图像,由于基于人工智能AI技术从初始图像中所截取得到的初始区域图像的尺度可能较低,当根据图像尺度信息对初始区域图像进行扩大处理,并将扩大处理后的图像作为目标区域图像,可以有效避免初始区域图像尺度过低而影响检索效果,能够有效提升所得目标区域图像作为检索依据的可靠性。In this embodiment, the initial area image is enlarged according to the image scale information, and the enlarged image is used as the target area image. Since the scale of the initial area image intercepted from the initial image based on artificial intelligence AI technology may be relatively large, Low, when the initial area image is enlarged according to the image scale information, and the enlarged image is used as the target area image, it can effectively avoid the initial area image scale being too low and affect the retrieval effect, and can effectively improve the resulting target area image as a retrieval The reliability of the basis.
图4是本公开另一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图。FIG. 4 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure.
如图4所示,该结合RPA和AI实现IA的图像检索方法,包括:As shown in Figure 4, the image retrieval method that combines RPA and AI to implement IA includes:
S401:基于机器人流程自动化RPA技术获取初始图像。S401: Obtain the initial image based on Robotic Process Automation (RPA) technology.
S402:确定初始图像的像素特征信息。S402: Determine the pixel feature information of the initial image.
S403:将像素特征信息作为图像描述信息。S403: Use pixel feature information as image description information.
S404:基于人工智能AI技术从初始图像中截取初始区域图像。S404: Intercept the initial area image from the initial image based on artificial intelligence AI technology.
S401-S404的描述说明可以示例参见上述实施例,在此不再赘述。For descriptions of S401-S404, reference may be made to the above-mentioned embodiments and will not be described again here.
S405:获取初始图像区域中的区域像素。S405: Obtain the area pixels in the initial image area.
其中,区域像素,是指初始图像中一个或多个图像区域对应的像素。Among them, regional pixels refer to pixels corresponding to one or more image regions in the initial image.
本公开实施例中,在确定初始图像的像素特征信息,并基于人工智能AI技术从初始图像中截取初始区域图像之后,可以获取初始图像区域中的区域像素。In the embodiment of the present disclosure, after determining the pixel feature information of the initial image and intercepting the initial area image from the initial image based on artificial intelligence AI technology, the area pixels in the initial image area can be obtained.
S406:从像素特征信息中解析区域像素的区域像素特征。S406: Parse the regional pixel features of the regional pixels from the pixel feature information.
其中,区域像素特征,是指基于像素特征信息所获取的区域像素的相关特征,例如区域像素的数量、颜色等。Among them, regional pixel features refer to the relevant features of regional pixels obtained based on pixel feature information, such as the number and color of regional pixels.
本公开实施例中,在获取初始图像区域中的区域像素之后,可以基于上述像素特征信息和区域像素进行匹配处理,以解析获取区域像素的区域像素特征。In the embodiment of the present disclosure, after obtaining the regional pixels in the initial image area, matching processing can be performed based on the above-mentioned pixel feature information and the regional pixels to analyze and obtain the regional pixel features of the regional pixels.
S407:对初始图像区域中各个区域像素的区域像素特征进行增强处理,得到目标区域图像。S407: Enhance the regional pixel features of each area pixel in the initial image area to obtain the target area image.
本公开实施例中,在从像素特征信息中解析区域像素的区域像素特征之后,可以对初始图像区域中各个区域像素的区域像素特征进行增强处理,以提升各个区域像素的区域像素特征的辨识度,得到目标区域图像。In the embodiment of the present disclosure, after parsing the regional pixel features of the regional pixels from the pixel feature information, the regional pixel features of each regional pixel in the initial image area can be enhanced to improve the recognition of the regional pixel features of each regional pixel. , get the target area image.
本实施例中,将像素特征信息作为图像描述信息,通过获取初始图像区域中的区域像素,从像素特征信息中解析区域像素的区域像素特征,对初始图像区域中各个区域像素的区域像素特征进行增强处理,得到目标区域图像,由于区域像素特征的强度可能会影响图像检索效果,当对初始图像区域中各个区域像素的区域像素特征进行增强处理,可以有效提升所得目标区域图像对主体图像的表征能力,从而提升图像检索过程的针对性和准确性。In this embodiment, the pixel feature information is used as the image description information. By obtaining the regional pixels in the initial image region, the regional pixel features of the regional pixels are analyzed from the pixel feature information, and the regional pixel features of each regional pixel in the initial image region are analyzed. Enhancement processing is performed to obtain the target area image. Since the intensity of regional pixel features may affect the image retrieval effect, when the regional pixel features of each regional pixel in the initial image area are enhanced, the representation of the subject image of the obtained target area image can be effectively improved. capabilities, thereby improving the pertinence and accuracy of the image retrieval process.
S408:根据目标区域图像,检索目标内容。S408: Retrieve the target content according to the target area image.
S408的描述说明可以示例参见上述实施例,在此不再赘述。For the description of S408, reference may be made to the above-mentioned embodiments and will not be described again here.
本实施例中,通过获取初始图像区域中的区域像素,从像素特征信息中解析区域像素的区域像素特征,对初始图像区域中各个区域像素的区域像素特征进行增强处理,得到目标区域图像,由于区域像素特征的强度可能会影响图像检索效果,当对初始图像区域中各个区域像素的区域像素特征进行增强处理,可以有效提升所得目标区域图像对主体图像的表征能力,从而提升图像检索过程的针对性和准确性。In this embodiment, by obtaining the regional pixels in the initial image area, parsing the regional pixel characteristics of the regional pixels from the pixel characteristic information, and performing enhancement processing on the regional pixel characteristics of each regional pixel in the initial image area, the target area image is obtained. Since The strength of regional pixel features may affect the image retrieval effect. When the regional pixel features of each regional pixel in the initial image area are enhanced, the representation ability of the obtained target area image to the subject image can be effectively improved, thereby improving the targeting of the image retrieval process. sex and accuracy.
图5是本公开另一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图。FIG. 5 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure.
如图5所示,该结合RPA和AI实现IA的图像检索方法,包括:As shown in Figure 5, the image retrieval method that combines RPA and AI to implement IA includes:
S501:基于机器人流程自动化RPA技术获取初始图像。S501: Obtain the initial image based on Robotic Process Automation (RPA) technology.
S502:确定初始图像的图像尺度信息。S502: Determine the image scale information of the initial image.
S503:确定初始图像的像素特征信息。S503: Determine the pixel feature information of the initial image.
S504:将图像尺度信息和像素特征信息作为图像描述信息。S504: Use image scale information and pixel feature information as image description information.
S505:基于人工智能AI技术从初始图像中截取初始区域图像。S505: Intercept the initial area image from the initial image based on artificial intelligence AI technology.
S501-S505的描述说明可以示例参见上述实施例,在此不再赘述。For descriptions of S501-S505, reference may be made to the above-mentioned embodiments and will not be described again here.
S506:根据图像尺度信息对初始区域图像进行扩大处理,得到待填充区域图像,其中,待填充区域图像包括:待填充像素。S506: Expand the initial area image according to the image scale information to obtain an image of the area to be filled, where the image of the area to be filled includes: pixels to be filled.
其中,待填充区域图像,是指基于图像尺度信息对初始区域图像进行扩大处理所得到的图像。Among them, the image of the area to be filled refers to the image obtained by enlarging the initial area image based on the image scale information.
其中,待填充像素,是指待填充区域图像中需要进行填充的像素。Among them, the pixels to be filled refer to the pixels in the image of the area to be filled that need to be filled.
本公开实施例中,在将图像尺度信息和像素特征信息作为图像描述信息,并基于人工智能AI技术从初始图像中截取初始区域图像之后,可以根据图像尺度信息对初始区域图像进行扩大处理,得到待填充区域图像,可以根据图像尺度信息对初始区域图像进行扩大处理,将初始区域图像的尺度调整为初始图像的尺度或其他任意适用于该图像检索过程的尺度数值,并将扩大处理后的初始区域图像作为待填充区域图像。In the embodiment of the present disclosure, after using the image scale information and pixel feature information as image description information, and intercepting the initial area image from the initial image based on artificial intelligence AI technology, the initial area image can be enlarged according to the image scale information to obtain For the area image to be filled, the initial area image can be enlarged according to the image scale information, and the scale of the initial area image is adjusted to the scale of the initial image or any other scale value suitable for the image retrieval process, and the initial area image after the enlargement process is The area image is used as the area image to be filled.
S507:从像素特征信息中解析待填充像素的第一像素特征。S507: Parse the first pixel feature of the pixel to be filled from the pixel feature information.
其中,第一像素特征,是指基于像素特征信息所获取的待填充像素的相关特征。The first pixel feature refers to the relevant feature of the pixel to be filled that is obtained based on the pixel feature information.
本公开实施例中,在根据图像尺度信息对初始区域图像进行扩大处理,得到待填充区域图像之后,可以从像素特征信息中解析待填充像素的第一像素特征,可以基于像素特征信息结合待填充像素进行匹配处理,并将像素特征信息中与待填充像素匹配的特征信息作为第一像素特征。In the embodiment of the present disclosure, after the initial area image is expanded according to the image scale information to obtain the area image to be filled, the first pixel feature of the pixel to be filled can be analyzed from the pixel feature information, and the first pixel feature of the pixel to be filled can be combined based on the pixel feature information. The pixels are matched, and the feature information in the pixel feature information that matches the pixel to be filled is used as the first pixel feature.
S508:从像素特征信息中解析其他区域图像中区域像素的第二像素特征,其中,初始区域图像和其他区域图像共同构成初始图像。S508: Parse the second pixel features of the regional pixels in other regional images from the pixel feature information, where the initial regional image and other regional images together constitute the initial image.
其中,第二像素特征,是指基于像素特征信息所获取的其他区域图像中区域像素的相关特征。The second pixel feature refers to the relevant features of regional pixels in other regional images obtained based on pixel feature information.
本公开实施例中,在从像素特征信息中解析待填充像素的第一像素特征之后,可以从像素特征信息中解析其他区域图像中区域像素的第二像素特征,可以基于像素特征信息结合其他区域图像中区域像素进行匹配处理,并将像素特征信息中与其他区域图像中区域像素匹配 的特征信息作为第二像素特征。In the embodiment of the present disclosure, after parsing the first pixel feature of the pixel to be filled from the pixel feature information, the second pixel feature of the regional pixels in the image of other regions can be parsed from the pixel feature information, and other regions can be combined based on the pixel feature information. The regional pixels in the image are matched, and the characteristic information in the pixel feature information that matches the regional pixels in other regional images is used as the second pixel feature.
S509:根据第一像素特征和第二像素特征,生成填充像素特征。S509: Generate filling pixel features based on the first pixel feature and the second pixel feature.
其中,填充像素特征,是指基于第一像素特征和第二像素特征所获取的像素特征,该填充像素特征,可以被用于作为待填充区域图像进行填充处理的参考依据。The filling pixel feature refers to the pixel feature obtained based on the first pixel feature and the second pixel feature. The filling pixel feature can be used as a reference for filling the image of the area to be filled.
本公开实施例中,在从像素特征信息中解析待填充像素的第一像素特征,从像素特征信息中解析其他区域图像中区域像素的第二像素特征之后,可以根据第一像素特征和第二像素特征,生成填充像素特征,可以利用预训练的机器学习模型对第一像素特征和第二像素特征进行解析处理,以生成填充像素特征。In the embodiment of the present disclosure, after parsing the first pixel feature of the pixel to be filled from the pixel feature information and parsing the second pixel feature of the regional pixels in other regional images from the pixel feature information, the method can be based on the first pixel feature and the second Pixel features are used to generate filled pixel features. A pre-trained machine learning model can be used to analyze the first pixel feature and the second pixel feature to generate filled pixel features.
S510:根据填充像素特征对待填充区域图像进行填充处理,得到目标区域图像。S510: Perform filling processing on the image of the area to be filled according to the characteristics of the filling pixels to obtain the target area image.
本公开实施例在根据第一像素特征和第二像素特征,生成填充像素特征之后,可以根据填充像素特征对待填充区域图像进行填充处理,得到目标区域图像,可以根据填充像素特征确定待填充像素,而后基于该待填充像素对对待填充区域图像进行填充处理,以得到目标区域图像。In the embodiment of the present disclosure, after generating the filling pixel characteristics according to the first pixel characteristics and the second pixel characteristics, the to-be-filled area image can be filled according to the filling pixel characteristics to obtain the target area image, and the pixels to be filled can be determined based on the filling pixel characteristics. Then, the area image to be filled is filled based on the pixels to be filled, so as to obtain the target area image.
本实施例中,将图像尺度信息和像素特征信息作为图像描述信息,通过根据图像尺度信息对初始区域图像进行扩大处理,得到待填充区域图像,从像素特征信息中解析待填充像素的第一像素特征,从像素特征信息中解析其他区域图像中区域像素的第二像素特征,根据第一像素特征和第二像素特征,生成填充像素特征,根据填充像素特征对待填充区域图像进行填充处理,得到目标区域图像,由此,可以在保证所得待填充区域图像的尺寸符合正常图像尺寸的同时,结合第一像素特征和第二像素特征对其进行填充处理,避免因为扩大处理而影响该图像的表征性,从而可以较大程度地提升所得目标区域图像的表征效果。In this embodiment, the image scale information and pixel feature information are used as image description information. The initial area image is expanded according to the image scale information to obtain the area image to be filled. The first pixel of the pixel to be filled is parsed from the pixel feature information. Features, analyze the second pixel features of area pixels in other area images from the pixel feature information, generate filling pixel features based on the first pixel features and second pixel features, fill the area image to be filled according to the filling pixel features, and obtain the target Therefore, while ensuring that the size of the obtained area image to be filled conforms to the normal image size, it can be filled by combining the first pixel feature and the second pixel feature to avoid affecting the representation of the image due to the enlargement process. , which can greatly improve the representation effect of the obtained target area image.
S511:根据目标区域图像,检索目标内容。S511: Retrieve target content based on the target area image.
S511的描述说明可以示例参见上述实施例,在此不再赘述。For the description of S511, reference may be made to the above-mentioned embodiment, and details will not be described again here.
本实施例中,通过根据图像尺度信息对初始区域图像进行扩大处理,得到待填充区域图像,从像素特征信息中解析待填充像素的第一像素特征,从像素特征信息中解析其他区域图像中区域像素的第二像素特征,根据第一像素特征和第二像素特征,生成填充像素特征,根据填充像素特征对待填充区域图像进行填充处理,得到目标区域图像,由此,可以在保证所得待填充区域图像的尺寸符合正常图像尺寸的同时,结合第一像素特征和第二像素特征对其进行填充处理,避免因为扩大处理而影响该图像的表征性,从而可以较大程度地提升所得目标区域图像的表征效果。In this embodiment, the initial region image is expanded according to the image scale information to obtain the region image to be filled, the first pixel feature of the pixel to be filled is parsed from the pixel feature information, and the regions in other region images are parsed from the pixel feature information. The second pixel feature of the pixel is used to generate a filling pixel feature based on the first pixel feature and the second pixel feature. The to-be-filled area image is filled according to the filling pixel feature to obtain the target area image. Thus, the obtained area to be filled can be guaranteed. While the size of the image conforms to the normal image size, the first pixel feature and the second pixel feature are combined to fill it to avoid affecting the representation of the image due to the enlargement process, thereby greatly improving the quality of the obtained target area image. representation effect.
图6是本公开另一实施例提出的结合RPA和AI实现IA的图像检索方法的流程示意图。FIG. 6 is a schematic flowchart of an image retrieval method for implementing IA by combining RPA and AI proposed by another embodiment of the present disclosure.
如图6所示,该结合RPA和AI实现IA的图像检索方法,包括:As shown in Figure 6, the image retrieval method that combines RPA and AI to implement IA includes:
S601:基于机器人流程自动化RPA技术获取初始图像。S601: Obtain the initial image based on Robotic Process Automation (RPA) technology.
S602:确定针对初始图像指定的处理参数信息。S602: Determine the processing parameter information specified for the initial image.
S603:将处理参数信息作为图像描述信息。S603: Use the processing parameter information as image description information.
S604:基于人工智能AI技术从初始图像中截取初始区域图像。S604: Intercept the initial area image from the initial image based on artificial intelligence AI technology.
S601-S604的描述说明可以示例参见上述实施例,在此不再赘述。For descriptions of S601-S604, reference may be made to the above-mentioned embodiments and will not be described again here.
S605:根据处理参数信息处理初始区域图像,得到目标区域图像。S605: Process the initial area image according to the processing parameter information to obtain the target area image.
其中,处理参数信息,是指指预先针对初始图像所指定的扩充倍数、填充颜色、亮度、色调、饱和度以及锐化程度等,对此不做限制。Among them, the processing parameter information refers to the expansion factor, fill color, brightness, hue, saturation, sharpening degree, etc. specified in advance for the initial image, and there is no limit to this.
本公开实施例中,在将处理参数信息作为图像描述信息,并基于人工智能AI技术从初始图像中截取初始区域图像之后,可以根据处理参数信息处理初始区域图像,得到目标区域图像,可以预先确定针对初始图像的处理参数信息(如指定的扩充倍数、填充颜色、亮度、色调、饱和度以及锐化程度等),而后基于该处理参数信息对初始区域图像的各项对应参数进行调整,以得到目标区域图像。In the embodiment of the present disclosure, after using the processing parameter information as image description information and intercepting the initial area image from the initial image based on artificial intelligence AI technology, the initial area image can be processed according to the processing parameter information to obtain the target area image, which can be predetermined Based on the processing parameter information of the initial image (such as the specified expansion factor, fill color, brightness, hue, saturation, sharpening degree, etc.), and then adjust the corresponding parameters of the initial area image based on the processing parameter information to obtain Target area image.
本实施例这种,将处理参数信息作为图像描述信息,通过根据处理参数信息处理初始区域图像,得到目标区域图像,由于处理参数信息可以根据用户配置指令进行对应配置,当基 于处理参数信息处理初始区域图像,可以实现根据应用场景对初始区域图像进行灵活处理,以得到适用于图像检索过程的目标区域图像,从而有效提升该初始区域图像处理过程的灵活性。In this embodiment, the processing parameter information is used as the image description information, and the target area image is obtained by processing the initial area image according to the processing parameter information. Since the processing parameter information can be correspondingly configured according to the user configuration instructions, when the initial area image is processed based on the processing parameter information, The regional image can flexibly process the initial regional image according to the application scenario to obtain the target region image suitable for the image retrieval process, thereby effectively improving the flexibility of the initial regional image processing process.
S606:根据目标区域图像,检索目标内容。S606: Retrieve target content based on the target area image.
S606的描述说明可以示例参见上述实施例,在此不再赘述。For the description of S606, reference may be made to the above-mentioned embodiments, and details will not be described again here.
本实施例中,通过根据处理参数信息处理初始区域图像,得到目标区域图像,由于处理参数信息可以根据用户配置指令进行对应配置,当基于处理参数信息处理初始区域图像,可以实现根据应用场景对初始区域图像进行灵活处理,以得到适用于图像检索过程的目标区域图像,从而有效提升该初始区域图像处理过程的灵活性。In this embodiment, the target area image is obtained by processing the initial area image according to the processing parameter information. Since the processing parameter information can be correspondingly configured according to the user configuration instructions, when the initial area image is processed based on the processing parameter information, the initial area image can be processed according to the application scenario. The regional image is flexibly processed to obtain a target region image suitable for the image retrieval process, thereby effectively improving the flexibility of the initial region image processing process.
图7是本公开一实施例提出的结合RPA和AI实现IA的图像检索装置的结构示意图。Figure 7 is a schematic structural diagram of an image retrieval device that combines RPA and AI to implement IA proposed by an embodiment of the present disclosure.
如图7所示,该结合RPA和AI实现IA的图像检索装置70,应用于自然语言处理NLP领域,包括:As shown in Figure 7, the image retrieval device 70 that combines RPA and AI to implement IA is applied in the field of natural language processing NLP, including:
获取模块701,用于基于机器人流程自动化RPA技术获取初始图像,其中,初始图像具有图像描述信息;The acquisition module 701 is used to acquire an initial image based on robotic process automation RPA technology, where the initial image has image description information;
第一处理模块702,用于基于人工智能AI技术从初始图像中截取初始区域图像;The first processing module 702 is used to intercept the initial area image from the initial image based on artificial intelligence AI technology;
第二处理模块703,用于根据图像描述信息处理初始区域图像,得到目标区域图像;The second processing module 703 is used to process the initial area image according to the image description information to obtain the target area image;
检索模块704,用于根据目标区域图像,检索目标内容。The retrieval module 704 is used to retrieve target content according to the target area image.
在本公开的一些实施例中,第一处理模块702,具体用于:In some embodiments of the present disclosure, the first processing module 702 is specifically used to:
调用自然语言处理NLP服务,以识别初始图像中的主体信息;Call the natural language processing NLP service to identify the subject information in the initial image;
根据主体信息,确定主体对应于初始图像中的位置描述信息;According to the subject information, determine that the subject corresponds to the position description information in the initial image;
从初始图像中截取与位置描述信息对应的区域图像作为初始区域图像。A region image corresponding to the position description information is intercepted from the initial image as the initial region image.
在本公开的一些实施例中,如图8所示,图8是本公开另一实施例提出的结合RPA和AI实现IA的图像检索装置的结构示意图,所述图像检索装置还包括:In some embodiments of the present disclosure, as shown in Figure 8, Figure 8 is a schematic structural diagram of an image retrieval device that combines RPA and AI to implement IA proposed by another embodiment of the present disclosure. The image retrieval device also includes:
确定模块705,用于确定初始图像的图像尺度信息;和/或确定初始图像的像素特征信息;和/或确定针对初始图像指定的处理参数信息;并将图像尺度信息、和/或像素特征信息、和/或处理参数信息作为图像描述信息。Determining module 705, used to determine the image scale information of the initial image; and/or determine the pixel feature information of the initial image; and/or determine the processing parameter information specified for the initial image; and combine the image scale information, and/or the pixel feature information , and/or process parameter information as image description information.
在本公开的一些实施例中,图像描述信息包括:图像尺度信息;In some embodiments of the present disclosure, the image description information includes: image scale information;
其中,第二处理模块703,具体用于:Among them, the second processing module 703 is specifically used for:
根据图像尺度信息对初始区域图像进行扩大处理;Expand the initial area image according to the image scale information;
将扩大处理后的图像作为目标区域图像。The enlarged image is used as the target area image.
在本公开的一些实施例中,图像描述信息包括:像素特征信息;In some embodiments of the present disclosure, the image description information includes: pixel feature information;
其中,第二处理模块703,还用于:Among them, the second processing module 703 is also used for:
获取初始图像区域中的区域像素;Get the area pixels in the initial image area;
从像素特征信息中解析区域像素的区域像素特征;Parse regional pixel features of regional pixels from pixel feature information;
对初始图像区域中各个区域像素的区域像素特征进行增强处理,得到目标区域图像。The regional pixel features of each area pixel in the initial image area are enhanced to obtain the target area image.
在本公开的一些实施例中,图像描述信息包括:图像尺度信息和像素特征信息;In some embodiments of the present disclosure, the image description information includes: image scale information and pixel feature information;
其中,第二处理模块703,还用于:Among them, the second processing module 703 is also used for:
根据图像尺度信息对初始区域图像进行扩大处理,得到待填充区域图像,其中,待填充区域图像包括:待填充像素;The initial area image is expanded according to the image scale information to obtain the area image to be filled, where the area image to be filled includes: pixels to be filled;
从像素特征信息中解析待填充像素的第一像素特征;Parse the first pixel feature of the pixel to be filled from the pixel feature information;
从像素特征信息中解析其他区域图像中区域像素的第二像素特征,其中,初始区域图像和其他区域图像共同构成初始图像;Parse second pixel features of regional pixels in other regional images from the pixel feature information, where the initial regional image and other regional images together constitute the initial image;
根据第一像素特征和第二像素特征,生成填充像素特征;Generate filling pixel features according to the first pixel feature and the second pixel feature;
根据填充像素特征对待填充区域图像进行填充处理,得到目标区域图像。The image of the area to be filled is filled according to the characteristics of the filled pixels to obtain the target area image.
在本公开的一些实施例中,图像描述信息包括:处理参数信息;In some embodiments of the present disclosure, the image description information includes: processing parameter information;
其中,第二处理模块703,还用于:Among them, the second processing module 703 is also used for:
根据处理参数信息处理初始区域图像,得到目标区域图像。The initial area image is processed according to the processing parameter information to obtain the target area image.
在本公开的一些实施例中,检索模块704,包括:In some embodiments of the present disclosure, the retrieval module 704 includes:
确定子模块7041,用于确定目标区域图像的语义表征信息;Determination sub-module 7041, used to determine the semantic representation information of the target area image;
检索子模块7042,用于根据语义表征信息,检索目标内容。The retrieval sub-module 7042 is used to retrieve target content based on semantic representation information.
在本公开的一些实施例中,确定子模块7041,具体用于:In some embodiments of the present disclosure, the determination sub-module 7041 is specifically used for:
从目标区域图像识别出目标物体轮廓;Identify the target object outline from the target area image;
根据目标物体轮廓,确定物体轮廓信息;According to the contour of the target object, determine the object contour information;
处理物体轮廓信息,得到轮廓向量表征;Process the object contour information and obtain the contour vector representation;
将轮廓向量表征作为语义表征信息。Contour vector representation as semantic representation information.
在本公开的一些实施例中,检索子模块7042,具体用于:In some embodiments of the present disclosure, the search sub-module 7042 is specifically used for:
确定与语义表征信息对应的候选相似度层级,其中,候选相似度层级属于预先构建的图状数据结构,候选相似度层级,是其相应所表征内容与初始图像之间的相似程度所属的层级;Determine the candidate similarity level corresponding to the semantic representation information, where the candidate similarity level belongs to a pre-constructed graph data structure, and the candidate similarity level is the level to which the similarity between the corresponding represented content and the initial image belongs;
将图状数据结构中候选相似度层级所表征内容作为目标内容。The content represented by the candidate similarity level in the graph data structure is used as the target content.
与上述图1至图6实施例提供的结合RPA和AI实现IA的图像检索方法相对应,本公开还提供一种结合RPA和AI实现IA的图像检索装置,由于本公开实施例提供的结合RPA和AI实现IA的图像检索装置与上述图1至图6实施例提供的结合RPA和AI实现IA的图像检索方法相对应,因此在结合RPA和AI实现IA的图像检索方法的实施方式也适用于本公开实施例提供的结合RPA和AI实现IA的图像检索装置,在本公开实施例中不再详细描述。Corresponding to the image retrieval method that combines RPA and AI to implement IA provided by the above embodiments of FIG. 1 to FIG. 6 , the present disclosure also provides an image retrieval device that combines RPA and AI to implement IA. Since the embodiment of the disclosure provides an image retrieval method that combines RPA with AI The image retrieval device that implements IA with AI corresponds to the image retrieval method that combines RPA and AI to implement IA provided in the above embodiments of Figures 1 to 6. Therefore, the implementation of the image retrieval method that combines RPA and AI to implement IA is also applicable to The image retrieval device that combines RPA and AI to implement IA provided by the embodiment of the present disclosure will not be described in detail in the embodiment of the present disclosure.
本公开实施例各装置中的各模块的功能可以参见上述方法中的对应描述,在此不再赘述。For the functions of each module in each device of the embodiment of the present disclosure, please refer to the corresponding description in the above method, and will not be described again here.
本实施例中,通过基于机器人流程自动化RPA技术获取初始图像,基于人工智能AI技术从初始图像中截取初始区域图像,根据图像描述信息处理初始区域图像,得到目标区域图像,根据目标区域图像,检索目标内容,能够利用RPA结合人工智能AI实现图像检索的智能自动化IA,能够在图像检索之前及时地对图像进行预处理,以去除图像中的干扰信息,有效提升所得目标区域图像在检索过程中的针对性,有效降低干扰信息对检索过程的影响,从而有效提升图像检索效率和图像检索结果的准确性。In this embodiment, the initial image is obtained based on Robotic Process Automation RPA technology, the initial area image is intercepted from the initial image based on artificial intelligence AI technology, the initial area image is processed according to the image description information, and the target area image is obtained, and the target area image is retrieved based on the target area image. Target content can use RPA combined with artificial intelligence AI to realize intelligent automation IA of image retrieval. It can preprocess images in time before image retrieval to remove interference information in the image and effectively improve the performance of the obtained target area image in the retrieval process. Targeted, effectively reduce the impact of interference information on the retrieval process, thereby effectively improving image retrieval efficiency and the accuracy of image retrieval results.
为了实现上述实施例,本公开还提出一种电子设备,包括:存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,处理器执行程序时,实现如本公开前述实施例提出的结合RPA和AI实现IA的图像检索方法。In order to implement the above embodiments, the present disclosure also proposes an electronic device, including: a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the program, the aforementioned embodiments of the present disclosure are implemented. The proposed image retrieval method combines RPA and AI to realize IA.
图9示出根据本公开一实施例的电子设备的结构框图。如图9所示,该电子设备90包括:存储器910和处理器920,存储器910内存储有可在处理器920上运行的计算机程序。处理器920执行该计算机程序时实现上述实施例中的结合RPA和AI实现IA的图像检索方法。存储器910和处理器920的数量可以为一个或多个。FIG. 9 shows a structural block diagram of an electronic device according to an embodiment of the present disclosure. As shown in FIG. 9 , the electronic device 90 includes: a memory 910 and a processor 920 . The memory 910 stores a computer program that can run on the processor 920 . When the processor 920 executes the computer program, it implements the image retrieval method for implementing IA by combining RPA and AI in the above embodiment. The number of memory 910 and processor 920 may be one or more.
该电子设备90还包括:The electronic device 90 also includes:
通信接口930,用于与外界设备进行通信,进行数据交互传输。The communication interface 930 is used to communicate with external devices and perform data interactive transmission.
如果存储器910、处理器920和通信接口930独立实现,则存储器910、处理器920和通信接口930可以通过总线相互连接并完成相互间的通信。该总线可以是工业标准体系结构(Industry Standard Architecture,ISA)总线、外部设备互连(Peripheral Component Interconnect,PCI)总线或扩展工业标准体系结构(Extended Industry Standard Architecture,EISA)总线等。该总线可以分为地址总线、数据总线、控制总线等。为便于表示,图9中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。If the memory 910, the processor 920 and the communication interface 930 are implemented independently, the memory 910, the processor 920 and the communication interface 930 can be connected to each other through a bus and complete communication with each other. The bus can be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus, etc. The bus can be divided into address bus, data bus, control bus, etc. For ease of presentation, only one thick line is used in Figure 9, but it does not mean that there is only one bus or one type of bus.
可选的,在具体实现上,如果存储器910、处理器920及通信接口930集成在一块芯片上,则存储器910、处理器920及通信接口930可以通过内部接口完成相互间的通信。Optionally, in specific implementation, if the memory 910, the processor 920 and the communication interface 930 are integrated on one chip, the memory 910, the processor 920 and the communication interface 930 can communicate with each other through the internal interface.
本公开实施例提供了一种计算机可读存储介质,其存储有计算机程序,该程序被处理器执行时实现本公开实施例中提供的方法。Embodiments of the present disclosure provide a computer-readable storage medium, which stores a computer program. When the program is executed by a processor, the method provided in the embodiment of the present disclosure is implemented.
本公开实施例还提供了一种芯片,该芯片包括,包括处理器,用于从存储器中调用并运行存储器中存储的指令,使得安装有芯片的通信设备执行本公开实施例提供的方法。An embodiment of the present disclosure also provides a chip, which includes a processor for calling and running instructions stored in the memory, so that the communication device installed with the chip executes the method provided by the embodiment of the present disclosure.
本公开实施例还提供了一种芯片,包括:输入接口、输出接口、处理器和存储器,输入接口、输出接口、处理器以及存储器之间通过内部连接通路相连,处理器用于执行存储器中的代码,当代码被执行时,处理器用于执行申请实施例提供的方法。Embodiments of the present disclosure also provide a chip, including: an input interface, an output interface, a processor, and a memory. The input interface, the output interface, the processor, and the memory are connected through an internal connection path. The processor is used to execute the code in the memory. , when the code is executed, the processor is used to execute the method provided by the application embodiment.
应理解的是,上述处理器可以是中央处理器(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processing,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者是任何常规的处理器等。值得说明的是,处理器可以是支持进阶精简指令集机器(Advanced RISC Machines,ARM)架构的处理器。It should be understood that the above-mentioned processor can be a central processing unit (Central Processing Unit, CPU), or other general-purpose processor, digital signal processor (Digital Signal Processing, DSP), application specific integrated circuit (Application Specific Integrated Circuit, ASIC), Field Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general-purpose processor can be a microprocessor or any conventional processor, etc. It is worth noting that the processor may be a processor that supports Advanced RISC Machines (ARM) architecture.
进一步地,可选的,上述存储器可以包括只读存储器和随机存取存储器,还可以包括非易失性随机存取存储器。该存储器可以是易失性存储器或非易失性存储器,或可包括易失性和非易失性存储器两者。其中,非易失性存储器可以包括只读存储器(Read-Only Memory,ROM)、可编程只读存储器(Programmable ROM,PROM)、可擦除可编程只读存储器(Erasable PROM,EPROM)、电可擦除可编程只读存储器(Electrically EPROM,EEPROM)或闪存。易失性存储器可以包括随机存取存储器(Random Access Memory,RAM),其用作外部高速缓存。通过示例性但不是限制性说明,许多形式的RAM可用。例如,静态随机存取存储器(Static RAM,SRAM)、动态随机存取存储器(Dynamic Random Access Memory,DRAM)、同步动态随机存取存储器(Synchronous DRAM,SDRAM)、双倍数据速率同步动态随机存取存储器(Double Data Date SDRAM,DDR SDRAM)、增强型同步动态随机存取存储器(Enhanced SDRAM,ESDRAM)、同步连接动态随机存取存储器(Synchlink DRAM,SLDRAM)和直接内存总线随机存取存储器(Direct Rambus RAM,DR RAM)。Further, optionally, the above-mentioned memory may include read-only memory and random access memory, and may also include non-volatile random access memory. The memory may be volatile memory or non-volatile memory, or may include both volatile and non-volatile memory. Among them, non-volatile memory can include read-only memory (Read-Only Memory, ROM), programmable read-only memory (Programmable ROM, PROM), erasable programmable read-only memory (Erasable PROM, EPROM), electrically removable memory. Erase programmable read-only memory (Electrically EPROM, EEPROM) or flash memory. Volatile memory may include Random Access Memory (RAM), which acts as an external cache. By way of illustration, but not limitation, many forms of RAM are available. For example, static random access memory (Static RAM, SRAM), dynamic random access memory (Dynamic Random Access Memory, DRAM), synchronous dynamic random access memory (Synchronous DRAM, SDRAM), double data rate synchronous dynamic random access Memory (Double Data Date SDRAM, DDR SDRAM), enhanced synchronous dynamic random access memory (Enhanced SDRAM, ESDRAM), synchronous link dynamic random access memory (Synchlink DRAM, SLDRAM) and direct memory bus random access memory (Direct Rambus RAM, DR RAM).
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行计算机程序指令时,全部或部分地产生按照本公开的流程或功能。计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输。In the above embodiments, it may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented using software, it may be implemented in whole or in part in the form of a computer program product. A computer program product includes one or more computer instructions. When computer program instructions are loaded and executed on a computer, processes or functions in accordance with the present disclosure are produced, in whole or in part. The computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable device. Computer instructions may be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包括于本公开的至少一个实施例或示例中。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, reference to the terms "one embodiment," "some embodiments," "an example," "specific examples," or "some examples" or the like means that specific features are described in connection with the embodiment or example. , structures, materials, or features are included in at least one embodiment or example of the present disclosure. Furthermore, the specific features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, those skilled in the art may combine and combine different embodiments or examples and features of different embodiments or examples described in this specification unless they are inconsistent with each other.
此外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或隐含地包括至少一个该特征。在本公开的描述中,“多个”的含义是两个或两个以上,除非另有明确具体的限定。In addition, the terms “first” and “second” are used for descriptive purposes only and cannot be understood as indicating or implying relative importance or implicitly indicating the quantity of indicated technical features. Thus, features defined as "first" and "second" may explicitly or implicitly include at least one of these features. In the description of the present disclosure, "plurality" means two or more than two, unless otherwise expressly and specifically limited.
流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为,表示包括一个或更多个用于实现特定逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分。并且本公开的优选实施方式的范围包括另外的实现,其中可以不按所示出或讨论的顺序,包括根据所涉及的功能按基本同时的方式或按相反的顺序,来执行功能。Any process or method descriptions in flowcharts or otherwise described herein may be understood to represent modules, segments, or portions of code that include one or more executable instructions for implementing the specified logical functions or steps of the process. . And the scope of the preferred embodiments of the present disclosure includes additional implementations in which functions may be performed out of the order shown or discussed, including in a substantially concurrent manner or in the reverse order, depending on the functionality involved.
在流程图中表示或在此以其他方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、 装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。The logic and/or steps represented in the flowcharts or otherwise described herein, for example, may be considered a sequenced list of executable instructions for implementing the logical functions, and may be embodied in any computer-readable medium, For use by instruction execution systems, devices or equipment (such as computer-based systems, systems including processors or other systems that can fetch instructions from and execute instructions from the instruction execution system, device or equipment), or in combination with these instruction execution systems, devices or equipment.
应理解的是,本公开的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。上述实施例方法的全部或部分步骤是可以通过程序来指令相关的硬件完成,该程序可以存储于一种计算机可读存储介质中,该程序在执行时,包括方法实施例的步骤之一或其组合。It should be understood that various parts of the present disclosure may be implemented in hardware, software, firmware, or combinations thereof. In the above embodiments, various steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution system. All or part of the steps of the method in the above embodiment can be completed by instructing relevant hardware through a program. The program can be stored in a computer-readable storage medium. When executed, the program includes one of the steps of the method embodiment or other steps. combination.
此外,在本公开各个实施例中的各功能单元可以集成在一个处理模块中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。上述集成的模块如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读存储介质中。该存储介质可以是只读存储器,磁盘或光盘等。In addition, each functional unit in various embodiments of the present disclosure may be integrated into one processing module, each unit may exist physically alone, or two or more units may be integrated into one module. The above integrated modules can be implemented in the form of hardware or software function modules. If the above integrated modules are implemented in the form of software function modules and sold or used as independent products, they can also be stored in a computer-readable storage medium. The storage medium can be a read-only memory, a magnetic disk or an optical disk, etc.
以上所述,仅为本公开的具体实施方式,但本公开的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本公开揭露的技术范围内,可轻易想到其各种变化或替换,这些都应涵盖在本公开的保护范围之内。因此,本公开的保护范围应以权利要求的保护范围为准。The above are only specific embodiments of the present disclosure, but the protection scope of the present disclosure is not limited thereto. Any person familiar with the technical field can easily think of various changes or modifications within the technical scope of the present disclosure. alternatives, these should all be covered by the protection scope of this disclosure. Therefore, the protection scope of the present disclosure should be subject to the protection scope of the claims.

Claims (20)

  1. 一种结合RPA和AI实现IA的图像检索方法,包括:An image retrieval method that combines RPA and AI to implement IA, including:
    基于机器人流程自动化RPA技术获取初始图像,其中,所述初始图像具有图像描述信息;Obtain an initial image based on Robotic Process Automation (RPA) technology, where the initial image has image description information;
    基于人工智能AI技术从所述初始图像中截取初始区域图像;Intercept the initial area image from the initial image based on artificial intelligence AI technology;
    根据所述图像描述信息处理所述初始区域图像,得到目标区域图像;Process the initial area image according to the image description information to obtain a target area image;
    根据所述目标区域图像,检索目标内容。Target content is retrieved based on the target area image.
  2. 如权利要求1所述的方法,其中,所述基于人工智能AI技术从所述初始图像中截取初始区域图像,包括:The method of claim 1, wherein the intercepting an initial area image from the initial image based on artificial intelligence AI technology includes:
    调用自然语言处理NLP服务,以识别所述初始图像中的主体信息;Call the natural language processing NLP service to identify the subject information in the initial image;
    根据所述主体信息,确定主体对应于所述初始图像中的位置描述信息;According to the subject information, it is determined that the subject corresponds to the position description information in the initial image;
    从所述初始图像中截取与所述位置描述信息对应的区域图像作为所述初始区域图像。A region image corresponding to the position description information is intercepted from the initial image as the initial region image.
  3. 如权利要求1或2所述的方法,其中,在所述基于机器人流程自动化RPA技术获取初始图像之后,所述方法还包括:The method according to claim 1 or 2, wherein after the initial image is obtained based on the robotic process automation RPA technology, the method further includes:
    确定所述初始图像的图像尺度信息;和/或Determine the image scale information of the initial image; and/or
    确定所述初始图像的像素特征信息;和/或Determine the pixel feature information of the initial image; and/or
    确定针对所述初始图像指定的处理参数信息;determining processing parameter information specified for the initial image;
    将所述图像尺度信息、和/或所述像素特征信息、和/或所述处理参数信息作为所述图像描述信息。The image scale information, and/or the pixel feature information, and/or the processing parameter information are used as the image description information.
  4. 如权利要求3所述的方法,其中,所述图像描述信息包括:所述图像尺度信息;The method of claim 3, wherein the image description information includes: the image scale information;
    其中,所述根据所述图像描述信息处理所述初始区域图像,得到目标区域图像,包括:Wherein, processing the initial area image according to the image description information to obtain a target area image includes:
    根据所述图像尺度信息对所述初始区域图像进行扩大处理;Expand the initial area image according to the image scale information;
    将扩大处理后的图像作为所述目标区域图像。The enlarged image is used as the target area image.
  5. 如权利要求3所述的方法,其中,所述图像描述信息包括:所述像素特征信息;The method of claim 3, wherein the image description information includes: the pixel feature information;
    其中,所述根据所述图像描述信息处理所述初始区域图像,得到目标区域图像,包括:Wherein, processing the initial area image according to the image description information to obtain a target area image includes:
    获取所述初始图像区域中的区域像素;Obtain regional pixels in the initial image region;
    从所述像素特征信息中解析所述区域像素的区域像素特征;Analyze the regional pixel characteristics of the regional pixels from the pixel characteristic information;
    对所述初始图像区域中各个所述区域像素的区域像素特征进行增强处理,得到所述目标区域图像。The regional pixel features of each regional pixel in the initial image region are enhanced to obtain the target region image.
  6. 如权利要求3所述的方法,其中,所述图像描述信息包括:所述图像尺度信息和所述像素特征信息;The method of claim 3, wherein the image description information includes: the image scale information and the pixel feature information;
    其中,所述根据所述图像描述信息处理所述初始区域图像,得到目标区域图像,包括:Wherein, processing the initial area image according to the image description information to obtain a target area image includes:
    根据所述图像尺度信息对所述初始区域图像进行扩大处理,得到待填充区域图像,其中,所述待填充区域图像包括:待填充像素;The initial area image is expanded according to the image scale information to obtain an area image to be filled, wherein the area image to be filled includes: pixels to be filled;
    从所述像素特征信息中解析所述待填充像素的第一像素特征;Parse the first pixel feature of the pixel to be filled from the pixel feature information;
    从所述像素特征信息中解析其他区域图像中区域像素的第二像素特征,其中,所述初始区域图像和所述其他区域图像共同构成所述初始图像;Parse second pixel features of regional pixels in other regional images from the pixel feature information, wherein the initial regional image and the other regional images together constitute the initial image;
    根据所述第一像素特征和所述第二像素特征,生成填充像素特征;Generate filling pixel features according to the first pixel feature and the second pixel feature;
    根据所述填充像素特征对所述待填充区域图像进行填充处理,得到所述目标区域图像。Perform filling processing on the image of the area to be filled according to the characteristics of the filling pixels to obtain the image of the target area.
  7. 如权利要求3所述的方法,其中,所述图像描述信息包括:所述处理参数信息;The method of claim 3, wherein the image description information includes: the processing parameter information;
    其中,所述根据所述图像描述信息处理所述初始区域图像,得到目标区域图像,包括:Wherein, processing the initial area image according to the image description information to obtain a target area image includes:
    根据所述处理参数信息处理所述初始区域图像,得到所述目标区域图像。The initial area image is processed according to the processing parameter information to obtain the target area image.
  8. 如权利要求1至7中任一项所述的方法,其中,所述根据所述目标区域图像,检索目标内容,包括:The method according to any one of claims 1 to 7, wherein retrieving target content according to the target area image includes:
    确定所述目标区域图像的语义表征信息;Determine the semantic representation information of the target area image;
    根据所述语义表征信息,检索所述目标内容。The target content is retrieved based on the semantic representation information.
  9. 如权利要求8所述的方法,其中,所述确定所述目标区域图像的语义表征信息,包括:The method of claim 8, wherein determining the semantic representation information of the target area image includes:
    从所述目标区域图像识别出目标物体轮廓;Identify the target object outline from the target area image;
    根据所述目标物体轮廓,确定物体轮廓信息;Determine object contour information according to the target object contour;
    处理所述物体轮廓信息,得到轮廓向量表征;Process the object contour information to obtain a contour vector representation;
    将所述轮廓向量表征作为所述语义表征信息。The contour vector representation is used as the semantic representation information.
  10. 如权利要求8或9所述的方法,其中,所述根据所述语义表征信息,检索目标内容,包括:The method of claim 8 or 9, wherein retrieving target content according to the semantic representation information includes:
    确定与所述语义表征信息对应的候选相似度层级,其中,所述候选相似度层级属于预先构建的图状数据结构,所述候选相似度层级,是其相应所表征内容与所述初始图像之间的相似程度所属的层级;Determine a candidate similarity level corresponding to the semantic representation information, wherein the candidate similarity level belongs to a pre-constructed graph data structure, and the candidate similarity level is the difference between its corresponding represented content and the initial image. The level of similarity between them;
    将所述图状数据结构中所述候选相似度层级所表征内容作为所述目标内容。The content represented by the candidate similarity level in the graph data structure is used as the target content.
  11. 一种结合RPA和AI实现IA的图像检索装置,包括:An image retrieval device that combines RPA and AI to implement IA, including:
    获取模块,用于基于机器人流程自动化RPA技术获取初始图像,其中,所述初始图像具有图像描述信息;An acquisition module, configured to acquire an initial image based on Robotic Process Automation (RPA) technology, where the initial image has image description information;
    第一处理模块,用于基于人工智能AI技术从所述初始图像中截取初始区域图像;A first processing module, configured to intercept an initial area image from the initial image based on artificial intelligence AI technology;
    第二处理模块,用于根据所述图像描述信息处理所述初始区域图像,得到目标区域图像;a second processing module, configured to process the initial area image according to the image description information to obtain a target area image;
    检索模块,用于根据所述目标区域图像,检索目标内容。A retrieval module, configured to retrieve target content based on the target area image.
  12. 如权利要求11所述的装置,其中,所述第一处理模块具体用于:The device according to claim 11, wherein the first processing module is specifically used for:
    调用自然语言处理NLP服务,以识别所述初始图像中的主体信息;Call the natural language processing NLP service to identify the subject information in the initial image;
    根据所述主体信息,确定主体对应于所述初始图像中的位置描述信息;According to the subject information, it is determined that the subject corresponds to the position description information in the initial image;
    从所述初始图像中截取与所述位置描述信息对应的区域图像作为所述初始区域图像。A region image corresponding to the position description information is intercepted from the initial image as the initial region image.
  13. 如权利要求11或12所述的装置,还包括:The device of claim 11 or 12, further comprising:
    确定模块,用于确定所述初始图像的图像尺度信息;和/或确定所述初始图像的像素特征信息;和/或确定针对所述初始图像指定的处理参数信息;并将所述图像尺度信息、和/或所述像素特征信息、和/或所述处理参数信息作为所述图像描述信息。a determining module, configured to determine the image scale information of the initial image; and/or determine the pixel feature information of the initial image; and/or determine the processing parameter information specified for the initial image; and convert the image scale information , and/or the pixel feature information, and/or the processing parameter information as the image description information.
  14. 如权利要求13所述的装置,其中,所述图像描述信息包括:所述图像尺度信息;The device of claim 13, wherein the image description information includes: the image scale information;
    其中,所述第二处理模块具体用于:Wherein, the second processing module is specifically used for:
    根据所述图像尺度信息对所述初始区域图像进行扩大处理;Expand the initial area image according to the image scale information;
    将扩大处理后的图像作为所述目标区域图像。The enlarged image is used as the target area image.
  15. 如权利要求13所述的装置,其中,所述图像描述信息包括:所述像素特征信息;The device of claim 13, wherein the image description information includes: the pixel feature information;
    其中,所述第二处理模块,还用于:Wherein, the second processing module is also used for:
    获取所述初始图像区域中的区域像素;Obtain regional pixels in the initial image region;
    从所述像素特征信息中解析所述区域像素的区域像素特征;Analyze the regional pixel characteristics of the regional pixels from the pixel characteristic information;
    对所述初始图像区域中各个所述区域像素的区域像素特征进行增强处理,得到所述目标区域图像。The regional pixel features of each regional pixel in the initial image region are enhanced to obtain the target region image.
  16. 如权利要求13所述的装置,其中,所述图像描述信息包括:所述图像尺度信息和所述像素特征信息;The device of claim 13, wherein the image description information includes: the image scale information and the pixel feature information;
    其中,所述第二处理模块,还用于:Wherein, the second processing module is also used for:
    根据所述图像尺度信息对所述初始区域图像进行扩大处理,得到待填充区域图像,其中,所述待填充区域图像包括:待填充像素;The initial area image is expanded according to the image scale information to obtain an area image to be filled, wherein the area image to be filled includes: pixels to be filled;
    从所述像素特征信息中解析所述待填充像素的第一像素特征;Parse the first pixel feature of the pixel to be filled from the pixel feature information;
    从所述像素特征信息中解析其他区域图像中区域像素的第二像素特征,其中,所述初始区域图像和所述其他区域图像共同构成所述初始图像;Parse second pixel features of regional pixels in other regional images from the pixel feature information, wherein the initial regional image and the other regional images together constitute the initial image;
    根据所述第一像素特征和所述第二像素特征,生成填充像素特征;Generate filling pixel features according to the first pixel feature and the second pixel feature;
    根据所述填充像素特征对所述待填充区域图像进行填充处理,得到所述目标区域图像。Perform filling processing on the image of the area to be filled according to the characteristics of the filling pixels to obtain the image of the target area.
  17. 如权利要求13所述的装置,其中,所述图像描述信息包括:所述处理参数信息;The device of claim 13, wherein the image description information includes: the processing parameter information;
    其中,所述第二处理模块,还用于:Wherein, the second processing module is also used for:
    根据所述处理参数信息处理所述初始区域图像,得到所述目标区域图像。The initial area image is processed according to the processing parameter information to obtain the target area image.
  18. 如权利要求11至17中任一项所述的装置,其中,所述检索模块,包括:The device according to any one of claims 11 to 17, wherein the retrieval module includes:
    确定子模块,用于确定所述目标区域图像的语义表征信息;Determining submodule, used to determine the semantic representation information of the target area image;
    检索子模块,用于根据所述语义表征信息,检索所述目标内容。The retrieval sub-module is used to retrieve the target content according to the semantic representation information.
  19. 一种电子设备,包括:An electronic device including:
    至少一个处理器和存储器;at least one processor and memory;
    所述存储器存储计算机执行指令;The memory stores computer execution instructions;
    所述至少一个处理器执行所述存储器存储的计算机执行指令,使得所述至少一个处理器执行如权利要求1-10任一项所述的结合RPA和AI实现IA的图像检索方法。The at least one processor executes the computer execution instructions stored in the memory, so that the at least one processor executes the image retrieval method for implementing IA by combining RPA and AI according to any one of claims 1-10.
  20. 一种计算机可读存储介质,其中,所述计算机可读存储介质中存储有计算机执行指令,当处理器执行所述计算机执行指令时,实现如权利要求1-10任一项所述的结合RPA和AI实现IA的图像检索方法。A computer-readable storage medium, wherein computer-executable instructions are stored in the computer-readable storage medium. When the processor executes the computer-executable instructions, the combined RPA method as described in any one of claims 1-10 is implemented. and AI implement IA image retrieval method.
PCT/CN2022/106339 2022-05-30 2022-07-18 Image retrieval method and apparatus for realizing achieving ia by combining rpa and ai, and electronic device WO2023231159A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210600925.4 2022-05-30
CN202210600925.4A CN114840700B (en) 2022-05-30 2022-05-30 Image retrieval method and device for realizing IA by combining RPA and AI and electronic equipment

Publications (1)

Publication Number Publication Date
WO2023231159A1 true WO2023231159A1 (en) 2023-12-07

Family

ID=82572882

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/106339 WO2023231159A1 (en) 2022-05-30 2022-07-18 Image retrieval method and apparatus for realizing achieving ia by combining rpa and ai, and electronic device

Country Status (2)

Country Link
CN (1) CN114840700B (en)
WO (1) WO2023231159A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103473570A (en) * 2013-08-16 2013-12-25 电子科技大学 Self-adaptive video scene pedestrian detection method
CN105989174A (en) * 2015-03-05 2016-10-05 欧姆龙株式会社 Extraction device and extraction method of area of interest
CN107316320A (en) * 2017-06-19 2017-11-03 江西洪都航空工业集团有限责任公司 The real-time pedestrian detecting system that a kind of use GPU accelerates
US20180204111A1 (en) * 2013-02-28 2018-07-19 Z Advanced Computing, Inc. System and Method for Extremely Efficient Image and Pattern Recognition and Artificial Intelligence Platform
CN112766272A (en) * 2021-01-15 2021-05-07 北京迈格威科技有限公司 Target detection method, device and electronic system
CN114299478A (en) * 2021-12-14 2022-04-08 北京来也网络科技有限公司 Image processing method and device combining RPA and AI and electronic equipment

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101853295B (en) * 2010-05-28 2011-12-07 天津大学 Image search method
CN103369342B (en) * 2013-08-05 2015-04-22 重庆大学 Method for inpainting and restoring processing of vacancy of DIBR (Depth Image Based Rendering) target image
US20180107660A1 (en) * 2014-06-27 2018-04-19 Amazon Technologies, Inc. System, method and apparatus for organizing photographs stored on a mobile computing device
CN105046197B (en) * 2015-06-11 2018-04-17 西安电子科技大学 Multi-template pedestrian detection method based on cluster
CN106126581B (en) * 2016-06-20 2019-07-05 复旦大学 Cartographical sketching image search method based on deep learning
CN110019904A (en) * 2017-10-13 2019-07-16 南京唯实科技有限公司 A kind of image search method based on big data
CN110807731B (en) * 2018-07-20 2023-06-30 宁波舜宇光电信息有限公司 Method, device, system and storage medium for compensating image dead pixel
CN109145763B (en) * 2018-07-27 2021-07-13 天津大学 Video monitoring pedestrian search image text fusion method based on natural language description
CN109145906B (en) * 2018-08-31 2020-04-24 北京字节跳动网络技术有限公司 Target object image determination method, device, equipment and storage medium
CN109829850B (en) * 2019-03-06 2023-04-28 百度在线网络技术(北京)有限公司 Image processing method, device, equipment and computer readable medium
US20210097290A1 (en) * 2019-09-27 2021-04-01 Gyrfalcon Technology Inc. Video retrieval in feature descriptor domain in an artificial intelligence semiconductor solution
US11182604B1 (en) * 2019-11-26 2021-11-23 Automation Anywhere, Inc. Computerized recognition and extraction of tables in digitized documents
US11373057B2 (en) * 2020-05-12 2022-06-28 Kyndryl, Inc. Artificial intelligence driven image retrieval
CN111639719B (en) * 2020-06-08 2023-04-07 安徽大学 Footprint image retrieval method based on space-time motion and feature fusion
CN112766302B (en) * 2020-12-17 2024-03-29 浙江大华技术股份有限公司 Image fusion method and device, storage medium and electronic device
CN113990432A (en) * 2021-10-28 2022-01-28 北京来也网络科技有限公司 Image report pushing method and device based on RPA and AI and computing equipment
CN114372173A (en) * 2022-01-11 2022-04-19 中国人民公安大学 Natural language target tracking method based on Transformer architecture

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180204111A1 (en) * 2013-02-28 2018-07-19 Z Advanced Computing, Inc. System and Method for Extremely Efficient Image and Pattern Recognition and Artificial Intelligence Platform
CN103473570A (en) * 2013-08-16 2013-12-25 电子科技大学 Self-adaptive video scene pedestrian detection method
CN105989174A (en) * 2015-03-05 2016-10-05 欧姆龙株式会社 Extraction device and extraction method of area of interest
CN107316320A (en) * 2017-06-19 2017-11-03 江西洪都航空工业集团有限责任公司 The real-time pedestrian detecting system that a kind of use GPU accelerates
CN112766272A (en) * 2021-01-15 2021-05-07 北京迈格威科技有限公司 Target detection method, device and electronic system
CN114299478A (en) * 2021-12-14 2022-04-08 北京来也网络科技有限公司 Image processing method and device combining RPA and AI and electronic equipment

Also Published As

Publication number Publication date
CN114840700B (en) 2023-01-13
CN114840700A (en) 2022-08-02

Similar Documents

Publication Publication Date Title
US20210081796A1 (en) Neural architecture search for dense image prediction tasks
US11348249B2 (en) Training method for image semantic segmentation model and server
US11416672B2 (en) Object recognition and tagging based on fusion deep learning models
WO2020098074A1 (en) Face sample picture marking method and apparatus, computer device, and storage medium
CN112232293B (en) Image processing model training method, image processing method and related equipment
WO2020024395A1 (en) Fatigue driving detection method and apparatus, computer device, and storage medium
CN111652974B (en) Method, device, equipment and storage medium for constructing three-dimensional face model
US20210342593A1 (en) Method and apparatus for detecting target in video, computing device, and storage medium
WO2022057309A1 (en) Lung feature recognition method and apparatus, computer device, and storage medium
CN109657675B (en) Image annotation method and device, computer equipment and readable storage medium
WO2023050651A1 (en) Semantic image segmentation method and apparatus, and device and storage medium
CN110807362A (en) Image detection method and device and computer readable storage medium
CN111242083A (en) Text processing method, device, equipment and medium based on artificial intelligence
WO2023284608A1 (en) Character recognition model generating method and apparatus, computer device, and storage medium
WO2019128735A1 (en) Imaging processing method and device
CN111444313B (en) Knowledge graph-based question and answer method, knowledge graph-based question and answer device, computer equipment and storage medium
CN112001399A (en) Image scene classification method and device based on local feature saliency
JP2022185143A (en) Text detection method, and text recognition method and device
US11232561B2 (en) Capture and storage of magnified images
KR20230132350A (en) Joint perception model training method, joint perception method, device, and storage medium
CN116821307B (en) Content interaction method, device, electronic equipment and storage medium
WO2020192523A1 (en) Translation quality detection method and apparatus, machine translation system, and storage medium
WO2023231159A1 (en) Image retrieval method and apparatus for realizing achieving ia by combining rpa and ai, and electronic device
CN111985471A (en) License plate positioning method and device and storage medium
WO2023066142A1 (en) Target detection method and apparatus for panoramic image, computer device and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22944473

Country of ref document: EP

Kind code of ref document: A1