WO2014158635A1 - Adaptive data path for computer-vision applications - Google Patents

Adaptive data path for computer-vision applications Download PDF

Info

Publication number
WO2014158635A1
WO2014158635A1 PCT/US2014/018943 US2014018943W WO2014158635A1 WO 2014158635 A1 WO2014158635 A1 WO 2014158635A1 US 2014018943 W US2014018943 W US 2014018943W WO 2014158635 A1 WO2014158635 A1 WO 2014158635A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
computer
processing unit
application
vision
Prior art date
Application number
PCT/US2014/018943
Other languages
French (fr)
Inventor
Khosro Mohammad Rabii
Francis Bernard Macdougall
Evan Robbert Hildreth
Original Assignee
Qualcomm Incorporated
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Incorporated filed Critical Qualcomm Incorporated
Priority to JP2016500456A priority Critical patent/JP6706198B2/en
Priority to BR112015022860A priority patent/BR112015022860A2/en
Priority to CN201480012145.1A priority patent/CN105190685B/en
Priority to EP14717531.9A priority patent/EP2973386B1/en
Priority to KR1020157027143A priority patent/KR102188613B1/en
Publication of WO2014158635A1 publication Critical patent/WO2014158635A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/617Upgrading or updating of programs or applications for camera control
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/002Specific input/output arrangements not covered by G06F3/01 - G06F3/16
    • G06F3/005Input arrangements through a video camera
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • G06T1/20Processor architectures; Processor configuration, e.g. pipelining

Definitions

  • the imaging core includes a collection of hardware and/or software components providing a data path through which data can flow from an image sensor (e.g., camera) to an application processing unit, such as a general purpose processor, and/or a display.
  • an image sensor e.g., camera
  • an application processing unit such as a general purpose processor, and/or a display.
  • the imaging core is also utilized for computer-vision applications (also known as "machine-vision applications").
  • Embodiments of the present invention provide an adaptive data path for computer-vision applications.
  • the data path can adapt to a computer-vision application to provide data.
  • the data path can be adapted by applying one or more filters to image data from one or more sensors.
  • Some embodiments may utilize a computer- vision processing unit comprising a specialized instruction-based, in-line processor capable of interpreting commands from a computer- vision application.
  • An example apparatus for providing an adaptive data path for computer-vision applications includes an application processing unit and a processing unit, separately programmable from the application processing unit, and communicatively coupled to an image sensor module and the application processing unit.
  • the processing unit is configured to receive a first image from the image sensor module, select a first subset of image processing functions from a plurality of image processing functions, based on a first computer-vision application executed by the application processing unit, and process the first image using the first subset of image processing functions.
  • An example method for providing an adaptive data path for computer-vision applications includes receiving a first image, selecting a first subset of image processing functions from a plurality of image processing functions, based on a first computer-vision application executed by an application processing unit, and processing the first image, using the first subset of image processing functions.
  • the processing occurs in a unit separate from the application processing unit.
  • An example processor for providing an adaptive data path for computer-vision applications includes means for receiving a first image, means for selecting a first subset of image processing functions from a plurality of image processing functions, based on a first computer-vision application executed by an application processing unit, and means for processing the first image, using the first subset of image processing functions.
  • the means for processing the first image are separate from the application processing unit.
  • An example non-transitory computer-readable medium is encoded with instructions that, when executed, operate to cause a processing unit to perform operations including receiving a first image, selecting a first subset of image processing functions from a plurality of image processing functions, based o a first computer- vision application executed by an application processing unit, and processing the first image, using the first subset of image processing functions.
  • An example method includes receiving at an image signal processor intonnation derived from an image captured by an optical sensor, and receiving at the image signal processor an indicator of data for output. The method further includes processing the received information based on the received indicator, and outputting the processed information.
  • the method can include one or more of the following features.
  • the indicator of data for output can include an indication of an application for which data is being generated or a type of application for which data is being generated.
  • the indicator of data for output can include an indication of a type of data requested.
  • the processing can include reducing an amount or type of the received information, and/or comparing the received information to desired features and removing portions of the received information that are irrelevant to the desired features. Additionally or alternatively, the processing can include comparing the received information to known information, and outputting the processed information can include outputting an alert when the received information differs from the known information.
  • Another example method can include receiving at an image signal processor information from an optical sensor, and receiving at the image signal processor an identification of a computer vision application, a type of computer vision application, or a state of a computer vision application.
  • the method can further include selecting a computer vision module of the image signal processor based on the identification, and processing the mformation at the image signal processor with the selected computer vision module.
  • Yet another example method can include receiving at an image signal processor information derived from an image captured by an optical sensor, processing the received information, and outputting the processed information from the image signal processor to an application processor based on an application or application state which will consume the processed information.
  • the method can include one or more of the following features.
  • a type or amount of the processed mformation can be dependent on the application or application state.
  • the application can include a computer vision application.
  • the processed information can include pixel data for further processing by the application, and/or an interrupt or wakeup for the application processor.
  • the application processor can be asleep (e.g., in a low -power state) while the image signal processor is processing the received information.
  • the method can further include selecting a data path in the image signal processor based on the application or application state, where the processing comprises processing the received information using the selected data path.
  • Items and/or techniques described herein may provide one or more of the following capabilities, as well as other capabilities not mentioned. Techniques can provide for increased efficiency by adapting the data path to computer- vision applications, thereby reducing processing overhead. Furthermore, by offloading computer-vision funct onality from an application processing unit to the data path, the application processing unit can enter a low-power mode of operation while the data path performs the offloaded functions, further reducing power and processing demands.
  • FIG. 1 is a simplified block diagram illustrating certain components of a device, according to one embodiment.
  • FIG. 2 is a functional block diagram illustrating a data path for providing data from a sensor module to computer- vision applications and services, according to one embodiment.
  • FIG. 3 is a diagram of a set of available filters of a computer-vision specialization module, according to one embodiment.
  • FIG. 4 is a block diagram illustrating how components of the computer-vision specialization module of FIG. 3 can be grouped into different functional portions of the data path.
  • FIGS. 5 A and 513 are diagrams illustrating how the computer- vision specialization can employ different filter chains, adaptively adjusting the data path based on the needs of the computer- vision applications and sendees.
  • FIG. 6 is an illustration of a data path using data switching to provide data ports for different segments of data, according to one embodiment.
  • FIG. 7 illustrates an embodiment of a method for providing an adaptive data path for computer- vision applications, according to one embodiment.
  • computer- vision applications can include a wide variety of applications providing an equally-wide spectrum of functionality.
  • computer- vision applications can provide one or more of the following functions:
  • Visual object feature detection e.g., colors, proximity, motion, etc, of a visual object
  • Some computer- vision applications may provide other functions in addition or as an alternative to these functions, indeed, as the sensing and processing capabilities of mobile devices continue to grow, additional functions will likely emerge.
  • a mobile device may switch operation between active states, requiring a relatively large amount of processing, and states requiring relativ ely little amount of processing.
  • Low- processing states are, for example, states in which the computer-vision applications do not receive features of interest from the data path that would require relatively high amounts of processing.
  • efficient execution of both high- and low-processing states can provide longer battery life.
  • the imaging core for mobile devices typically has a fixed data path with functionality favoring imaging applications; the data path being configured, for example, to provide images with high resolutions and/or at high frame rates.
  • the data path being configured, for example, to provide images with high resolutions and/or at high frame rates.
  • traditional mobile devices will often provide data paths with much more overhead than necessary.
  • data paths in traditional mobile devices often will not provide filtering needed by computer- vision applications, much of this filtering takes place on a general application processing unit. This can not only adversely affect the electronic device's battery life, but also a user's quality of experience (QOE).
  • QOE quality of experience
  • Embodiments disclosed herein provide for an adaptive data path for computer-vision applications that can adjust to requirements of computer-vision applications, offloading filters for computer-vision applications from an application processing unit to the data path, and providing different data (and different types of data) to the application processing unit, depending on the computer-vision applications. This ultimately provides for increased battery life and/or QOE.
  • FIG. I is a block diagram illustrating certain components of a device 100 having an adaptive data path for computer-vision applications, according to one embodiment.
  • the components can include sensor(s) 1 15, a sensor controller 1 17, an application processing unit 130, graphics processing unit 140, display 170, display controller 160, memory subsystem 150, and a computer-vision processing unit 120.
  • the device 100 can include various additional components that are not shown, such as a communication interface, user interface, and more.
  • FIG. 1 is meant only to provide a generalized illustration of various components, any or all of which may be utilized as appropriate. Different embodiments may add, omit, substitute, combine, and/or divide any of the components shown. A person of ordinary skill in the art will recognize many alterations.
  • the device 100 comprises a mobile device.
  • the device 100 may comprise a mobile phone, tablet, personal media player, portable gaming system, digital camera, camcorder, and the like. It can be rioted that, although techniques provided herein can be utilized on mobile devices to provide power savings and/or other advantages such as reduced latency in some embodiments, the device 100 is not so iimiied. That is, embodiments may include devices 100 and may not be considered mobile.
  • a television may include a data path for imaging applications (e.g., a video conferencing application) and a data path for machine-vision (e.g., a gesture control application).
  • a game system may include a data path for imaging applications (e.g., in-game video chat) and a data path for machine- vision (e.g., gesture and/or body control). Other embodiments involving non-mobile devices are contemplated.
  • the sensor(s) 1 15 can include one or more of a variety of sensors, depending on the functionality of the device 100, This can include sensors that are not associated with camera or camcorder functions.
  • the sensor(s) could include, for example, sensor(s) for detecting infrared (IR) and/or near IR, sensor(s) for determining different colorimetry, sensor(s) for determining depth (e.g., time-of-flight, structured-light, etc.), as well as one or more cameras (e.g., stereoscopic cameras, front- and/or back-facing cameras, etc.), and/or other imaging sensor(s).
  • IR infrared
  • near IR IR
  • sensor(s) for determining depth e.g., time-of-flight, structured-light, etc.
  • sensors e.g., time-of-flight, structured-light, etc.
  • cameras e.g., stereoscopic cameras, front- and/or back-facing cameras, etc.
  • other imaging sensor(s) e.g., stereoscopic cameras, front- and/or back-facing cameras, etc.
  • the sensor controller 1 17 can provide an interface with the sensor(s) 1 15.
  • the sensor controller 1 17 can control the sensor(s) 1 15 based on input from other components, such as the application processing unit 130 and/or the computer-vision processing unit 120.
  • the sensor controller 1 17 can provide, for example, while-balance control, exposure control, focus control, binning & skip control, 2D/3D windowing/boxing, and/or other functions.
  • the sensor(s) 115 and/or sensor controller 1 17 can form at least a portion of an image sensor module 1 10 of the device 100.
  • the application processing unit 130 can include without limitation one or more general-purpose processors, one or more special-puipose processors (such as digital signal processors, graphics acceleration processors, and/or the like), and/or other processing structure, which can be configured to execute computer-vision and other software applications, which may be stored in the memor subsystem 150.
  • the application processing unit 130 comprises an application or "apps" processor configured with one or more cores.
  • the memory subsystem 150 can include one or more non-transitory computer- readable storage media providing application and/or system storage.
  • Such storage devices can include a disk drive, a drive array, an optical storage device, a solid-state storage device, such as a random access memory (“RAM”), and/or a read-only memory (“ROM”), which can be programmable, flash-updateable, and/or the like.
  • RAM random access memory
  • ROM read-only memory
  • Such storage devices may be configured to implement any appropriate data stores, including without limitation, various file systems, database structures, and/or the like.
  • Software elements can be included in the memory subsystem as instructions embedded on the one or more non-transitory computer-readable storage media.
  • Such software elements can include an operating system, device drivers, executable libraries, and/or other code, such as one or more application programs, which may be designed to cause the application processing unit 130, computer- vision processing unit 120, a d'or other components to implement methods (or portions thereof), and/or configure systems, as provided by embodiments described herein.
  • one or more procedures described herein, including the method described with respect to FIG. 7, might be implemented as code and'or instructions executable by the computer-vision processing unit 120 and/or application processing unit 130.
  • the graphics processing unit 140 and display controller 160 can be used to show one or more images on a display 170, depending on the application(s) executed by the application processing unit 130. It can be noted that imaging applications such as video or still image capture typically display images, and therefore require the utilization of the display 170, display controller 160, and'or graphics processing unit 140. Many computer-vision applications, on the other hand, may not need to utilize these components at all. Accordingly, these components of the device 100 can be included or excluded from the data path to adapt to computer-vision applications executed by the application processing unit 130.
  • the computer- vision processing unit 120 can include hardware and/or software subcomponents for providing an adaptive data path from the sensor(s) 1 15 to the application processing unit 130.
  • This can include, for example, a video front end (VFE), image-signal processor (ISP), digital-signal processor (DSP), and'or other processing unit separately programmable from the application processing unit.
  • VFE video front end
  • ISP image-signal processor
  • DSP digital-signal processor
  • the computer-vision processing unit 12.0 can read from and/or write to the memory subsystem 150.
  • the computer-vis on processing unit 120 can be dynamically programmed, based on a computer-vision application executed by the application processing unit 130. Accordingly, the computer-vision processing unit 120 may execute applications stored in the memory subsystem 150 and/or a memory internal to the computer-vision processing unit 120 (which can include features similar to those described above in relation to the memory subsystem 150).
  • the computer- is on processing unit 120 may be able to provide the adaptive data path without the use of the memory subsystem 150, thereby enabling the memory subsystem 150 to enter a low- power mode in addition to the application processing unit 130,
  • the computer- vision processing unit 120 can comprise a specialized instruction-based, in-line processor adapted to implement various filters for computer-vision applications.
  • the computer-vision applications can call the instructions to implement the tllter(s) needed without the need to utilize an interpretive engine in some embodiments.
  • filters implemented by the computer- vision processing unit 12.0 can correspond to a set of standardized instruction-based commands tied to an interpretive language that is translated, in real time, by the specialized instruction-based, in-line processor to identify the needed fiiter(s) for the computer- vision application,
  • Techniques provided herein for an adaptive data path for computer-vision applications can enable programmability controlled at an application layer. That is, a computer-vision application executed by the application processing unit 130 can provide instructions and/or other information to the computer-vision processing unit 120 at any time in some embodiments. Such functionality grants the application more power, allowing for a dynamically programmable computer- vision processing unit 120 that provides computer-vision data in an efficient manner, without any additional knowledge of the context in which the computer-vision data is used.
  • the computer- vision application can determine the context and pro vide instructions to the computer- vision processing unit 120 accordingly.
  • embodiments can allow computer-vision applications to calf instructions to implement desired filter(s) without the need to utilize an interpretive engine.
  • FIG. 2 is a functional block diagram illustrating a data path 200 providing data from a sensor module 210 to computer-vision applications and services 240, according to one embodiment.
  • the data path 200 can be executed by one or more components of the device 100 of FIG. 1 , and/or similar means.
  • the data path 200 can include a core data path 220 and a computer-vision specialization module 230.
  • Other embodiments may vary from the embodiment shown by utilizing different competent in a similar functional manner. A person of ordinary skill in the art will recognize many alterations.
  • the sensor module 210 can comprise sensors and/or other components that output sensor data, such as the image sensor module 1 10 of FIG. 1 , and/or similar means. Depending on the functionality of the device 100 and/or the capabilities of the sensor(s) of the sensor module 210, the data from the sensor module can vary. The data from the sensor module 210 is then passed to the core data path 220.
  • the functionality of the core data path 220 can provide an interface between the sensor module 210 and the computer- vision specialization module 230. It may be adapted to accommodate specific sensor data of the sensor module 210 and output the data in a uniform manner, according to some embodiments. Furthermore, the core data path 220 may perform some initial processing of the data of the sensor module 210, employing, for example, certain filters that are used in imaging applications, computer- vision applications, and/or other common applications executable by the device 100. Such filters can include, for example, optimization filters for lighting, color, exposure, focus, and the like.
  • the core data path 2.20 may be implemented by portions of the computer- vision processing unit 120 and/or image sensor module 1 10 of FIG.
  • the computer-vision specialization module 230 comprises an adaptable component of the data path 200 that can be executed by the computer-vision processing unit 12.0 and/or memory subsystem 150 of FIG. 1.
  • the computer-vision specialization module 230 is configured to determine an input for computer-vision applications and services 240 and adapt the data path accordingly, thereby adapting the data from the sensor module 210 to the input for the computer- vision applications and services 240.
  • the computer-vision specialization module 230 can determine the input based on a requirement or need of one or more of the computer- vision applications and services 240 or based on an instruction therefrom or an interface thereof, in particular, the computer-vision specialization module 230 can receive an image (or other data) from the sensor module 210, via the core data path 220, and process the image using at least a subset of available image processing functions (also referred to herein as "filters") based on the needs of the computer-vision applications and services 240.
  • the computer- vision specialization moduie 230 or portions of its functionality is implemented by an ISP.
  • the computer-vision applications and services 240 can be executed by the application processing unit 130 and/or the memory subsystem 150 of FIG. 1 , and/or similar means. As discussed in more detail below, the computer-vision applications and services 240 can provide an indication of desired data or information to the computer- vision specialization module 230 by providing the computer- vision specialization module 230 with a reference (an image, command, etc.). As disclosed previously, computer-vision applications (and the needs thereof) can vary substantially, depending on application. It will be understood that computer-vision applications and services 240, as described herein, can include one or more computer-vision applications and/or computer-vision sendees. Where multiple computer-vision applications and/or services have many needs, the computer-vision specialization moduie 230 can provide an optimal data path to serve the needs of all of them in some embodiments.
  • FIG. 3 illustrates a set of available filters 315-355 of a computer- vision specialization moduie 230, according to one embodiment.
  • the filters are at least partially implemented in hardware, providing faster, more-efficient filtering than if executed in software alone. Further, implementation of such filters in the computer-vision specialization module 230 instead of in the computer-vision applications and services 240 and/or the application processing unit 130 may increase processing speed and/or reduce power consumption in some embodiments.
  • FIG. 3 illustrates the filters as providing a certain data path from an input 310 received from the core data path 22.0 to an output 375 provided to the computer- vision applications and sendees 240 in a particular order, the order can he altered, depending on application.
  • filters can be added, omitted, substituted, combined, and/or di vided depending on application. Furt her, any one or more of the filters may be selected for use for a given function or process without necessitating the use of other filters in the computer-vision specialization module 230 in some embodiments.
  • the computer-vision applications and sendees 2.40 can provide a reference 360 via an input 365.
  • the reference can be, for example, a reference image.
  • the reference can, additionally or alternati ely, include information about an image (e.g., a histogram providing a color distribution for an image).
  • the reference can include a command or set of instructions from the computer-vision applications and services 240, which may adhere to a certain protocol and/or instruction set.
  • a reference may be obtained by the computer-vision specialization module 230 directly from the core data path 22.0 in addition or as an alternative to receiving a reference from computer-vision applications and services 240. Once the reference is obtained, the computer-vision specialization module 230 can store it in memory.
  • Dynamic view selection 325 may be configured to select a certain view for subsequent processing.
  • a field of view can be sampled by multiple image-sensors (e.g., at the least two for stereo- imaging).
  • the field of view from several image-sensors can be dynamically adjusted to minimize the required processing for computer-vision.
  • hand-gesture recognition may track a hand, the image of which is captured by multiple image sensors.
  • Dynamic view selection 32.5 can select, from the plurality of images, the best image for hand tracking.
  • the dynamic view selection 325 may- track a fiducial marker and select the image with the best vie w of (e.g.
  • View narrowing 315 and view localization 320 may be configured to remove portions of an image an application is not interested in. For example, a particular computer -vision application may be interested in a particular object thai is placed in one quadrant of a 2-megapixei image. In such a case, only a portion of the image (e.g., 500,000 pixels) would need to be processed and passed to the computer-vision applications and services 240.
  • a portion of the image e.g., 500,000 pixels
  • the computer-vision specialization module 230 can be adapted to employ view narrowing 315 to, for example, isolate the relevant portion of the image and/or estimate the number of pixels and/or lines that represent the object.
  • view narrowing 315 can be employed to, for example, isolate the relevant portion of the image and/or estimate the number of pixels and/or lines that represent the object.
  • the computer-vision specialization module 230 can adapt accordingly.
  • the computer-vision specialization module 230 can use view narrowing 315 and'Or view localization 320 to provide different resolutions at different times.
  • Variable-angle rotation 345 may be configured to rotate an image and/or object within an image with respect to a sensor capturing the image. For example, a computer- vision application may only identify a rectangle in an image when the rectangle is in a certain orientation respect to a sensor. Rather than forcing a computer- vision application to execute algorithms for adjusting the image's orientation (e.g., running the algorithms on the application processing unit 130), the computer- vision specialization module 230 can adapt the data path to rotate all or a portion of the image using the variable-angle rotation 345.
  • Color transformation 350 can be implemented in the data path when, for example, the computer- vision applications and services 240 indicate a need for color-recognition, and/or manipulation.
  • Frame-drop control 340 may be configured to reduce the amount of images (frames) delivered to the computer-vision applications and services 240, thereby reducing bandwidth and power needs. For example, once programmed, a sensor of the sensor module 210 may deliver a certain amount of frames per second. However, if this amount is larger than what the computer- vision applications and services 240 needs, the data path can be configured to drop any extra frames using the frame-drop control 340.
  • Scene change detection & motion estimation 355 may be configured to determine whether a change in sensor input has occurred. The change can be determined, for example, by comparing a current image with a past image. As indicated previously, a past image may be maintained as a reference 360, retrieved from the data path or received from the computer- vision applications and sendees 240 via input 365.
  • the computer-vision specialization module 230 may also include one or more filters configured to identify and/or output features, and/or one or more modules configured to track elements or features and'Or output a position thereof.
  • the computer- vision specialization module 230 may comprise one or more functions configured to extract features from an image such as a Scale Invariant Feature Transfonn (SIFT), PhonySIFT, Speeded-up Robust Features (SURF), and/or Features from Accelerated Segment Test (FAST) comer detector.
  • SIFT Scale Invariant Feature Transfonn
  • PhonySIFT PhonySIFT
  • Speeded-up Robust Features SURF
  • FAST Accelerated Segment Test
  • the functions may include one or more functions to compute spectral features or characteristics.
  • the computer-vision specialization module 230 can also include a computer- vision interrupt processor 370. With the computer-vision interrupt processor 370, a computer-vision specialization module 230 can allow the underlying hardware executing the computer- vision applications and services 240 (e.g., an application processing unit 130 and'Or memory subsystem 150) to enter a low-power (e.g., "standby") mode. When the computer-vision specialization module 230 determines that a triggering event has taken place, the computer-vision specialization module 230 can utilize the computer-vision interrupt processor 370 to provide an interrupt to underlying hardware to cause the underlying hardware to exit the low-power mode.
  • a computer-vision interrupt processor 370 can allow the underlying hardware executing the computer- vision applications and services 240 (e.g., an application processing unit 130 and'Or memory subsystem 150) to enter a low-power (e.g., "standby") mode.
  • the computer-vision specialization module 230 determines that a triggering event has taken place, the computer-vision special
  • the computer- vision interrupt processor 370 can allow the computer- vision specialization module 230 to perform a variety of functions in the data path, while requiring little or no processing from the underlying hardware executing the computer-vision applications and services 240.
  • computer -vision applications and services 240 may include a security application in which the device 100 monitors an area with a camera and sounds an alarm if it detects a change in the monitored area.
  • the underlying hardware executing the computer-vision applications and services 240 can enter a standby mode while the computer-vision specialization module 230 employs scene change detection & motion estimation 355 to compare sensor input with a reference image of the monitored area. If the scene change detection & motion estimation 355 determines that there is a change in the monitored area (e.g., someone or something enters the scene viewed by the camera), it can notify the computer- vision interrupt processor 370. The computer-vision interrupt processor 370, in turn, can generate an interrupt for the underlying hardware executing the computer-vision applications and services 240, which can then cause the security application to sound the alarm.
  • the computer-vision interrupt processor 370 can generate an interrupt for the underlying hardware executing the computer-vision applications and services 240, which can then cause the security application to sound the alarm.
  • the device 100 is able execute the security application in an efficient manner without unnecessary- processing by the underlying hardware executing the computer- vision applications and services 240.
  • FIG. 4 is a block diagram 400 illustrating how components of the computer- vision specialization module 230 can be grouped into different functional portions of the data path for computer- vision applications and services 240.
  • components may be grouped into the following functional portions: an input layer 420, primitive recognition 430, and structure recognition 440.
  • Aspects of the data path not included in the computer-vision specialization module 230 can also contribute to the functionality of the input layer 420, primitive recognition 430, and/or structure recognition 440, depending on desired functionality. That is, the core data path 220 can provide an input 310 for which portions of the input layer 420, primitive recognition 430, and/or structure recognition 440 may have already been performed.
  • the input layer 420 comprises a portion of the data path that brings in data from the sensor module 210 while potentially reducing the required processing in subsequent computer-vision processing blocks.
  • the input layer 420 may be configured with one or more image-processing filters including, for example, color transformation 350, color filtering and masking 330, dynamic view selection 325, view localization 320, view narrowing 315, variable-angle rotation 345, color transformation 350, color filtering & masking 330, and/or frame-drop control 340, and the like.
  • the input layer 420 can provide image-processing filters optimizing environment lighting, color, exposure, and/or focus, as well as filters configured for feature extraction, and/or other filters.
  • 00S4J Th e primitive recognition 430 portion of the data path can be configured based on the needs of the computer-vision applications and services 240, This can include determining whether certain features are present in the data from the sensor module 2.10, For instance, if the computer-vision applications and sendees 2.40 include hand gesture recognition, the primitive recognition 430 could be configured to recognize a hand. For applications interested in certain colors, the primitive recognition 430 could be configured to examine the colorimetry of the data from the sensor module 210 to determine if one or more colors are present. To provide this functionality, the primitive recognition 430 can be configured with one or more image-processing filters including, for example, view localization 320 and/or view narrowing 315, and the like.
  • the primitive recognition 430 can be configured with image-processing filters including segmentation, posture detection, and/or other filters.
  • the structure recognition 440 can be configured to track features in the data from the sensor module 210 over time and define structures, which can also be based on the needs of the computer- vision applications and services 240.
  • the features in the data can include identifiable features such as edges, corners, SIFT (scale-invariant featitre transform), and the like. Structures defined by the tracked features can include, for example, symbols and/or vision gestures.
  • the structure recognition 440 can be configured wiih one or more image-processing filters including, for example, scene change detection & motion estimation 355, frame-drop control 340, histogram analysis 335, and the like.
  • the structure recognition 440 can be configured with image-processing filters including tracking, prediction, gesture detection, and/or other filters.
  • the tracked features can vary depending on the needs of the application requesting and/or consuming the structure recognition.
  • a gesture-based application for example, may require tracking of a hand to determine gestures
  • a context-aware application or augmented reality application on the other hand, may track any of a variety of objects for object recognition.
  • FIGS, 5A and 5B provide examples of how the computer-vision specialization can employ different filter chains, adaptively adjusting the data path 200 based on the needs of the computer- ision applications and services 240. In FIG.
  • the computer- vision specialization module 230 adapts the data path to a security application by utilizing a filter chain comprising histogram analysis 335, scene change detection & motion estimation 355, and computer- vision interrupt processor 370 to provide an output 375 to the computer-vision applications and services 240.
  • FIG. SB a data path 200 is illustrated in which the computer-vision specialization module 230 adapts the data path to meet the needs of a computer-vision application for object recognition by utilizing a filter chain comprising view narrowing 315, variable- angle rotation 345, and frame-drop control 340.
  • the filter chains provided in the examples shown in FIGS. 5A and 5B both include three filters, the computer-vision specialization module 230 can include a larger or smaller amount of filters, based on the needs of the computer-vision applications and services 240.
  • Parameters for each filter utilized by the computer-vision specialization module 230 can also vary, depending on needs of the computer- vision applications and services 240. For example, view narrowing 315 may be defined to provide the upper- right quadrant of an image for a first application, and a lower-left quadrant of an image for a second application. Furthermore, the computer-vision specialization module 230 may adapt the filters and their parameters according the changing needs of a single application. Thus, not only can the computer-vision specialization module 230 adapt the data path for different computer-vision applications, but also for different states and/or needs of a particular computer-vision application. 8058] Tn addition to the filters shown in FIG, 3, embodiments can utilize data switching to provide data ports for different segments of data.
  • a computer- vision specialization module 230 can employ a filter chain in which a multiple view-port data switch 620 separates data into different ports 630, With the multiple view-port data switch 620, the data path 200 can separate data from a single image into separate sub-image outputs, as if the images were taken by separate sensors.
  • the computer- vision specialization module 230 can use the filter(s) 610 and multiple view-port data switch 620 to extract, from an image of the faces of four people, separate images of each face, where the data for each image has a corresponding viewing port 630,
  • the facial-recognition computer- vision application can utilize the data from each port 630 and perform facial recognition from each port 630 separately.
  • Data switching in this manner can also facilitate how data is tagged in system memory 640 of a device, which can comprise the memory subsystem 150 of FIG, 1.
  • Data tagging can be particularly useful for devices with multiple sensors providing information to the data path 200.
  • embodiments can be configured to allow the multiple view-port data switch 620 of a computer-vis on specialization module 230 to tag the data of each data port 630 and write to the system memory 640.
  • the multiple view-port data switch 620 can be configured to align the data such that paging in the system memory 640 is reduced, thereby reducing latency and power consumption.
  • FIG. 7 illustrates an embodiment of a method 700 for providing an adaptive data path for computer-vision applications.
  • the method can be performed by, for example, a computer- vision processing unit, such as the computer- vision processing unit 120 of FIG. I .
  • means for performing each step of method 700 can include hardware and/or software components as described herein.
  • the method 700 may be performed by a specialized instruction-based, in-fine processor.
  • a memory subsystem of a device and'or memory internal to a computer- vision processing unit can be encoded with instructions for causing the device and/or computer- vis on processing unit to perform one or more of the steps of the method 700.
  • a first image is received.
  • the image can be received from one or more sensor(s) of an image sensor module, for example the sensor(s) 1 15.
  • some preliminary filtering may occur to the image before it is received by, for example, a core data path.
  • a first subset of im ge processing functions, or filters is selected based on a first computer- vision application executed by an application processing unit.
  • the first subset is selected from a plurality of image processing functions capable of being used in the processing of the first image.
  • selecting the first subset of image processing functions can be based on an input provided by the computer-vision application executed by an application processing unit, for example the application processing unit 130.
  • the input can be, for example, a reference image.
  • the input can be an instruction-based command interpreted by the computer-vision processing unit without a separate interpretive engine.
  • the input can include an instruction generated at the application layer by a computer-vision application.
  • the computer-vision processing unit (and/or other hardware and/or software implementing the method 700) can be dynamically programmed, based on the input, to implement the selected image processing functions.
  • the filters or functions may comprise any of the filters 315-355 or other filters not illustrated or described herein.
  • the fsrst image is processed using the first subset of image processing functions.
  • an interrupt or other output can be provided to the application processing unit and/or a memory subsystem to cause either or both to exit a low-power mode.
  • a plurality of sub-image outputs from the fsrst image can be provided, such as when a multiple view-port data switch is used to separate sub-images from a single image, as described previously in relation to FIG. 6
  • Optional blocks 740-760 illustrate the adaptability of the data path with regards to a second computer- ision application.
  • a second image is received, for example from sensor(s) 1 1 .
  • a second subset of image processing functions is selected, for example from the functions 315-355 or other functions, based on a second computer-vision application executed by the application processing unit.
  • the needs of the second computer- vision application can vary from the needs of the first compute -vision application, therefore resulting in the selection of a second subset of image processing functions, as illustrated above with regard to FIGS. 5 A and 5B.
  • the second image is processed using the second subset of image processing functions.
  • FIG. 7 illustrates an example method for providing an adaptive data path for computer- vision applications.
  • Alternative embodiments may include alterations to the embodiments shown.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Signal Processing (AREA)
  • Studio Devices (AREA)
  • Image Processing (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Embodiments of the present invention provide an adaptive data path for computer-vision applications. Utilizing techniques provided herein, the data path can adapt to the needs of a computer-vision application to provide the needed data. The data path can be adapted by applying one or more filters to image data from one or more sensors. Some embodiments may utilize a computer-vision processing unit comprising a specialized instruction-based, in-line processor capable of interpreting commands from a computer-vision application.

Description

[θθβί] Mobile phones, cameras, and other electronic devices often utilize an imaging core to capture and process imaging data for imagmg applications such as video or still image capture. The imaging core includes a collection of hardware and/or software components providing a data path through which data can flow from an image sensor (e.g., camera) to an application processing unit, such as a general purpose processor, and/or a display. In many electronic devices, the imaging core is also utilized for computer-vision applications (also known as "machine-vision applications").
SUMMARY
[8002] Embodiments of the present invention provide an adaptive data path for computer-vision applications. Utilizing techniques provided herein, the data path can adapt to a computer-vision application to provide data. The data path can be adapted by applying one or more filters to image data from one or more sensors. Some embodiments may utilize a computer- vision processing unit comprising a specialized instruction-based, in-line processor capable of interpreting commands from a computer- vision application.
[8003] An example apparatus for providing an adaptive data path for computer-vision applications, according to the disclosure, includes an application processing unit and a processing unit, separately programmable from the application processing unit, and communicatively coupled to an image sensor module and the application processing unit. The processing unit is configured to receive a first image from the image sensor module, select a first subset of image processing functions from a plurality of image processing functions, based on a first computer-vision application executed by the application processing unit, and process the first image using the first subset of image processing functions. 8004] An example method for providing an adaptive data path for computer-vision applications, according to the disclosure, includes receiving a first image, selecting a first subset of image processing functions from a plurality of image processing functions, based on a first computer-vision application executed by an application processing unit, and processing the first image, using the first subset of image processing functions. The processing occurs in a unit separate from the application processing unit.
[0005] An example processor for providing an adaptive data path for computer-vision applications, according to the disclosure, includes means for receiving a first image, means for selecting a first subset of image processing functions from a plurality of image processing functions, based on a first computer-vision application executed by an application processing unit, and means for processing the first image, using the first subset of image processing functions. The means for processing the first image are separate from the application processing unit.
[8006] An example non-transitory computer-readable medium, according to the disclosure, is encoded with instructions that, when executed, operate to cause a processing unit to perform operations including receiving a first image, selecting a first subset of image processing functions from a plurality of image processing functions, based o a first computer- vision application executed by an application processing unit, and processing the first image, using the first subset of image processing functions.
[0007] An example method, according to the description, includes receiving at an image signal processor intonnation derived from an image captured by an optical sensor, and receiving at the image signal processor an indicator of data for output. The method further includes processing the received information based on the received indicator, and outputting the processed information.
[0008] The method can include one or more of the following features. The indicator of data for output can include an indication of an application for which data is being generated or a type of application for which data is being generated. The indicator of data for output can include an indication of a type of data requested. The processing can include reducing an amount or type of the received information, and/or comparing the received information to desired features and removing portions of the received information that are irrelevant to the desired features. Additionally or alternatively, the processing can include comparing the received information to known information, and outputting the processed information can include outputting an alert when the received information differs from the known information. The processing can include identifying or generating the data for output from the received information, and the outputting the processed information can include waking another unit up (e.g., causing another unit to exit a low-power state by sending a signal, command, etc) only when the data is generated or identified. The other unit can be an application processor.
[8009] Another example method, according to the disclosure, can include receiving at an image signal processor information from an optical sensor, and receiving at the image signal processor an identification of a computer vision application, a type of computer vision application, or a state of a computer vision application. The method can further include selecting a computer vision module of the image signal processor based on the identification, and processing the mformation at the image signal processor with the selected computer vision module.
[8010] Yet another example method, accordmg to the disclosure, can include receiving at an image signal processor information derived from an image captured by an optical sensor, processing the received information, and outputting the processed information from the image signal processor to an application processor based on an application or application state which will consume the processed information.
[8011] The method can include one or more of the following features. A type or amount of the processed mformation can be dependent on the application or application state. The application can include a computer vision application. The processed information can include pixel data for further processing by the application, and/or an interrupt or wakeup for the application processor. The application processor can be asleep (e.g., in a low -power state) while the image signal processor is processing the received information. The method can further include selecting a data path in the image signal processor based on the application or application state, where the processing comprises processing the received information using the selected data path.
[8012] Items and/or techniques described herein may provide one or more of the following capabilities, as well as other capabilities not mentioned. Techniques can provide for increased efficiency by adapting the data path to computer- vision applications, thereby reducing processing overhead. Furthermore, by offloading computer-vision funct onality from an application processing unit to the data path, the application processing unit can enter a low-power mode of operation while the data path performs the offloaded functions, further reducing power and processing demands. These and other embodiments, along with many of its advantages and features, are described in more detail in conjunction with the text below and the attached figures,
BRIEF DESCRIPTION OF THE DRAWINGS
[0013] FIG. 1 is a simplified block diagram illustrating certain components of a device, according to one embodiment.
[0014] FIG. 2 is a functional block diagram illustrating a data path for providing data from a sensor module to computer- vision applications and services, according to one embodiment.
[8015] FIG. 3 is a diagram of a set of available filters of a computer-vision specialization module, according to one embodiment.
[0016] FIG. 4 is a block diagram illustrating how components of the computer-vision specialization module of FIG. 3 can be grouped into different functional portions of the data path.
[8017] FIGS. 5 A and 513 are diagrams illustrating how the computer- vision specialization can employ different filter chains, adaptively adjusting the data path based on the needs of the computer- vision applications and sendees.
[8018] FIG. 6 is an illustration of a data path using data switching to provide data ports for different segments of data, according to one embodiment.
[801 ] FIG. 7 illustrates an embodiment of a method for providing an adaptive data path for computer- vision applications, according to one embodiment.
DETAILED DESCRIPTION
[Θ02Θ] The following description is provided with reference to the drawings, where like reference numerals are used to refer to like elements throughout. While various details of one or more techniques are described herein, other techniques are also possible. In some instances, structures and devices are shown in block diagram form in order to facilitate describing various techniques, [8021] Mobile devices such as mobile phones, tablets, cameras, and the like often utilize an imaging core when capturing and processing imaging data for imaging applications such as video or still image capture. To execute imaging applications, mobile devices typically include one or more sensors allowing for image capture, a data path (implemented in hardware and/or software) that brings imaging data from the sensor(s) into the platform, and some processing. Because mobile devices have been historically used for imaging applications, mobile devices usually include an imaging core with a data path that is configured primarily for imaging applications.
[0022] However, many mobile devices are now capable of additionally executing computer- vision applications. In contrast to imaging applications, which typically include a limited number of applications dealing with camera- and camcorder-related functions, computer- vision applications can include a wide variety of applications providing an equally-wide spectrum of functionality. For example, computer- vision applications can provide one or more of the following functions:
1. Def ection of a variance in light,
2. Detection of a variation in viewing scene,
3. V i sua! obj ect d election ,
4. Visual object feature detection (e.g., colors, proximity, motion, etc, of a visual object),
5. Object tracking, and/or
6. Platform stability detection.
Some computer- vision applications may provide other functions in addition or as an alternative to these functions, indeed, as the sensing and processing capabilities of mobile devices continue to grow, additional functions will likely emerge.
[8023] in providing a sufficient data path for such computer-vision applications, a mobile device may switch operation between active states, requiring a relatively large amount of processing, and states requiring relativ ely little amount of processing. Low- processing states are, for example, states in which the computer-vision applications do not receive features of interest from the data path that would require relatively high amounts of processing. For mobile devices which are battery powered, efficient execution of both high- and low-processing states can provide longer battery life.
However, as noted previously, the imaging core for mobile devices typically has a fixed data path with functionality favoring imaging applications; the data path being configured, for example, to provide images with high resolutions and/or at high frame rates. Thus, rather than providing a data path that can allow the mobile device to efficiently execute both high- and low -processing states for computer-vision applications, traditional mobile devices will often provide data paths with much more overhead than necessary. Additionally, because data paths in traditional mobile devices often will not provide filtering needed by computer- vision applications, much of this filtering takes place on a general application processing unit. This can not only adversely affect the electronic device's battery life, but also a user's quality of experience (QOE). Embodiments disclosed herein provide for an adaptive data path for computer-vision applications that can adjust to requirements of computer-vision applications, offloading filters for computer-vision applications from an application processing unit to the data path, and providing different data (and different types of data) to the application processing unit, depending on the computer-vision applications. This ultimately provides for increased battery life and/or QOE.
[8024] FIG. I is a block diagram illustrating certain components of a device 100 having an adaptive data path for computer-vision applications, according to one embodiment. The components can include sensor(s) 1 15, a sensor controller 1 17, an application processing unit 130, graphics processing unit 140, display 170, display controller 160, memory subsystem 150, and a computer-vision processing unit 120. The device 100 can include various additional components that are not shown, such as a communication interface, user interface, and more. As with other figures provided herein, FIG. 1 is meant only to provide a generalized illustration of various components, any or all of which may be utilized as appropriate. Different embodiments may add, omit, substitute, combine, and/or divide any of the components shown. A person of ordinary skill in the art will recognize many alterations.
[0025] In some embodiments, the device 100 comprises a mobile device. For example, the device 100 may comprise a mobile phone, tablet, personal media player, portable gaming system, digital camera, camcorder, and the like. It can be rioted that, although techniques provided herein can be utilized on mobile devices to provide power savings and/or other advantages such as reduced latency in some embodiments, the device 100 is not so iimiied. That is, embodiments may include devices 100 and may not be considered mobile. For example, a television may include a data path for imaging applications (e.g., a video conferencing application) and a data path for machine-vision (e.g., a gesture control application). A game system may include a data path for imaging applications (e.g., in-game video chat) and a data path for machine- vision (e.g., gesture and/or body control). Other embodiments involving non-mobile devices are contemplated. 0026J The sensor(s) 1 15 can include one or more of a variety of sensors, depending on the functionality of the device 100, This can include sensors that are not associated with camera or camcorder functions. The sensor(s) could include, for example, sensor(s) for detecting infrared (IR) and/or near IR, sensor(s) for determining different colorimetry, sensor(s) for determining depth (e.g., time-of-flight, structured-light, etc.), as well as one or more cameras (e.g., stereoscopic cameras, front- and/or back-facing cameras, etc.), and/or other imaging sensor(s).
[8027] The sensor controller 1 17 can provide an interface with the sensor(s) 1 15. The sensor controller 1 17 can control the sensor(s) 1 15 based on input from other components, such as the application processing unit 130 and/or the computer-vision processing unit 120. in some embodiments, the sensor controller 1 17 can provide, for example, while-balance control, exposure control, focus control, binning & skip control, 2D/3D windowing/boxing, and/or other functions. The sensor(s) 115 and/or sensor controller 1 17 can form at least a portion of an image sensor module 1 10 of the device 100.
[8028] The application processing unit 130 can include without limitation one or more general-purpose processors, one or more special-puipose processors (such as digital signal processors, graphics acceleration processors, and/or the like), and/or other processing structure, which can be configured to execute computer-vision and other software applications, which may be stored in the memor subsystem 150. In some embodiments, the application processing unit 130 comprises an application or "apps" processor configured with one or more cores. [8029] The memory subsystem 150 can include one or more non-transitory computer- readable storage media providing application and/or system storage. Such storage devices can include a disk drive, a drive array, an optical storage device, a solid-state storage device, such as a random access memory ("RAM"), and/or a read-only memory ("ROM"), which can be programmable, flash-updateable, and/or the like. Such storage devices may be configured to implement any appropriate data stores, including without limitation, various file systems, database structures, and/or the like.
[0030] Software elements can be included in the memory subsystem as instructions embedded on the one or more non-transitory computer-readable storage media. Such software elements can include an operating system, device drivers, executable libraries, and/or other code, such as one or more application programs, which may be designed to cause the application processing unit 130, computer- vision processing unit 120, a d'or other components to implement methods (or portions thereof), and/or configure systems, as provided by embodiments described herein. Merely by way of example, one or more procedures described herein, including the method described with respect to FIG. 7, might be implemented as code and'or instructions executable by the computer-vision processing unit 120 and/or application processing unit 130.
[8031] The graphics processing unit 140 and display controller 160 can be used to show one or more images on a display 170, depending on the application(s) executed by the application processing unit 130. it can be noted that imaging applications such as video or still image capture typically display images, and therefore require the utilization of the display 170, display controller 160, and'or graphics processing unit 140. Many computer-vision applications, on the other hand, may not need to utilize these components at all. Accordingly, these components of the device 100 can be included or excluded from the data path to adapt to computer-vision applications executed by the application processing unit 130.
[0032] The computer- vision processing unit 120 can include hardware and/or software subcomponents for providing an adaptive data path from the sensor(s) 1 15 to the application processing unit 130. This can include, for example, a video front end (VFE), image-signal processor (ISP), digital-signal processor (DSP), and'or other processing unit separately programmable from the application processing unit.
Optionally, the computer-vision processing unit 12.0 can read from and/or write to the memory subsystem 150. In some embodiments, the computer-vis on processing unit 120 can be dynamically programmed, based on a computer-vision application executed by the application processing unit 130. Accordingly, the computer-vision processing unit 120 may execute applications stored in the memory subsystem 150 and/or a memory internal to the computer-vision processing unit 120 (which can include features similar to those described above in relation to the memory subsystem 150). In some embodiments, depending on the size of the memory internal to the computer- vision processing unit 120 and/or needs of a computer-vision application, the computer- is on processing unit 120 may be able to provide the adaptive data path without the use of the memory subsystem 150, thereby enabling the memory subsystem 150 to enter a low- power mode in addition to the application processing unit 130,
[8033] In one embodiment, the computer- vision processing unit 120 can comprise a specialized instruction-based, in-line processor adapted to implement various filters for computer-vision applications. As such, the computer-vision applications can call the instructions to implement the tllter(s) needed without the need to utilize an interpretive engine in some embodiments. In other words, filters implemented by the computer- vision processing unit 12.0 can correspond to a set of standardized instruction-based commands tied to an interpretive language that is translated, in real time, by the specialized instruction-based, in-line processor to identify the needed fiiter(s) for the computer- vision application,
[8(534] Techniques provided herein for an adaptive data path for computer-vision applications can enable programmability controlled at an application layer. That is, a computer-vision application executed by the application processing unit 130 can provide instructions and/or other information to the computer-vision processing unit 120 at any time in some embodiments. Such functionality grants the application more power, allowing for a dynamically programmable computer- vision processing unit 120 that provides computer-vision data in an efficient manner, without any additional knowledge of the context in which the computer-vision data is used. The computer- vision application can determine the context and pro vide instructions to the computer- vision processing unit 120 accordingly. As indicated above, embodiments can allow computer-vision applications to calf instructions to implement desired filter(s) without the need to utilize an interpretive engine. [8035] it will be apparent to those skilled in the art that substantial variations may be made to the components shown in FIG. 1 in accordance with specific requirements. For example, customized hardware might also be used, and/or particular elements might be implemented in hardware, software, or both, in addition or as an alternative to the descriptions above. Further, connection to other devices and/or sensors may be employed.
[8036] FIG. 2 is a functional block diagram illustrating a data path 200 providing data from a sensor module 210 to computer-vision applications and services 240, according to one embodiment. The data path 200 can be executed by one or more components of the device 100 of FIG. 1 , and/or similar means. In addition to the sensor module 210 and computer- vision applications and services 240, the data path 200 can include a core data path 220 and a computer-vision specialization module 230. Other embodiments may vary from the embodiment shown by utilizing different competent in a similar functional manner. A person of ordinary skill in the art will recognize many alterations.
[8037] The sensor module 210 can comprise sensors and/or other components that output sensor data, such as the image sensor module 1 10 of FIG. 1 , and/or similar means. Depending on the functionality of the device 100 and/or the capabilities of the sensor(s) of the sensor module 210, the data from the sensor module can vary. The data from the sensor module 210 is then passed to the core data path 220.
[8038] The functionality of the core data path 220 can provide an interface between the sensor module 210 and the computer- vision specialization module 230. It may be adapted to accommodate specific sensor data of the sensor module 210 and output the data in a uniform manner, according to some embodiments. Furthermore, the core data path 220 may perform some initial processing of the data of the sensor module 210, employing, for example, certain filters that are used in imaging applications, computer- vision applications, and/or other common applications executable by the device 100. Such filters can include, for example, optimization filters for lighting, color, exposure, focus, and the like. The core data path 2.20 may be implemented by portions of the computer- vision processing unit 120 and/or image sensor module 1 10 of FIG. 1 , as well as one or more intermediary components (not shown), and/or similar means. In some embodiments the core data path 220 may be implemented, at least in part, by a video front end (VFE). [8039] The computer-vision specialization module 230 comprises an adaptable component of the data path 200 that can be executed by the computer-vision processing unit 12.0 and/or memory subsystem 150 of FIG. 1. The computer-vision specialization module 230 is configured to determine an input for computer-vision applications and services 240 and adapt the data path accordingly, thereby adapting the data from the sensor module 210 to the input for the computer- vision applications and services 240. For example, the computer-vision specialization module 230 can determine the input based on a requirement or need of one or more of the computer- vision applications and services 240 or based on an instruction therefrom or an interface thereof, in particular, the computer-vision specialization module 230 can receive an image (or other data) from the sensor module 210, via the core data path 220, and process the image using at least a subset of available image processing functions (also referred to herein as "filters") based on the needs of the computer-vision applications and services 240. In some embodiments, the computer- vision specialization moduie 230 or portions of its functionality is implemented by an ISP.
[ΘΘ40] The computer-vision applications and services 240 can be executed by the application processing unit 130 and/or the memory subsystem 150 of FIG. 1 , and/or similar means. As discussed in more detail below, the computer-vision applications and services 240 can provide an indication of desired data or information to the computer- vision specialization module 230 by providing the computer- vision specialization module 230 with a reference (an image, command, etc.). As disclosed previously, computer-vision applications (and the needs thereof) can vary substantially, depending on application. It will be understood that computer-vision applications and services 240, as described herein, can include one or more computer-vision applications and/or computer-vision sendees. Where multiple computer-vision applications and/or services have many needs, the computer-vision specialization moduie 230 can provide an optimal data path to serve the needs of all of them in some embodiments.
[8(541] FIG. 3 illustrates a set of available filters 315-355 of a computer- vision specialization moduie 230, according to one embodiment. In some embodiments, the filters are at least partially implemented in hardware, providing faster, more-efficient filtering than if executed in software alone. Further, implementation of such filters in the computer-vision specialization module 230 instead of in the computer-vision applications and services 240 and/or the application processing unit 130 may increase processing speed and/or reduce power consumption in some embodiments. Although FIG. 3 illustrates the filters as providing a certain data path from an input 310 received from the core data path 22.0 to an output 375 provided to the computer- vision applications and sendees 240 in a particular order, the order can he altered, depending on application. Moreover, filters can be added, omitted, substituted, combined, and/or di vided depending on application. Furt her, any one or more of the filters may be selected for use for a given function or process without necessitating the use of other filters in the computer-vision specialization module 230 in some embodiments.
[ΘΘ42] To indicate needs, the computer-vision applications and sendees 2.40 can provide a reference 360 via an input 365. Depending on the application, the reference can be, for example, a reference image. The reference can, additionally or alternati ely, include information about an image (e.g., a histogram providing a color distribution for an image). Optionally, the reference can include a command or set of instructions from the computer-vision applications and services 240, which may adhere to a certain protocol and/or instruction set. In some instances, a reference may be obtained by the computer-vision specialization module 230 directly from the core data path 22.0 in addition or as an alternative to receiving a reference from computer-vision applications and services 240. Once the reference is obtained, the computer-vision specialization module 230 can store it in memory.
[8(543] Dynamic view selection 325 may be configured to select a certain view for subsequent processing. In particular, depending on the computer- vision application, a field of view can be sampled by multiple image-sensors (e.g., at the least two for stereo- imaging). As such, the field of view from several image-sensors can be dynamically adjusted to minimize the required processing for computer-vision. In one example, hand-gesture recognition may track a hand, the image of which is captured by multiple image sensors. Dynamic view selection 32.5 can select, from the plurality of images, the best image for hand tracking. In another example, the dynamic view selection 325 may- track a fiducial marker and select the image with the best vie w of (e.g. most perpendicular to) the fiducial marker, for example for use with an augmented reality (AR) application. This can help reduce subsequent processing by reducing the amount of images to process. [8044] View narrowing 315 and view localization 320 may be configured to remove portions of an image an application is not interested in. For example, a particular computer -vision application may be interested in a particular object thai is placed in one quadrant of a 2-megapixei image. In such a case, only a portion of the image (e.g., 500,000 pixels) would need to be processed and passed to the computer-vision applications and services 240. Accordingly, the computer-vision specialization module 230 can be adapted to employ view narrowing 315 to, for example, isolate the relevant portion of the image and/or estimate the number of pixels and/or lines that represent the object. As needs of the computer-vision applications and services 240 change (e.g., the particular computer-vision application enters a different state and'Or another computer- vision application is executed), the computer-vision specialization module 230 can adapt accordingly. Thus, the computer-vision specialization module 230 can use view narrowing 315 and'Or view localization 320 to provide different resolutions at different times.
[8(545] Variable-angle rotation 345 may be configured to rotate an image and/or object within an image with respect to a sensor capturing the image. For example, a computer- vision application may only identify a rectangle in an image when the rectangle is in a certain orientation respect to a sensor. Rather than forcing a computer- vision application to execute algorithms for adjusting the image's orientation (e.g., running the algorithms on the application processing unit 130), the computer- vision specialization module 230 can adapt the data path to rotate all or a portion of the image using the variable-angle rotation 345.
[8046] Color transformation 350 , color filtering & masking 330, and histogram analysis 335 can be implemented in the data path when, for example, the computer- vision applications and services 240 indicate a need for color-recognition, and/or manipulation.
[8847] Frame-drop control 340 may be configured to reduce the amount of images (frames) delivered to the computer-vision applications and services 240, thereby reducing bandwidth and power needs. For example, once programmed, a sensor of the sensor module 210 may deliver a certain amount of frames per second. However, if this amount is larger than what the computer- vision applications and services 240 needs, the data path can be configured to drop any extra frames using the frame-drop control 340. [8048] Scene change detection & motion estimation 355 may be configured to determine whether a change in sensor input has occurred. The change can be determined, for example, by comparing a current image with a past image. As indicated previously, a past image may be maintained as a reference 360, retrieved from the data path or received from the computer- vision applications and sendees 240 via input 365.
[0049] Although not illustrated in Fig. 3, the computer-vision specialization module 230 may also include one or more filters configured to identify and/or output features, and/or one or more modules configured to track elements or features and'Or output a position thereof. For example, the computer- vision specialization module 230 may comprise one or more functions configured to extract features from an image such as a Scale Invariant Feature Transfonn (SIFT), PhonySIFT, Speeded-up Robust Features (SURF), and/or Features from Accelerated Segment Test (FAST) comer detector.
Further, the functions may include one or more functions to compute spectral features or characteristics.
[8(558] The computer-vision specialization module 230 can also include a computer- vision interrupt processor 370. With the computer-vision interrupt processor 370, a computer-vision specialization module 230 can allow the underlying hardware executing the computer- vision applications and services 240 (e.g., an application processing unit 130 and'Or memory subsystem 150) to enter a low-power (e.g., "standby") mode. When the computer-vision specialization module 230 determines that a triggering event has taken place, the computer-vision specialization module 230 can utilize the computer-vision interrupt processor 370 to provide an interrupt to underlying hardware to cause the underlying hardware to exit the low-power mode.
[8051] When combined with other filters, the computer- vision interrupt processor 370 can allow the computer- vision specialization module 230 to perform a variety of functions in the data path, while requiring little or no processing from the underlying hardware executing the computer-vision applications and services 240. For example, computer -vision applications and services 240 may include a security application in which the device 100 monitors an area with a camera and sounds an alarm if it detects a change in the monitored area. To preserve battery power, the underlying hardware executing the computer-vision applications and services 240 (e.g., an application processing unit 130 and/or memory subsystem 150) can enter a standby mode while the computer-vision specialization module 230 employs scene change detection & motion estimation 355 to compare sensor input with a reference image of the monitored area. If the scene change detection & motion estimation 355 determines that there is a change in the monitored area (e.g., someone or something enters the scene viewed by the camera), it can notify the computer- vision interrupt processor 370. The computer-vision interrupt processor 370, in turn, can generate an interrupt for the underlying hardware executing the computer-vision applications and services 240, which can then cause the security application to sound the alarm. By providing the computer-vision interrupt processor 370 and scene change detection & motion estimation 355 in the data path in this manner, rather than in the computer-vision applications and services 240, the device 100 is able execute the security application in an efficient manner without unnecessary- processing by the underlying hardware executing the computer- vision applications and services 240.
[8052] FIG. 4 is a block diagram 400 illustrating how components of the computer- vision specialization module 230 can be grouped into different functional portions of the data path for computer- vision applications and services 240. For example, components may be grouped into the following functional portions: an input layer 420, primitive recognition 430, and structure recognition 440. Aspects of the data path not included in the computer-vision specialization module 230 (included in the core data path 220, for example) can also contribute to the functionality of the input layer 420, primitive recognition 430, and/or structure recognition 440, depending on desired functionality. That is, the core data path 220 can provide an input 310 for which portions of the input layer 420, primitive recognition 430, and/or structure recognition 440 may have already been performed.
[8053] The input layer 420 comprises a portion of the data path that brings in data from the sensor module 210 while potentially reducing the required processing in subsequent computer-vision processing blocks. To do so, the input layer 420 may be configured with one or more image-processing filters including, for example, color transformation 350, color filtering and masking 330, dynamic view selection 325, view localization 320, view narrowing 315, variable-angle rotation 345, color transformation 350, color filtering & masking 330, and/or frame-drop control 340, and the like.
Additionally or alternatively, the input layer 420 can provide image-processing filters optimizing environment lighting, color, exposure, and/or focus, as well as filters configured for feature extraction, and/or other filters. 00S4J Th e primitive recognition 430 portion of the data path can be configured based on the needs of the computer-vision applications and services 240, This can include determining whether certain features are present in the data from the sensor module 2.10, For instance, if the computer-vision applications and sendees 2.40 include hand gesture recognition, the primitive recognition 430 could be configured to recognize a hand. For applications interested in certain colors, the primitive recognition 430 could be configured to examine the colorimetry of the data from the sensor module 210 to determine if one or more colors are present. To provide this functionality, the primitive recognition 430 can be configured with one or more image-processing filters including, for example, view localization 320 and/or view narrowing 315, and the like.
Additionally or alternatively, the primitive recognition 430 can be configured with image-processing filters including segmentation, posture detection, and/or other filters.
[8(555] The structure recognition 440 can be configured to track features in the data from the sensor module 210 over time and define structures, which can also be based on the needs of the computer- vision applications and services 240. The features in the data can include identifiable features such as edges, corners, SIFT (scale-invariant featitre transform), and the like. Structures defined by the tracked features can include, for example, symbols and/or vision gestures. To provide this functionality, and/or to potentially reduce the required processing in primitive and/or structure recognition processing blocks, the structure recognition 440 can be configured wiih one or more image-processing filters including, for example, scene change detection & motion estimation 355, frame-drop control 340, histogram analysis 335, and the like.
Additionally or alternatively, the structure recognition 440 can be configured with image-processing filters including tracking, prediction, gesture detection, and/or other filters. The tracked features can vary depending on the needs of the application requesting and/or consuming the structure recognition. A gesture-based application, for example, may require tracking of a hand to determine gestures, A context-aware application or augmented reality application, on the other hand, may track any of a variety of objects for object recognition. [8056] FIGS, 5A and 5B provide examples of how the computer-vision specialization can employ different filter chains, adaptively adjusting the data path 200 based on the needs of the computer- ision applications and services 240. In FIG. 5A, the computer- vision specialization module 230 adapts the data path to a security application by utilizing a filter chain comprising histogram analysis 335, scene change detection & motion estimation 355, and computer- vision interrupt processor 370 to provide an output 375 to the computer-vision applications and services 240. In FIG. SB, a data path 200 is illustrated in which the computer-vision specialization module 230 adapts the data path to meet the needs of a computer-vision application for object recognition by utilizing a filter chain comprising view narrowing 315, variable- angle rotation 345, and frame-drop control 340. It can be noted that, although the filter chains provided in the examples shown in FIGS. 5A and 5B both include three filters, the computer-vision specialization module 230 can include a larger or smaller amount of filters, based on the needs of the computer-vision applications and services 240.
[8057] Parameters for each filter utilized by the computer-vision specialization module 230 can also vary, depending on needs of the computer- vision applications and services 240. For example, view narrowing 315 may be defined to provide the upper- right quadrant of an image for a first application, and a lower-left quadrant of an image for a second application. Furthermore, the computer-vision specialization module 230 may adapt the filters and their parameters according the changing needs of a single application. Thus, not only can the computer-vision specialization module 230 adapt the data path for different computer-vision applications, but also for different states and/or needs of a particular computer-vision application. 8058] Tn addition to the filters shown in FIG, 3, embodiments can utilize data switching to provide data ports for different segments of data. An example of this functionality is illustrated in FIG. 6. Here, in addition to any other filter(s) 610 (such as filters 315-355 described in relation to FIG. 3), a computer- vision specialization module 230 can employ a filter chain in which a multiple view-port data switch 620 separates data into different ports 630, With the multiple view-port data switch 620, the data path 200 can separate data from a single image into separate sub-image outputs, as if the images were taken by separate sensors. For example, for a facial-recognition computer- vision application in the computer-vision applications and services 240, the computer- vision specialization module 230 can use the filter(s) 610 and multiple view-port data switch 620 to extract, from an image of the faces of four people, separate images of each face, where the data for each image has a corresponding viewing port 630, The facial-recognition computer- vision application can utilize the data from each port 630 and perform facial recognition from each port 630 separately.
[8059] Data switching in this manner can also facilitate how data is tagged in system memory 640 of a device, which can comprise the memory subsystem 150 of FIG, 1. Data tagging can be particularly useful for devices with multiple sensors providing information to the data path 200. As shown in FIG. 6, embodiments can be configured to allow the multiple view-port data switch 620 of a computer-vis on specialization module 230 to tag the data of each data port 630 and write to the system memory 640. The multiple view-port data switch 620 can be configured to align the data such that paging in the system memory 640 is reduced, thereby reducing latency and power consumption.
FIG. 7 illustrates an embodiment of a method 700 for providing an adaptive data path for computer-vision applications. The method can be performed by, for example, a computer- vision processing unit, such as the computer- vision processing unit 120 of FIG. I . Accordingly, means for performing each step of method 700 can include hardware and/or software components as described herein. In one embodiment, the method 700 may be performed by a specialized instruction-based, in-fine processor. Furthermore, a memory subsystem of a device and'or memory internal to a computer- vision processing unit can be encoded with instructions for causing the device and/or computer- vis on processing unit to perform one or more of the steps of the method 700.
[8061] At block 710, a first image is received. The image can be received from one or more sensor(s) of an image sensor module, for example the sensor(s) 1 15. In some embodiments, some preliminary filtering may occur to the image before it is received by, for example, a core data path.
[8062] At block 720, a first subset of im ge processing functions, or filters, is selected based on a first computer- vision application executed by an application processing unit. The first subset is selected from a plurality of image processing functions capable of being used in the processing of the first image. As indicated previously, selecting the first subset of image processing functions can be based on an input provided by the computer-vision application executed by an application processing unit, for example the application processing unit 130. The input can be, for example, a reference image. Where the method 700 is performed by a specialized computer- vision processing unit, the input can be an instruction-based command interpreted by the computer-vision processing unit without a separate interpretive engine. The input can include an instruction generated at the application layer by a computer-vision application.
Additionally or alternatively, the computer-vision processing unit (and/or other hardware and/or software implementing the method 700) can be dynamically programmed, based on the input, to implement the selected image processing functions. The filters or functions may comprise any of the filters 315-355 or other filters not illustrated or described herein.
[0063] At block 730, the fsrst image is processed using the first subset of image processing functions. Depending on desired functionality, and/or needs of the computer-vision application, an interrupt or other output can be provided to the application processing unit and/or a memory subsystem to cause either or both to exit a low-power mode. In some instances, a plurality of sub-image outputs from the fsrst image can be provided, such as when a multiple view-port data switch is used to separate sub-images from a single image, as described previously in relation to FIG. 6
[0064] Optional blocks 740-760 illustrate the adaptability of the data path with regards to a second computer- ision application. At block 740, a second image is received, for example from sensor(s) 1 1 . At block 750, a second subset of image processing functions is selected, for example from the functions 315-355 or other functions, based on a second computer-vision application executed by the application processing unit. Here, for example, the needs of the second computer- vision application can vary from the needs of the first compute -vision application, therefore resulting in the selection of a second subset of image processing functions, as illustrated above with regard to FIGS. 5 A and 5B. However, as explained previously, not only can the data path alter the image processing functions used and/ or the parameters of each image processing function based on different computer-vision applications executed by the application processing unit, but also after the image processing functions used and/or the parameters of each image processing function based on the state of a particular computer-vision application. At block 760, the second image is processed using the second subset of image processing functions.
[0065] it should be appreciated that the specific steps illustrated in FIG. 7 illustrate an example method for providing an adaptive data path for computer- vision applications. Alternative embodiments may include alterations to the embodiments shown.
Furthermore, additional features may be added or removed depending on the particular applications. One of ordinary skill in the art would recognize many variations, modifications, and alternatives.
[8066] The methods, systems, and devices discussed herein are examples. Various embodiments may omit, substitute, or add various procedures or components as appropriate. For instance, features described with respect to certain embodiments may be combined in various other embodiments. Different aspects and elements of the embodiments may be combined in a similar manner. The various components of the figures provided herein can be embodied in hardware and/or software. Also, technology evol v es and, thus, many of the elements are examples that do not limit the scope of the disclosure to those specific examples.
[8067] Specific details are given in the description to provide a thorough
understanding of the embodiments. However, embodiments may be practiced without these specific details. For example, well-known circuits, processes, algorithms, structures, and techniques have been shown without unnecessary detail in order to avoid obscuring the embodiments. This description provides example embodiments only, and is not intended to limit the scope, applicability, or configuration of the invention.
Rather, the preceding description of the embodiments will provide those skilled in the art with an enabling description for implementing embodiments of ihe invention.
Various changes may be made in the function and arrangement of elements without departing from the spirit and scope of the invention.
[0068] Having described several embodiments, various modifications, alternative constructions, and equivalents may be used without departing from the spirit of the disclosure. For example, the above elements may merely be a component of a larger system, wherein other rules may iake precedence over or otherwise modify the application of the invention. Also, a number of steps may be undertaken before, during. or after the above elements are considered. Accordingly, the above description does not limit the scope of the disclosure.

Claims

WHAT' IS CLAIMED IS:
1 1. An apparatus for providing an adaptive data path for eomputer-
2. vision applications, the apparatus comprising:
3 an application processing unit; and
4 a processing unit, separately programmable from the application
5 processing unit, and communicatively coupled to an image sensor module and the
6 application processing unit and configured to:
7 receive a first image from the image sensor module;
8 select a first subset of image processing functions from a plurality
9 of image processing functions, based on a first computer-vision application 0 executed by the application processing unit; and
1 process the first image using the first subset of image processing2 functions.
1 2. The apparatus of claim 1 , wherein the processing unit is further
2 configured to:
3 receive a second image from the image sensor module; and
4 select a second subset of image processing functions from the
5 plurality of image processing functions, wherein the second subset of image
6 processing functions is:
7 based on a second computer-vision application executed
8 by the application processing unit; and
9 different than the first subset of image processing0 functions; and
1 process the second image using the second subset of image 2 processing functions,
1 3. The apparatus of claim 1 , wherein the processing unit is
2 configured to select the first subset of image processing functions based on an input
3 provided by the application processing unit.
1 4. The apparatus of claim 3, wherein the processing unit is
2 configured to be dynamically programmed, based on the input.
1 5. The apparatus of claim 3, wherein:
2 the input comprises an instruction-based command; and
3 the processing unit is further configured to interpret the instruction-based
4 command.
1 6. The apparatus of claim 3, wherein the input comprises an
2 instruction generated at an application layer of the apparatus.
1 7. The apparatus of claim I , wherein the processing unit is
2. configured to process the first image based on a reference image maintained by the
3 processing unit.
1 8. The apparatus of claim 1 , further comprising a memory
2 subsystem, wherein the processing unit is configured to provide an output that causes
3 either or both the application processing unit or the memory subsystem to exit a low-
4 power mode.
1 9. The apparatus of claim 1 , wherein the processing unit is
2 configured to provide a plurality of sub-image outputs from the first image.
1 10. The apparatus of claim 1, wherein the processing unit comprises
2 an image signal processor.
1 1 1. A method for providing an adaptive data path for computer-
2 vision applications, the method comprising:
3 receiving a first image;
4 selecting a first subset of image processing functions from a plurality of
5 image processing functions, based on a first computer-vision application executed by an
6 application processing unit; and
7 processing the first image, using the first subset of image processing
8 functions, wherein the processing occurs in a unit separate from the application
9 processing unit.
1 12. The method of claim 1 1 , further comprising:
2 receiving a second image; 3 selecting a second subset of image processing functions from the
4 plurality of image processing functions, wherein the second subset of image processing
5 functions is:
6 based on a second computer-vision application executed by the
7 application processing unit: and
8 different than the first subset of image processing functions; and
9 processing the second image using the second subset of image processing0 functions.
1 13. The method of claim 1 1, wherein the selecting a first subset of
2 image processing functions is based on an input provided by the application processing
3 unit.
1 14. The method of claim 13, wherein processing the first image is
2 performed by a processing unit that is dynamically programmed based on the input.
1 15. The method of claim 13, wherein the input comprises an
2 instruction-based command, the method further comprising interpreting the instruct! on-
3 based command.
1 16. The method of claim 13, wherein the input comprises an
2 instruction generated by the first computer- vision application.
1 17. The method of claim 1 1, further comprising:
2. maintaining a reference image;
3 wherein processing the first image is based on the reference image.
1 18. The method of claim 1 1, further comprising providing an output
2 that causes either or both the application processing unit or a memory subsystem to exit
3 a low-power mode.
1 19. The method of claim 1 1 , further comprising providing a plurality
2 of sub-image outputs from the first image.
1 20. The method of claim 1 1, wherein processing the first image is
2. performed using an image signal processor,
21. A processor for providing an adaptive data path for computer- vision applications, the processor comprising:
means for receiving a first image;
means for selecting a first subset of image processmg functions from a plurality of image processing functions, based on a first computer-vision application executed by an application processing unit; and
means for processing the first image, using the first subset of image processing functions, wherein the means for processing the first image are separate from the application processing unit. 22. The processor of claim 21 , comprising:
means for receiving a second image;
means for selecting a second subset of image processing functions from the plurality of image processing functions, wherein the second subset of image processing functions is:
based on a second computer-vision application executed by the application processing unit; and
different than the first subset of image processing functions; and means for processing the second image using the second subset of image processmg functions. 23. The processor of claim 21, wherein the means for selecting the first subset of image processing functions include means for basing the selection on an input provided by the application processing unit. 24. The processor of claim 23, wherein the processor is dynamically programmable based on the input. 25. The processor of claim 23, wherein the input comprises an instruction-based command, the processor further comprising means for interpreting the ins!ruciion-based command. 26. The processor of claim 23, wherein the input comprises an instruction generated by the first computer- vision application.
27. The processor of claim 21, further comprising means for maintaining a reference image, wherem the means for processing the first image are configured to process the first image based on the reference image, 28. The processor of claim 21 , further comprising means for providing an output that causes either or both the application processing unit or a memory subsystem to exit a low-power mode. 2.9. The processor of claim 21 , further comprising means for providing a plurality of sub-image outputs from the first image. 30. A non-transitory computer-readable medium encoded with instructions that, when executed, operate to cause a processing unit to perform operations comprising:
receiving a first image;
selecting a first subset of image processing functions from a plurality of image processing functions, based on a first computer-vision application executed by an application processing unit; and
processing the first image, using the first subset of image processing functions,
PCT/US2014/018943 2013-03-12 2014-02-27 Adaptive data path for computer-vision applications WO2014158635A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
JP2016500456A JP6706198B2 (en) 2013-03-12 2014-02-27 Method and processor for providing an adaptive data path for computer vision applications
BR112015022860A BR112015022860A2 (en) 2013-03-12 2014-02-27 adaptive data path for computer view applications
CN201480012145.1A CN105190685B (en) 2013-03-12 2014-02-27 Method and apparatus for providing the self-adapting data path for being used for computer vision application program
EP14717531.9A EP2973386B1 (en) 2013-03-12 2014-02-27 Adaptive data path for computer-vision applications
KR1020157027143A KR102188613B1 (en) 2013-03-12 2014-02-27 Adaptive data path for computer―vision applications

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13/797,580 2013-03-12
US13/797,580 US9052740B2 (en) 2013-03-12 2013-03-12 Adaptive data path for computer-vision applications

Publications (1)

Publication Number Publication Date
WO2014158635A1 true WO2014158635A1 (en) 2014-10-02

Family

ID=50487106

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2014/018943 WO2014158635A1 (en) 2013-03-12 2014-02-27 Adaptive data path for computer-vision applications

Country Status (7)

Country Link
US (2) US9052740B2 (en)
EP (1) EP2973386B1 (en)
JP (1) JP6706198B2 (en)
KR (1) KR102188613B1 (en)
CN (1) CN105190685B (en)
BR (1) BR112015022860A2 (en)
WO (1) WO2014158635A1 (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9052740B2 (en) * 2013-03-12 2015-06-09 Qualcomm Incorporated Adaptive data path for computer-vision applications
KR102037283B1 (en) * 2013-06-18 2019-10-28 삼성전자주식회사 Image sensor, image signal processor and electronic device including the same
US9659380B1 (en) * 2016-01-07 2017-05-23 International Business Machines Corporation Object position tracking using motion estimation
AU2017361245B2 (en) * 2016-11-16 2023-06-22 Magic Leap, Inc. Mixed reality system with reduced power rendering
BR112019014550A2 (en) * 2017-01-23 2020-02-18 Qualcomm Incorporated EXECUTION OF APPLICATIONS AND CONTROL OF SINGLE PROCESSOR COMPUTATIONAL VIEW HARDWARE
CN110168565B (en) * 2017-01-23 2024-01-05 高通股份有限公司 Low power iris scan initialization
US10742834B2 (en) * 2017-07-28 2020-08-11 Advanced Micro Devices, Inc. Buffer management for plug-in architectures in computation graph structures
KR20190106251A (en) 2018-03-08 2019-09-18 삼성전자주식회사 electronic device including interface coupled to image sensor and interface coupled between a plurality of processors
JP7403279B2 (en) * 2019-10-31 2023-12-22 キヤノン株式会社 Image processing device and image processing method
JP6885640B1 (en) * 2020-10-01 2021-06-16 株式会社ラムダシステムズ Image processing device
US20220314492A1 (en) * 2021-04-05 2022-10-06 Sysdyne Technologies LLC Concrete mixer truck drum rotation measurement using camera
US11783453B2 (en) * 2021-06-10 2023-10-10 Bank Of America Corporation Adapting image noise removal model based on device capabilities

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1093668C (en) * 1997-05-20 2002-10-30 致伸实业股份有限公司 Computerized image processing system
JP3532781B2 (en) * 1999-02-12 2004-05-31 株式会社メガチップス Image processing circuit of image input device
US7327396B2 (en) 2002-04-10 2008-02-05 National Instruments Corporation Smart camera with a plurality of slots for modular expansion capability through a variety of function modules connected to the smart camera
US7256788B1 (en) * 2002-06-11 2007-08-14 Nvidia Corporation Graphics power savings system and method
JP2004104347A (en) * 2002-09-06 2004-04-02 Ricoh Co Ltd Image processing apparatus, image processing method, program, and recording medium
JP2004213423A (en) * 2003-01-06 2004-07-29 Sony Corp Information processor and method, and information processing program
US7453506B2 (en) * 2003-08-25 2008-11-18 Fujifilm Corporation Digital camera having a specified portion preview section
TWI300159B (en) 2004-12-24 2008-08-21 Sony Taiwan Ltd Camera system
JP2007053691A (en) 2005-08-19 2007-03-01 Micron Technol Inc Extended digital data route structure using sub-lsb
JP4442644B2 (en) * 2007-06-15 2010-03-31 株式会社デンソー Pipeline arithmetic unit
JP5059551B2 (en) * 2007-10-31 2012-10-24 株式会社デンソー Vehicle occupant detection device
JP2010068190A (en) * 2008-09-10 2010-03-25 Nikon Corp Digital camera, image processing apparatus and digital camera system
US8868925B2 (en) * 2008-12-09 2014-10-21 Nvidia Corporation Method and apparatus for the secure processing of confidential content within a virtual machine of a processor
JP2010282429A (en) * 2009-06-04 2010-12-16 Canon Inc Image processing device and control method thereof
KR20110133699A (en) * 2010-06-07 2011-12-14 삼성전자주식회사 Apparatus and method for acquiring image in portable terminal
US8488031B2 (en) 2011-01-14 2013-07-16 DigitalOptics Corporation Europe Limited Chromatic noise reduction method and apparatus
US9092267B2 (en) * 2011-06-20 2015-07-28 Qualcomm Incorporated Memory sharing in graphics processing unit
US9052740B2 (en) * 2013-03-12 2015-06-09 Qualcomm Incorporated Adaptive data path for computer-vision applications

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
CHEN J C ET AL: "CRISP: Coarse-Grained Reconfigurable Image Stream Processor for Digital Still Cameras and Camcorders", IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, IEEE SERVICE CENTER, PISCATAWAY, NJ, US, vol. 18, no. 9, 1 September 2008 (2008-09-01), pages 1223 - 1236, XP011231979, ISSN: 1051-8215, DOI: 10.1109/TCSVT.2008.928529 *
FUNG J ET AL: "OpenVIDIA: parallel GPU computer vision", ACM MULTIMEDIA, 2004 : PROCEEDINGS OF THE 12TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA ; OCTOBER 10 - 16, 2004, NEW YORK, NY, USA, ASSOC. FOR COMPUTING MACHINERY, NEW YORK, NY, USA, 1 January 2005 (2005-01-01), pages 849 - 852, XP008096985, ISBN: 978-1-58113-893-1, DOI: 10.1145/1101149.1101334 *
MATHIEU THEVENNIN ET AL: "The eISP low-power and tiny silicon footprint programmable video architecture", JOURNAL OF REAL-TIME IMAGE PROCESSING, vol. 6, no. 1, 17 June 2010 (2010-06-17), pages 33 - 46, XP055119423, ISSN: 1861-8200, DOI: 10.1007/s11554-010-0163-8 *
POUDEL PRAMOD ET AL: "Optimization of image processing algorithms on mobile platforms", REAL-TIME IMAGE AND VIDEO PROCESSING 2011, SPIE, 1000 20TH ST. BELLINGHAM WA 98225-6705 USA, vol. 7871, no. 1, 10 February 2011 (2011-02-10), pages 1 - 14, XP060005219, DOI: 10.1117/12.876520 *

Also Published As

Publication number Publication date
CN105190685A (en) 2015-12-23
BR112015022860A2 (en) 2017-07-18
JP2016515260A (en) 2016-05-26
EP2973386B1 (en) 2018-12-05
JP6706198B2 (en) 2020-06-03
US9052740B2 (en) 2015-06-09
KR20150126888A (en) 2015-11-13
US20140267790A1 (en) 2014-09-18
KR102188613B1 (en) 2020-12-08
EP2973386A1 (en) 2016-01-20
US9549120B2 (en) 2017-01-17
CN105190685B (en) 2018-10-26
US20150229839A1 (en) 2015-08-13

Similar Documents

Publication Publication Date Title
US9549120B2 (en) Adaptive data path for computer-vision applications
US11102398B2 (en) Distributing processing for imaging processing
JP7266672B2 (en) Image processing method, image processing apparatus, and device
US10021381B2 (en) Camera pose estimation
KR102018887B1 (en) Image preview using detection of body parts
CN108399349B (en) Image recognition method and device
US20140071245A1 (en) System and method for enhanced stereo imaging
US20110148868A1 (en) Apparatus and method for reconstructing three-dimensional face avatar through stereo vision and face detection
KR102155895B1 (en) Device and method to receive image by tracking object
WO2012093381A1 (en) Camera assembly with an integrated content analyzer
CN111880711B (en) Display control method, display control device, electronic equipment and storage medium
CN1980384A (en) Space mobile-object locking aim-searching device and method
US20220345628A1 (en) Method for image processing, electronic device, and storage medium
CN112132070B (en) Driving behavior analysis method, device, equipment and storage medium
US20220414830A1 (en) Method and apparatus for improved object detection
CN108495038B (en) Image processing method, image processing device, storage medium and electronic equipment
US10282633B2 (en) Cross-asset media analysis and processing
CN112672057B (en) Shooting method and device
CN111064886B (en) Shooting method of terminal equipment, terminal equipment and storage medium
CN118355405A (en) Segmentation with monocular depth estimation
CN118043859A (en) Efficient visual perception
CN111064889A (en) Shooting method of terminal equipment, terminal equipment and storage medium

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201480012145.1

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14717531

Country of ref document: EP

Kind code of ref document: A1

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2014717531

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2016500456

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 20157027143

Country of ref document: KR

Kind code of ref document: A

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112015022860

Country of ref document: BR

ENP Entry into the national phase

Ref document number: 112015022860

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20150911