WO2021089059A1 - Method and apparatus for smart object recognition, object recognition device, terminal device, and storage medium - Google Patents

Method and apparatus for smart object recognition, object recognition device, terminal device, and storage medium Download PDF

Info

Publication number
WO2021089059A1
WO2021089059A1 PCT/CN2020/138696 CN2020138696W WO2021089059A1 WO 2021089059 A1 WO2021089059 A1 WO 2021089059A1 CN 2020138696 W CN2020138696 W CN 2020138696W WO 2021089059 A1 WO2021089059 A1 WO 2021089059A1
Authority
WO
WIPO (PCT)
Prior art keywords
article
image
recognition
content text
data
Prior art date
Application number
PCT/CN2020/138696
Other languages
French (fr)
Chinese (zh)
Inventor
石翔
Original Assignee
昆山提莫智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 昆山提莫智能科技有限公司 filed Critical 昆山提莫智能科技有限公司
Publication of WO2021089059A1 publication Critical patent/WO2021089059A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/413Classification of content, e.g. text, photographs or tables
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/685Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers

Definitions

  • the present invention relates to the technical field of intelligent terminals, in particular to an intelligent object recognition method and device, intelligent object recognition equipment, terminal equipment and computer storage media.
  • Another way is to set up a recognition card.
  • Each card sets a recognition pressing point.
  • the name of the item corresponding to the card is played.
  • This way is for children to operate Sex is a bit complicated, and it is easy for children to lose interest.
  • the recognition card is also easy to fail. Compared with the previous method, there is still the problem of knowledge limitations.
  • the purpose of the present invention is to provide an intelligent object-recognition method and device, intelligent object-recognition equipment, terminal equipment and computer storage medium, which are used to solve the problems of knowledge limitation, independence and interest in children's object-recognition.
  • the present invention provides an intelligent object recognition method, including:
  • the preprocessing of the image of the object to be recognized to obtain image feature data specifically includes:
  • Compression, image binarization, gray-scale image processing, SIFT feature extraction, and intersection feature extraction are performed on the image of the object to be recognized to obtain image feature data.
  • said performing recognition in an article database according to the image feature data, obtaining a recognition result and generating article content text, and generating a first audio link according to the article content text specifically includes:
  • the matching in a resource library according to the article content text, obtaining resource data corresponding to the recognized text, and generating a second audio link according to the resource data specifically includes:
  • a search formula is generated according to the article content text, and a search is performed in the resource library according to the search formula. For example, resource data that exactly matches the search formula is retrieved, and the resource data includes the text corresponding to the article content For video, animation or audio data, one or more of the resource data is selected to generate a second audio link.
  • searching in the item database according to the feature of the object and obtaining the item name corresponding to the feature of the object specifically includes:
  • the article database is set in a remote server, the article database includes a characteristic article mapping table, and the corresponding article name is obtained according to the characteristics of the article;
  • the data interface of the article database is connected with a third-party knowledge base, and the data interface is used to update the characteristic article mapping table in real time.
  • the present invention also provides an intelligent object recognition device, the intelligent object recognition device comprising:
  • the image acquisition unit is used to acquire an image of the object to be identified
  • An image preprocessing unit configured to preprocess the image of the object to be recognized to obtain image feature data
  • the recognition unit is used to send the image feature data to the item database for recognition, obtain the recognition result and generate the item content text, generate the first audio link according to the item content text, and perform the process in the resource database according to the item content text Match, obtain resource data corresponding to the recognized text, and generate a second audio link according to the resource data;
  • the playing unit is configured to receive and select to play the first audio link and/or the second audio link.
  • the identification unit specifically includes:
  • the sending module is used to send the image feature data to the item database, wherein the item database is set in a remote server, and the server extracts the object feature corresponding to the core feature point data according to the image feature data, and then According to the object feature search in the article database, the article name corresponding to the object feature is obtained, the recognition result is output, the article content text is generated, and the article content text is synthesized into the first audio link.
  • the identification unit specifically includes:
  • a sending module for sending the image feature data to the article database
  • the first audio generation module is configured to search and obtain recognition results in the article database according to the image characteristics, generate article content text, and generate a first audio link based on the article content text;
  • the second audio generating module is configured to perform matching in the resource library according to the article content text, obtain resource data corresponding to the recognized text, and generate a second audio link according to the resource data.
  • the present invention further provides an embodiment, in which an intelligent object recognition device based on the aforementioned intelligent object recognition method, the intelligent object recognition device includes:
  • the camera module includes a camera
  • the camera is used to obtain the image of the object to be identified
  • the housing has a plurality of installation positions for installing the camera
  • a camera is installed on at least one of the installation positions, and any two straight lines where the installation positions are located intersect the straight lines where the handles are located, and if there are two installation positions, they are symmetrically arranged on the left and right sides of the children's encyclopedia knowledge device On both sides,
  • the housing includes a front cover, a rear cover, a front housing, and a rear housing.
  • the installation location is located on the front housing, and the rear cover has a through hole through which light can pass through the through hole to reach the The photosensitive area of the camera; the back cover has two symmetrically arranged circular protrusions, and the through hole is provided at the center of the protrusions.
  • the intelligent object recognition device further includes:
  • a circuit board the circuit board includes a main body portion accommodated in the housing and a connecting portion extending from the main body portion to the inside of the handle, the connecting portion is provided with an identification trigger, and the handle includes and The button matched with the object-recognizing trigger can be pressed to drive the object-recognizing trigger to act, and the object-recognizing method can be started.
  • the button part passes through the through hole and is detachably connected to the object triggering member.
  • the intelligent object recognition device further includes a display assembly
  • the display assembly includes a display panel, a mounting frame for fixing the display panel is provided on the front housing, and the display panel is mounted on the front housing.
  • the body is close to one side of the rear housing, and the display assembly is used for playing the first audio or the second audio.
  • the embodiment of the present invention also provides a terminal device, including a memory, a processor, and a computer program stored in the memory and running on the processor, and the processor realizes intelligent recognition when the computer program is run.
  • a terminal device including a memory, a processor, and a computer program stored in the memory and running on the processor, and the processor realizes intelligent recognition when the computer program is run. The steps of the physical method.
  • the embodiment of the present invention also provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps of the intelligent identification method are realized.
  • the intelligent object recognition method and device, intelligent object recognition equipment, terminal equipment and computer storage medium provided by the present invention can at least bring about the following beneficial effects:
  • a learning resource library is also provided.
  • the educational resources related to the item are also retrieved in the resource library, such as the historical origin, evolution process, composition, function and distribution of the item, etc. While enriching knowledge, it also enhances the fun of children's learning.
  • the third-party program interface is connected to the database of the object through the server, and the object of the object is expanded.
  • the object of the object is expanded infinitely, and the capacity of the knowledge base is increased. More importantly, the powerful computing power of the knowledge base can greatly increase the capacity of the knowledge base. Improve the speed of recognizing objects, and the user experience can be greatly improved.
  • the item classification can be quickly located according to the main features of the item, and then the object name can be further identified according to other features. On the one hand, it improves the speed of item recognition, and on the other hand, it also improves the item Accuracy of recognition.
  • Figure 1 is a schematic diagram of the process of the intelligent object recognition method in the present invention
  • FIG. 2 is a schematic diagram of the structure of an intelligent object recognition device in the present invention.
  • Figure 3 is a schematic diagram of another intelligent object recognition device in the present invention.
  • Figure 4 is a schematic diagram of the structure of yet another intelligent object recognition device in the present invention.
  • FIG. 5 is a schematic diagram of the front structure of a handheld children's encyclopedia recognition device according to a specific embodiment of the present invention.
  • FIG. 6 is an exploded schematic diagram of various components of the handheld children's encyclopedia recognition device according to a specific embodiment of the present invention.
  • FIG. 7 is a schematic diagram of the front shell structure of the handheld children's encyclopedia recognition device according to a specific embodiment of the present invention.
  • FIG. 8 is a schematic structural diagram of a camera module of a handheld children's encyclopedia object-recognizing device according to a specific embodiment of the present invention.
  • FIG. 9 is a schematic diagram of the front cover and the back cover of the handheld children's encyclopedia recognition device according to a specific embodiment of the present invention.
  • FIG. 10 is a schematic diagram of another structure of the front cover and the back cover of the handheld children's encyclopedia recognition device according to a specific embodiment of the present invention.
  • FIG. 11 is a schematic structural diagram of a terminal device according to a specific embodiment of the present invention.
  • FIG. 1 is an embodiment of the present invention, a method for intelligently identifying objects, including:
  • the user When the user needs to recognize an object, he can first press the camera button to focus on the object, and collect a picture of the object through the camera. Since children have not yet set appropriate camera parameters, users can configure the frequency and exposure time of the camera to obtain more suitable pictures through mobile phones, Bluetooth, wifi, or peer-to-peer, etc., so as to obtain more suitable pictures to facilitate follow-up. It is easy to obtain the main feature information of the object during feature recognition.
  • the preprocessing of the image of the object to be recognized to obtain image feature data specifically includes:
  • Compression, image binarization, gray-scale image processing, SIFT feature extraction, and intersection feature extraction are performed on the image of the object to be recognized to obtain image feature data.
  • Preprocess the collected images including but not limited to compressed pictures, image binarization, grayscale image processing, SIFT feature extraction, and intersection feature extraction.
  • the pre-processing action can be performed by the object-recognizing device, can also be completed by a mobile phone connected to the object-recognizing device, or even sent to a remote server for completion.
  • the above examples do not represent that the embodiments of the present invention limit the implementation subject of the image preprocessing process.
  • the identification in the article database based on the image feature data, obtaining the identification result and generating article content text, and generating the first audio link based on the article content text specifically includes:
  • the embodiment of the present invention waits in advance. Identify the core feature or main feature of the object picture or image, compare it with the object feature in the item database, obtain the object corresponding to the core feature point, and generate the item content text.
  • the article database is also provided with article classification, and a certain kind of article contains category characteristics, which is convenient for quickly searching and locating the name of the article.
  • the matching in a resource library according to the article content text, obtaining resource data corresponding to the recognized text, and generating a second audio link according to the resource data specifically includes:
  • a search formula is generated according to the article content text, and a search is performed in the resource library according to the search formula. For example, resource data that exactly matches the search formula is retrieved, and the resource data includes the text corresponding to the article content For video, animation or audio data, one or more of the resource data is selected to generate a second audio link.
  • the image content is first recognized and the text of the main object name in the image is output.
  • This text is the recognition result; then the text content is synthesized into voice content through TTS voice technology, and this voice content link is the first Audio link; then use the object name text to search in the server's audio, video, animation and other educational resource libraries, with complete overlap as the matching criterion, select the voice, animation, and video content, if the matching is successful.
  • This voice content link is the second audio link.
  • the generation of the first audio link and the second audio link may be completed by the identification device itself, or completed by the identification device and the server respectively, or all completed by the server.
  • searching in the item database according to the feature of the object and obtaining the item name corresponding to the feature of the object specifically includes:
  • the article database is set in a remote server, the article database includes a characteristic article mapping table, and the corresponding article name is obtained according to the characteristics of the article;
  • the data interface of the article database is connected with a third-party knowledge base, and the data interface is used to update the characteristic article mapping table in real time.
  • the collected or preprocessed images can be uploaded to the server via wifi or 4G/5G network, and the server can complete the feature recognition and search, and then obtain the article name text.
  • the article database is set in the remote
  • the server also includes a feature item mapping table, which directly maps object names from features to achieve rapid identification of items.
  • Embodiment 6 of the present invention provides an intelligent object-recognizing device 100, and the intelligent object-recognizing device includes:
  • the image acquisition unit 110 is used to acquire an image of the object to be identified
  • the image preprocessing unit 120 is configured to preprocess the image of the object to be recognized to obtain image feature data
  • the recognition unit 130 is configured to send the image feature data to the article database for identification, obtain the recognition result and generate article content text, generate a first audio link according to the article content text, and store the article content text in the resource database according to the article content text Performing matching, obtaining resource data corresponding to the recognized text, and generating a second audio link according to the resource data;
  • the playing unit 140 is configured to receive and select to play the first audio link and/or the second audio link.
  • the image acquisition unit includes: a camera; a processor, a camera button (which can be a physical button or a touch screen button), a speaker, an LCD display and an LED indicator, and the processor uses a wifi or bluetooth module or 4G
  • the /5G network is connected to the mobile phone, the camera and the camera button are connected to the input end of the processor, and the output end of the processor is connected to the speaker, LCD display and LED indicator.
  • the identification unit specifically includes:
  • the sending module 1301 is configured to send the image feature data to an item database, where the item database is set in a remote server, and the server extracts the object feature corresponding to the core feature point data according to the image feature data, and Search in the article database according to the object feature, obtain the article name corresponding to the object feature, output the recognition result, generate the article content text, and synthesize the article content text into the first audio link.
  • the article database and the resource library are both located in the server, the retrieval formula is generated according to the article content text, and the retrieval is performed in the resource database according to the retrieval formula. If the retrieval formula is completely retrieved, Matching resource data, where the resource data includes video, animation, or audio data corresponding to the content text of the article, then one or more of the resource data is selected to generate a second audio link.
  • the resource library is located in an object recognition device or a third-party mobile phone, a search formula is generated according to the content text of the article, and the resource library is searched according to the search formula. If the resource data that exactly matches the search formula, the resource data includes video, animation, or audio data corresponding to the content text of the article, then one or more of the resource data is selected to generate a second audio link.
  • the identification unit specifically includes:
  • the sending module 1301 is used to send the image feature data to the article database
  • the first audio generating module 1302 is configured to search and obtain a recognition result in the article database according to the image feature, generate article content text, and generate a first audio link according to the article content text;
  • the second audio generating module 1303 is configured to perform matching in the resource library according to the article content text, obtain resource data corresponding to the recognized text, and generate a second audio link according to the resource data.
  • an intelligent object recognition device based on the foregoing intelligent object recognition method includes:
  • the camera module includes a camera
  • the camera is used to obtain the image of the object to be identified
  • the housing has a plurality of installation positions for installing the camera
  • a camera is installed on at least one of the installation positions, and any two straight lines where the installation positions are located intersect the straight lines where the handles are located, and if there are two installation positions, they are symmetrically arranged on the left and right sides of the children's encyclopedia knowledge device On both sides,
  • the housing includes a front cover, a rear cover, a front housing, and a rear housing.
  • the installation location is located on the front housing, and the rear cover has a through hole through which light can pass through the through hole to reach the The photosensitive area of the camera; the back cover has two symmetrically arranged circular protrusions, and the through hole is provided at the center of the protrusions.
  • the intelligent object recognition device further includes:
  • a circuit board the circuit board includes a main body portion accommodated in the housing and a connecting portion extending from the main body portion to the inside of the handle, the connecting portion is provided with an identification trigger, and the handle includes and The button matched with the object-recognizing trigger can be pressed to drive the object-recognizing trigger to act, and the object-recognizing method can be started.
  • the button part passes through the through hole and is detachably connected to the object triggering member.
  • the intelligent object recognition device further includes a display assembly
  • the display assembly includes a display panel, a mounting frame for fixing the display panel is provided on the front housing, and the display panel is mounted on the front housing.
  • the body is close to one side of the rear housing, and the display assembly is used for playing the first audio or the second audio.
  • FIG. 5 it is a schematic diagram of the front structure of a smart object-recognizing device.
  • the smart object-recognizing device is a child's smart encyclopedia object-recognizing device or a child's camera device.
  • the children’s smart encyclopedia knowledge device includes a housing 20, a handle 10, and a camera module 60.
  • the camera module 60 includes a camera 61.
  • the camera 61 can be a color camera, a black and white camera, and a wide-angle camera.
  • the housing 20 has a plurality of installation positions for installing the camera, at least one of the installation positions is equipped with a camera, and the line where any two installation positions are located intersects the line where the handle 10 is located, and the housing 20 and the handle 10 are formed T-shaped.
  • the line where the two installation positions are located is perpendicular to the line where the handle 10 is located.
  • the line connecting the center points of the position forms a straight line L1, which extends in the horizontal direction, and the straight line L2 where the handle 10 is located extends in the vertical direction.
  • the children’s smart encyclopedia object recognition equipment of this application may include one camera or multiple cameras, but at least two camera installation positions can be installed. Cameras can be installed in both installation positions or only one of them can be installed according to actual needs.
  • the other mounting position can be equipped with a magnifying glass, flashlight or flashlight.
  • the two mounting positions form two eyes of the children’s encyclopedia knowledge device in appearance, which is convenient for the design of cartoon characters or animals that children like.
  • the two mounting positions are symmetrically arranged More beautiful.
  • FIG. 6 is an exploded schematic diagram of the components of the children's encyclopedia identification device of a specific embodiment of the present invention.
  • the children's encyclopedia identification device of the present application further includes a circuit board 30, which includes a main body 31 housed in a housing And the connecting portion 32 extending from the main body portion 31 to the inside of the handle 10, the connecting portion 32 is provided with an object-sensing trigger, the handle 10 includes a button 11 matched with the object-sensing trigger, pressing the button 11 can drive the object-sensing trigger to act , Start the smart identification device to work.
  • the object recognition trigger is an electronic component.
  • the camera module 60 is electrically connected to the circuit board 30. Specifically, the camera module 60 includes a camera 61, a soft board 62, and a connector 63. The camera 61 is mounted on the housing 20, and the connector 63 is electrically connected to the circuit board 30.
  • a battery 80 is provided in the handle 10, and the battery provides power to the circuit board 30.
  • the display assembly 40 includes a display panel 41, a protective foam 42, a display panel fixing member 43, and a front cover 44 for protecting the display panel.
  • the installation frame of the display panel 41, the display panel 41 is installed on the side of the front casing 21 close to the rear casing 22, and the front cover 44 is installed on the side of the front casing 21 away from the rear casing 22 to protect the foam 42 is adhered to a circumference of the mounting frame to protect the display panel.
  • the display panel 41 is mounted on the display panel fixing member 43 and then fixed to the mounting frame to better protect the display panel from damage.
  • the children’s encyclopedia identification device of the present application further includes a display panel 41, a protective foam 42, and a display panel fixing member 43.
  • a mounting frame for fixing the display panel 41 is provided on the housing, and the protective foam adheres to the circumference of the mounting frame.
  • the display panel 41 is installed on the display panel fixing member, and then fixed to the mounting frame, which better protects the display panel from damage.
  • the display panel can only have display functions, or it can integrate display and touch Control function.
  • the display panel may be a liquid crystal display panel (Liquid Crystal Display, LCD) or an organic light-emitting diode display panel (Organic light-emitting diode, OLED).
  • the housing 20 includes a front cover 24, a rear cover 23, a front housing 21 and a rear housing 22.
  • the handle 10 includes a handle front shell 12, a handle rear shell 13 and a silicone rear hand guard 14.
  • the front shell 21 and the handle front shell 12 The front case is integrally formed, and the rear case 22 and the handle rear case 13 are integrally disposed to form a rear case.
  • the front cover 24, the rear cover 23, the front case and the rear case are connected in sequence.
  • the front cover 24 and the rear cover 23 can be integrally formed, or Through the integrated structure of the assembled form, the front shell and the rear shell are fixedly connected by screws.
  • the front shell and the rear shell are jointly enclosed to form an accommodating cavity.
  • the circuit board 30, the battery 80, and the display panel 41 are all located in the accommodating cavity.
  • the battery 80 is electrically connected to the circuit board to provide power for the circuit board 30.
  • the handle 10 is roughly cylindrical, which is convenient to hold.
  • the handle 10 also includes a silicone front hand guard 15. Both the silicone front hand guard 15 and the silicone rear hand guard 14 have anti-slip structures.
  • the anti-slip structure in the present invention is an anti-slip stripe; at the same time,
  • the handle 10 also includes a supporting part for supporting the intelligent object recognition device to stand.
  • the bottom of the handle 10 is a flat surface, and the flat bottom forms the supporting part.
  • the camera module 60 is electrically connected to the circuit board 30. Specifically, the camera module 60 further includes a soft board 62 and a connector 63. The connector 63 is electrically connected to the circuit board 30.
  • the camera module 60 is used to collect images to form Corresponding to the image signal, the circuit board 30 is provided with a chip for processing the image signal. The chip preprocesses the image and uploads the image to the server via WiFi or 4G/5G.
  • the server first outputs the object name in text form after identifying the content of the image. Then, the text content is synthesized into voice content and output. Next, the object name is matched with the optimal animation content of the server's educational resource library. If the matching is successful, the animation content is output, and the handheld electronic device plays the voice content and the animation content in sequence.
  • Figure 7 is a schematic diagram of the front shell structure of the children's encyclopedia recognition device in a specific embodiment of the present invention
  • Figure 8 is a schematic diagram of the camera module structure of the children's encyclopedia recognition device in a specific embodiment of the present invention
  • the camera 61 is installed On the side of the front housing 21 away from the rear housing 22, two installation positions are provided on one side of the front housing 21 where the camera is installed.
  • the center connection of the two installation positions forms a straight line L1, which is along the length direction of the housing 20 Extend, the camera is installed on one of the installation positions in the present application.
  • the installation position is set as a receiving groove 211 with side walls on all sides.
  • the camera is fixed at the bottom of the receiving groove.
  • the camera 61 is fixed in the receiving groove 211
  • the soft board 62 passes through the opening 212
  • the connector 63 is electrically connected to the circuit board 30 located in the accommodating cavity.
  • Figure 9 is a schematic diagram of the front cover and the back cover of the children's encyclopedia recognition device in a specific embodiment of the present invention
  • FIG. 10 is another front cover and back cover of the children's encyclopedia recognition device in a specific embodiment of the present invention
  • the back cover 23 has a through hole 232 through which light can reach the photosensitive area of the camera 61.
  • the back cover 23 has two symmetrically arranged circular protrusions 231, and the through hole 232 is provided with At the center of the protrusion 231, the back cover 23 has a groove 233 for accommodating the microphone, and the bottom of the groove has a hole through which external sound can pass.
  • the front cover 24 includes a circular protrusion 231 for passing through
  • a small hole is also opened on the front cover 24 at a position corresponding to the bottom hole of the groove 233, so that the microphone can sensitively receive external sounds.
  • the children’s encyclopedia knowledge device has a convenient handle, and the camera has multiple optional installation positions, which is convenient to choose the appropriate installation position according to different shapes. Most children like animals or cartoon characters, and the camera can be used as an animal in shape Or the eyes of cartoon characters are novel in shape, which enhances the fun, and the handle is convenient for children to hold.
  • the embodiment of the present invention also provides a terminal device, including a memory, a processor, and a computer program that is stored in the memory and can run on the processor.
  • a terminal device including a memory, a processor, and a computer program that is stored in the memory and can run on the processor.
  • the processor runs the computer program, Steps to realize the method of intelligent object recognition.
  • FIG. 11 is a schematic structural diagram of a terminal device provided in an embodiment of the present invention.
  • the terminal device 200 includes: a processor 220, a memory 210, and a computer program stored in the memory 210 and running on the processor 220 211, such as: intelligent object recognition program.
  • the processor 220 executes the computer program 211, the steps in the foregoing embodiments of the smart object recognition method are implemented, or when the processor 220 executes the computer program 211, the functions of the modules in the foregoing embodiments of the smart object recognition device are implemented.
  • the terminal device 200 may be a notebook, a palmtop computer, a tablet computer, a mobile phone, and other devices.
  • the terminal device 200 may include, but is not limited to, a processor 220 and a memory 210.
  • FIG. 11 is only an example of the terminal device 200, and does not constitute a limitation on the terminal device 200. It may include more or less components than those shown in the figure, or a combination of certain components, or different components.
  • the terminal device 200 may also include input and output devices, display devices, network access devices, buses, and so on.
  • the processor 220 may be a central processing unit (Central Processing Unit, CPU), or other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), on-site Field-Programmable GateArray (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor 220 may be a microprocessor or the processor may also be any conventional processor or the like.
  • the memory 210 may be an internal storage unit of the terminal device 200, such as a hard disk or memory of the terminal device 200.
  • the memory 210 may also be an external storage device of the terminal device 200, such as a plug-in hard disk equipped on the terminal device 200, a smart memory card (SmartMedia Card, SMC), a Secure Digital (SD) card, and a flash memory card (Flash). Card) and so on.
  • the memory 210 may also include both an internal storage unit of the terminal device 200 and an external storage device.
  • the memory 210 is used to store the computer program 211 and other programs and data required by the terminal device 200.
  • the memory 210 may also be used to temporarily store data that has been output or will be output.
  • the intelligent object recognition method and device, intelligent object recognition equipment, terminal equipment and computer storage medium provided by the present invention can at least bring the following beneficial effects:
  • a learning resource library is also provided.
  • the educational resources related to the item are also retrieved in the resource library, such as the historical origin, evolution process, composition, function and distribution of the item, etc. While enriching knowledge, it also enhances the fun of children's learning.
  • the third-party program interface is connected to the database of the object through the server, and the object of the object is expanded.
  • the object of the object is expanded infinitely, and the capacity of the knowledge base is increased. More importantly, the powerful computing power of the knowledge base can greatly increase the capacity of the knowledge base. Improve the speed of recognizing objects, and the user experience can be greatly improved.
  • the item classification can be quickly located according to the main features of the item, and then the object name can be further identified according to other features. On the one hand, it improves the speed of item recognition, and on the other hand, it also improves the item Accuracy of recognition.
  • the disclosed terminal device and method may be implemented in other ways.
  • the terminal device embodiments described above are merely illustrative.
  • the division of modules or units is only a logical function division. In actual implementation, there may be other division methods.
  • multiple units or components may be Combined or can be integrated into another system, or some features can be ignored or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • the functional units in the various embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
  • the embodiment of the present invention also provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps of the intelligent identification method are realized.
  • the integrated module/unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium.
  • the present invention implements all or part of the processes in the above-mentioned embodiment methods, and can also be completed by sending instructions to the relevant hardware through the computer program 211.
  • the computer program 211 can be stored in a computer-readable storage medium. When executed by the processor 220, 211 may implement the steps of the foregoing method embodiments.
  • the computer program 211 includes: computer program code, and the computer program code may be in the form of source code, object code, executable file, or some intermediate forms.
  • the computer-readable storage medium may include: any entity or device capable of carrying the computer program 211 code, recording medium, U disk, mobile hard disk, magnetic disk, optical disk, computer memory, read-only memory (ROM, Read-Only Memory), random Access memory (RAM, Random Access Memory), electric carrier signal, telecommunications signal, software distribution medium, etc. It should be noted that the content contained in the computer-readable storage medium can be appropriately added or deleted according to the requirements of the legislation and patent practice in the jurisdiction. For example, in some jurisdictions, according to the legislation and patent practice, the computer-readable medium Does not include electrical carrier signals and telecommunication signals.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Theoretical Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Library & Information Science (AREA)
  • Human Computer Interaction (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Toys (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Provided in the present invention is a smart object recognition method, comprising acquiring an image of an object to be recognized; performing pre-processing on the image of the object to be recognized to obtain image feature data; on the basis of the image feature data, performing recognition in an object database to acquire a recognition result and generate an object content text, and on the basis of the object content text, generating a first audio link; on the basis of the object content text, performing matching in a resource library to acquire resource data corresponding to the recognition text, and on the basis of the resource data, generating a second audio link; and selecting to play the first audio link and/or the second audio link, thus solving the problems of enjoyment, independence, and knowledge limitation when a child is identifying objects.

Description

智能识物方法及装置、识物设备、终端设备、存储介质Intelligent object recognition method and device, object recognition equipment, terminal equipment, storage medium 技术领域Technical field
本发明涉及智能终端技术领域,尤指一种智能识物方法及装置、智能识物设备、终端设备及计算机存储介质。The present invention relates to the technical field of intelligent terminals, in particular to an intelligent object recognition method and device, intelligent object recognition equipment, terminal equipment and computer storage media.
背景技术Background technique
目前,市面上的越来越多的教育产品,或对儿童进行认知启蒙,或对儿童进行知识的教授,针对对世界充满好奇的孩童,其对周遭事物的认识的渴望催生了一系列的教育终端。At present, there are more and more educational products on the market, either for children’s cognitive enlightenment, or for children’s knowledge teaching. For children who are curious about the world, their desire to understand the things around them has given birth to a series of Education terminal.
例如,在儿童对物体的认知上,现有技术中,在儿童认识物体的实现方式主要是将物体印刷在图书上,儿童通过阅读文字或者他人指导下认识物体,这是最常见的方式,显然,该种教育方式一方面需要家长陪同,另一方面,基于印刷刊物的知识有限性,该种教授方式存在很大的知识局限性,趣味性也不足。For example, in children’s cognition of objects, in the prior art, the main way for children to recognize objects is to print the objects on books. This is the most common way for children to recognize objects by reading text or under the guidance of others. Obviously, on the one hand, this kind of education requires the accompany of parents. On the other hand, based on the limited knowledge of printed publications, this kind of teaching method has great knowledge limitations and lack of interest.
还有一种方式是,通过设置识物卡片,每张卡片设置识物按压点,当按下某个识物按压点时,播放该卡片对应的物品的名称,这种方式对于孩童来说,操作性有些复杂,容易使孩子失去兴致,同时识物卡片也容易失灵,与前种方式相比,仍然存在知识局限性的问题。Another way is to set up a recognition card. Each card sets a recognition pressing point. When a recognition pressing point is pressed, the name of the item corresponding to the card is played. This way is for children to operate Sex is a bit complicated, and it is easy for children to lose interest. At the same time, the recognition card is also easy to fail. Compared with the previous method, there is still the problem of knowledge limitations.
发明内容Summary of the invention
本发明的目的是提供一种智能识物方法及装置、智能识物设备、终端设备及计算机存储介质,用来解决儿童识物时的知识局限性、独立性和趣味性的问题。The purpose of the present invention is to provide an intelligent object-recognition method and device, intelligent object-recognition equipment, terminal equipment and computer storage medium, which are used to solve the problems of knowledge limitation, independence and interest in children's object-recognition.
为了实现上述发名目的,在一种实施例中,本发明提供了一种智能识物方法,包括:In order to achieve the above purpose of naming, in one embodiment, the present invention provides an intelligent object recognition method, including:
获取待识别物体的图像;Obtain an image of the object to be recognized;
对所述待识别物体的图像进行预处理,获得图像特征数据;Preprocessing the image of the object to be recognized to obtain image feature data;
根据所述图像特征数据在物品数据库进行识别,获得识别结果并生成物品 内容文本,根据所述物品内容文本生成第一音频链接;Perform recognition in an article database according to the image feature data, obtain a recognition result and generate article content text, and generate a first audio link according to the article content text;
根据所述物品内容文本在资源库中进行匹配,获得与所述识别文本相对应的资源数据,并根据所述资源数据生成第二音频链接;Matching in the resource library according to the content text of the article, obtaining resource data corresponding to the recognized text, and generating a second audio link according to the resource data;
选择播放所述第一音频链接和/或第二音频链接。Select to play the first audio link and/or the second audio link.
进一步的,所述对所述待识别物体的图像进行预处理,获得图像特征数据具体包括:Further, the preprocessing of the image of the object to be recognized to obtain image feature data specifically includes:
对所述待识别物体的图像进行压缩、图像二值化、灰度图处理、SIFT特征提取和交点特征提取,获得图像特征数据。Compression, image binarization, gray-scale image processing, SIFT feature extraction, and intersection feature extraction are performed on the image of the object to be recognized to obtain image feature data.
进一步的,所述根据所述图像特征数据在物品数据库进行识别,获得识别结果并生成物品内容文本,根据所述物品内容文本生成第一音频链接具体包括:Further, said performing recognition in an article database according to the image feature data, obtaining a recognition result and generating article content text, and generating a first audio link according to the article content text specifically includes:
根据所述图像特征数据,提取核心特征点数据对应的物体特征;Extracting the object features corresponding to the core feature point data according to the image feature data;
在物品数据库中根据所述物体特征查找,获取与所述物体特征对应的物品名称,输出识别结果,生成物品内容文本;Search in the article database according to the object feature, obtain the article name corresponding to the object feature, output the recognition result, and generate the article content text;
将所述物品内容文本合成为第一音频链接。Synthesize the content text of the article into a first audio link.
进一步的,所述根据所述物品内容文本在资源库中进行匹配,获得与所述识别文本相对应的资源数据,并根据所述资源数据生成第二音频链接具体包括:Further, the matching in a resource library according to the article content text, obtaining resource data corresponding to the recognized text, and generating a second audio link according to the resource data specifically includes:
根据所述所述物品内容文本生成检索式,按照所述检索式在资源库中进行检索,如检索到所述检索式完全匹配的资源数据,所述资源数据包括与所述物品内容文本对应的视频、动画或者音频数据,则选择所述资源数据中一种或多种,生成第二音频链接。A search formula is generated according to the article content text, and a search is performed in the resource library according to the search formula. For example, resource data that exactly matches the search formula is retrieved, and the resource data includes the text corresponding to the article content For video, animation or audio data, one or more of the resource data is selected to generate a second audio link.
进一步的,所述在物品数据库中根据所述物体特征查找,获取与所述物体特征对应的物品名称具体包括:Further, the searching in the item database according to the feature of the object and obtaining the item name corresponding to the feature of the object specifically includes:
所述物品数据库设置于远程服务器中,所述物品数据库包括一特征物品映射表,根据所述物体特征获取对应的物品名称;The article database is set in a remote server, the article database includes a characteristic article mapping table, and the corresponding article name is obtained according to the characteristics of the article;
其中,所述物品数据库的数据接口与第三方知识库连接,所述数据接口用来实时更新所述特征物品映射表。Wherein, the data interface of the article database is connected with a third-party knowledge base, and the data interface is used to update the characteristic article mapping table in real time.
在另一种实施例下,本发明还提供了一种智能识物装置,所述智能识物装置包括:In another embodiment, the present invention also provides an intelligent object recognition device, the intelligent object recognition device comprising:
图像获取单元,用于获取待识别物体的图像;The image acquisition unit is used to acquire an image of the object to be identified;
图像预处理单元,用于对所述待识别物体的图像进行预处理,获得图像特征数据;An image preprocessing unit, configured to preprocess the image of the object to be recognized to obtain image feature data;
识别单元,用于发送所述图像特征数据到物品数据库进行识别,获得识别结果并生成物品内容文本,根据所述物品内容文本生成第一音频链接,以及根据所述物品内容文本在资源库中进行匹配,获得与所述识别文本相对应的资源数据,并根据所述资源数据生成第二音频链接;The recognition unit is used to send the image feature data to the item database for recognition, obtain the recognition result and generate the item content text, generate the first audio link according to the item content text, and perform the process in the resource database according to the item content text Match, obtain resource data corresponding to the recognized text, and generate a second audio link according to the resource data;
播放单元,用于接收并选择播放所述第一音频链接和/或第二音频链接。The playing unit is configured to receive and select to play the first audio link and/or the second audio link.
进一步的,所述识别单元具体包括:Further, the identification unit specifically includes:
发送模块,用于发送所述图像特征数据到物品数据库中,其中,所述物品数据库设置于远程服务器中,所述服务器根据所述图像特征数据,提取核心特征点数据对应的物体特征,并在物品数据库中根据所述物体特征查找,获取与所述物体特征对应的物品名称,输出识别结果,生成物品内容文本,将所述物品内容文本合成为第一音频链接。The sending module is used to send the image feature data to the item database, wherein the item database is set in a remote server, and the server extracts the object feature corresponding to the core feature point data according to the image feature data, and then According to the object feature search in the article database, the article name corresponding to the object feature is obtained, the recognition result is output, the article content text is generated, and the article content text is synthesized into the first audio link.
在另一种实现下,所述识别单元具体包括:In another implementation, the identification unit specifically includes:
发送模块,用于发送所述图像特征数据到物品数据库中;A sending module for sending the image feature data to the article database;
第一音频生成模块,用于根据所述图像特征在物品数据库中查找并获得识别结果,生成物品内容文本,根据所述物品内容文本生成第一音频链接;The first audio generation module is configured to search and obtain recognition results in the article database according to the image characteristics, generate article content text, and generate a first audio link based on the article content text;
第二音频生成模块,用于根据所述物品内容文本在资源库中进行匹配,获得与所述识别文本相对应的资源数据,并根据所述资源数据生成第二音频链接。The second audio generating module is configured to perform matching in the resource library according to the article content text, obtain resource data corresponding to the recognized text, and generate a second audio link according to the resource data.
相应的,本发明又提供了一种实施例,在该实施例下的基于前述智能识物方法的智能识物设备,所述智能识物设备包括:Correspondingly, the present invention further provides an embodiment, in which an intelligent object recognition device based on the aforementioned intelligent object recognition method, the intelligent object recognition device includes:
包括壳体、手柄和摄像模组,所述摄像模组包括摄像头,所述摄像头用来获取所述待识别物体的图像,所述壳体上具有多个用于安装所述摄像头的安装位,至少一个所述安装位上安装有摄像头,任意两个所述安装位所在直线与所 述手柄所在直线相交,若所述安装位有2个则对称地设置在所述儿童百科识物设备的左右两侧,Comprising a housing, a handle and a camera module, the camera module includes a camera, the camera is used to obtain the image of the object to be identified, the housing has a plurality of installation positions for installing the camera, A camera is installed on at least one of the installation positions, and any two straight lines where the installation positions are located intersect the straight lines where the handles are located, and if there are two installation positions, they are symmetrically arranged on the left and right sides of the children's encyclopedia knowledge device On both sides,
其中,壳体包括前盖、后盖和前壳体、后壳体,所述安装位位于所述前壳体上,所述后盖上具有通孔,光线可穿过所述通孔到达所述摄像头的感光区域;所述后盖具有对称设置的两个圆形凸出部,所述通孔设于所述凸出部的中心位置。Wherein, the housing includes a front cover, a rear cover, a front housing, and a rear housing. The installation location is located on the front housing, and the rear cover has a through hole through which light can pass through the through hole to reach the The photosensitive area of the camera; the back cover has two symmetrically arranged circular protrusions, and the through hole is provided at the center of the protrusions.
进一步的,所述智能识物设备还包括:Further, the intelligent object recognition device further includes:
电路板,所述电路板包括容纳于所述壳体内的主体部和自所述主体部延伸至所述手柄内部的连接部,所述连接部上设置有识物触发件,所述手柄包括与所述识物触发件相配合的按键,按压所述按键可带动所述识物触发件动作,启动识物方法。A circuit board, the circuit board includes a main body portion accommodated in the housing and a connecting portion extending from the main body portion to the inside of the handle, the connecting portion is provided with an identification trigger, and the handle includes and The button matched with the object-recognizing trigger can be pressed to drive the object-recognizing trigger to act, and the object-recognizing method can be started.
优选的,所述按键部分穿过所述通孔,与所述识物触发件可脱离地连接。Preferably, the button part passes through the through hole and is detachably connected to the object triggering member.
优选的,所述智能识物设备还包括显示组件,所述显示组件包括显示面板,所述前壳体上设有用于固定所述显示面板的安装框,所述显示面板安装于所述前壳体靠近所述后壳体的一侧,所述显示组件用于播放所述第一音频或第二音频。Preferably, the intelligent object recognition device further includes a display assembly, the display assembly includes a display panel, a mounting frame for fixing the display panel is provided on the front housing, and the display panel is mounted on the front housing. The body is close to one side of the rear housing, and the display assembly is used for playing the first audio or the second audio.
本发明实施例还提供了一种终端设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,所述处理器运行所述计算机程序时实现智能识物方法的步骤。The embodiment of the present invention also provides a terminal device, including a memory, a processor, and a computer program stored in the memory and running on the processor, and the processor realizes intelligent recognition when the computer program is run. The steps of the physical method.
本发明实施例还提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现所述智能识物方法的步骤。The embodiment of the present invention also provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps of the intelligent identification method are realized.
本发明提供的智能识物方法及装置、智能识物设备、终端设备及计算机存储介质,至少能够带来以下有益效果:The intelligent object recognition method and device, intelligent object recognition equipment, terminal equipment and computer storage medium provided by the present invention can at least bring about the following beneficial effects:
1、通过摄像头实时获取待识别物体的图像,通过图像处理获得待识别物体的特征从而进行特征识别,获得物体识别结果,并播报,对于儿童来说趣味性十足,知识的渴望可以得到及时满足,实时性高,且识物对象不局限于手头的刊物或卡片,任意物体都在识别范围,提高了知识的广度。1. Obtain the image of the object to be recognized through the camera in real time, and obtain the feature of the object to be recognized through image processing to perform feature recognition, obtain the object recognition result, and broadcast it. It is full of fun for children, and the desire for knowledge can be satisfied in time. The real-time performance is high, and the object of recognizing is not limited to the publication or card at hand, and any object is in the recognition range, which increases the breadth of knowledge.
2.除了播放物品的名称,还提供了学习资源库,针对识别到的物体,还在资源库中检索物品相关的教育资源,例如物品的历史由来、进化过程、成分,作用及分布等等,丰富了知识的同时,还提高了孩童学习的乐趣。2. In addition to the name of the item to be played, a learning resource library is also provided. For the identified object, the educational resources related to the item are also retrieved in the resource library, such as the historical origin, evolution process, composition, function and distribution of the item, etc. While enriching knowledge, it also enhances the fun of children's learning.
3.通过服务器对识物的数据库外接第三方程序接口,进行了识物对象的扩展,将识别对象无限扩大,提高了知识库的容量,更重要的,通过知识库的强大计算能力,可以大大提高识别物体的速度,用户体验度得以大大提高。3. The third-party program interface is connected to the database of the object through the server, and the object of the object is expanded. The object of the object is expanded infinitely, and the capacity of the knowledge base is increased. More importantly, the powerful computing power of the knowledge base can greatly increase the capacity of the knowledge base. Improve the speed of recognizing objects, and the user experience can be greatly improved.
4.通过预先设置的物品特征映射表,可以快速地根据物品的主要特征来定位物品分类,继而根据其他特征来进一步识别物体名称,一方面提高了物品识别的速度,另一方面也提高了物品识别的精度。4. Through the pre-set item feature mapping table, the item classification can be quickly located according to the main features of the item, and then the object name can be further identified according to other features. On the one hand, it improves the speed of item recognition, and on the other hand, it also improves the item Accuracy of recognition.
附图说明Description of the drawings
下面将以明确易懂的方式,结合附图说明优选实施例,对上述特性、技术特征、优点及其实现方式予以进一步说明。Hereinafter, the preferred embodiments will be described in a clear and easy-to-understand manner in conjunction with the accompanying drawings to further illustrate the above-mentioned characteristics, technical features, advantages and implementation methods.
图1为本发明中智能识物方法流程示意图;Figure 1 is a schematic diagram of the process of the intelligent object recognition method in the present invention;
图2为本发明中一种智能识物装置结构示意图;2 is a schematic diagram of the structure of an intelligent object recognition device in the present invention;
图3为本发明中另一种智能识物装置结构示意图;Figure 3 is a schematic diagram of another intelligent object recognition device in the present invention;
图4为本发明中再一种智能识物装置结构示意图;Figure 4 is a schematic diagram of the structure of yet another intelligent object recognition device in the present invention;
图5为本发明具体实施例手持式儿童百科识物设备的正面结构示意图;5 is a schematic diagram of the front structure of a handheld children's encyclopedia recognition device according to a specific embodiment of the present invention;
图6为本发明具体实施例手持式儿童百科识物设备的各部件分解示意图;6 is an exploded schematic diagram of various components of the handheld children's encyclopedia recognition device according to a specific embodiment of the present invention;
图7为本发明具体实施例手持式儿童百科识物设备的前壳结构示意图;FIG. 7 is a schematic diagram of the front shell structure of the handheld children's encyclopedia recognition device according to a specific embodiment of the present invention;
图8为本发明具体实施例手持式儿童百科识物设备的摄像模组结构示意图;FIG. 8 is a schematic structural diagram of a camera module of a handheld children's encyclopedia object-recognizing device according to a specific embodiment of the present invention;
图9为本发明具体实施例手持式儿童百科识物设备的前盖和后盖结构示意图;9 is a schematic diagram of the front cover and the back cover of the handheld children's encyclopedia recognition device according to a specific embodiment of the present invention;
图10为本发明具体实施例手持式儿童百科识物设备的前盖和后盖另一结构示意图;10 is a schematic diagram of another structure of the front cover and the back cover of the handheld children's encyclopedia recognition device according to a specific embodiment of the present invention;
图11为本发明具体实施例终端设备的结构示意图。FIG. 11 is a schematic structural diagram of a terminal device according to a specific embodiment of the present invention.
具体实施方式Detailed ways
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对照附图说明本发明的具体实施例。显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图,并获得其他的实施例。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, specific embodiments of the present invention will be described below with reference to the drawings. Obviously, the drawings in the following description are only some embodiments of the present invention. For those of ordinary skill in the art, without creative work, other drawings can be obtained based on these drawings and obtained Other embodiments.
如图1所示为本发明实施例一种智能识物的方法,包括:As shown in FIG. 1 is an embodiment of the present invention, a method for intelligently identifying objects, including:
S10.获取待识别物体的图像;S10. Obtain an image of the object to be recognized;
S20.对所述待识别物体的图像进行预处理,获得图像特征数据;S20. Preprocess the image of the object to be recognized to obtain image feature data;
S30.根据所述图像特征数据在物品数据库进行识别,获得识别结果并生成物品内容文本,根据所述物品内容文本生成第一音频链接;S30. Perform recognition in an article database according to the image feature data, obtain a recognition result and generate article content text, and generate a first audio link based on the article content text;
S40.根据所述物品内容文本在资源库中进行匹配,获得与所述识别文本相对应的资源数据,并根据所述资源数据生成第二音频链接;S40. Perform matching in the resource library according to the article content text, obtain resource data corresponding to the recognized text, and generate a second audio link according to the resource data;
S50.选择播放所述第一音频链接和/或第二音频链接。S50. Select to play the first audio link and/or the second audio link.
当用户需要认识某个物体时,可以先按下拍照按钮,对物体进行对焦,通过摄像头采集物体图片。由于孩童尚不会设置合适的拍照参数,用户可以通过手机,通过蓝牙、wifi或者点对点等方式,对识物设备的拍照采集频率,曝光时间进行配置,从而获取更合适的图片,以利于后续进行特征识别时很容易的获取物体的主要特征信息。When the user needs to recognize an object, he can first press the camera button to focus on the object, and collect a picture of the object through the camera. Since children have not yet set appropriate camera parameters, users can configure the frequency and exposure time of the camera to obtain more suitable pictures through mobile phones, Bluetooth, wifi, or peer-to-peer, etc., so as to obtain more suitable pictures to facilitate follow-up. It is easy to obtain the main feature information of the object during feature recognition.
基于实施例1,在本发明的另外的实施例2中,所述对所述待识别物体的图像进行预处理,获得图像特征数据具体包括:Based on embodiment 1, in another embodiment 2 of the present invention, the preprocessing of the image of the object to be recognized to obtain image feature data specifically includes:
对所述待识别物体的图像进行压缩、图像二值化、灰度图处理、SIFT特征提取和交点特征提取,获得图像特征数据。Compression, image binarization, gray-scale image processing, SIFT feature extraction, and intersection feature extraction are performed on the image of the object to be recognized to obtain image feature data.
对采集到的图像进行预处理,包括但不限于压缩图片、图像二值化、灰度图处理、SIFT特征提取和交点特征提取。该预处理动作可以由识物设备进行,也可以由与该识物设备连接的手机完成,甚至发送至远程服务器完成。以上举例不代表本发明实施例限定该图像预处理过程的实施主体。Preprocess the collected images, including but not limited to compressed pictures, image binarization, grayscale image processing, SIFT feature extraction, and intersection feature extraction. The pre-processing action can be performed by the object-recognizing device, can also be completed by a mobile phone connected to the object-recognizing device, or even sent to a remote server for completion. The above examples do not represent that the embodiments of the present invention limit the implementation subject of the image preprocessing process.
在本发明实施例3中,基于前述实施方式,所述根据所述图像特征数据在物品数据库进行识别,获得识别结果并生成物品内容文本,根据所述物品内容 文本生成第一音频链接具体包括:In Embodiment 3 of the present invention, based on the foregoing implementation, the identification in the article database based on the image feature data, obtaining the identification result and generating article content text, and generating the first audio link based on the article content text specifically includes:
根据所述图像特征数据,提取核心特征点数据对应的物体特征;Extracting the object features corresponding to the core feature point data according to the image feature data;
在物品数据库中根据所述物体特征查找,获取与所述物体特征对应的物品名称,输出识别结果,生成物品内容文本;Search in the article database according to the object feature, obtain the article name corresponding to the object feature, output the recognition result, and generate the article content text;
将所述物品内容文本合成为第一音频链接。Synthesize the content text of the article into a first audio link.
在提取了特征数据后,由于采集到的图像的特征较多,甚至可能因为拍照角度的问题,还含有其他物体的特征,但一般会偏重于主要物体的特征,因此,本发明实施例提前待识别物体图片或者图像的核心特征,或者主要特征,与物品数据库中的物体特征进行对比,获取核心特征点对应的物体,生成物品内容文本。After the feature data is extracted, the collected images have many features, and may even contain features of other objects due to the camera angle. However, the features of the main objects are generally emphasized. Therefore, the embodiment of the present invention waits in advance. Identify the core feature or main feature of the object picture or image, compare it with the object feature in the item database, obtain the object corresponding to the core feature point, and generate the item content text.
在另一种实施例5下,所述物品数据库还设置了物品分类,某类物品包含类别特征,便于快速查找定位物品名称。In another embodiment 5, the article database is also provided with article classification, and a certain kind of article contains category characteristics, which is convenient for quickly searching and locating the name of the article.
进一步的,所述根据所述物品内容文本在资源库中进行匹配,获得与所述识别文本相对应的资源数据,并根据所述资源数据生成第二音频链接具体包括:Further, the matching in a resource library according to the article content text, obtaining resource data corresponding to the recognized text, and generating a second audio link according to the resource data specifically includes:
根据所述所述物品内容文本生成检索式,按照所述检索式在资源库中进行检索,如检索到所述检索式完全匹配的资源数据,所述资源数据包括与所述物品内容文本对应的视频、动画或者音频数据,则选择所述资源数据中一种或多种,生成第二音频链接。A search formula is generated according to the article content text, and a search is performed in the resource library according to the search formula. For example, resource data that exactly matches the search formula is retrieved, and the resource data includes the text corresponding to the article content For video, animation or audio data, one or more of the resource data is selected to generate a second audio link.
通过对接百度/阿里/腾讯AI服务,首先识别出图片内容并输出图片中主要物体名称的文本,此文本为识别结果;然后将文本内容通过TTS语音技术合成语音内容,此语音内容链接为第一音频链接;紧接着使用物体名称文本在服务器的音频、视频、动画等教育资源库中进行检索,以完全重合为匹配标准,选择语音、动画、视频内容,如果匹配成功。此语音内容链接为第二音频链接。By docking with Baidu/Ali/Tencent AI services, the image content is first recognized and the text of the main object name in the image is output. This text is the recognition result; then the text content is synthesized into voice content through TTS voice technology, and this voice content link is the first Audio link; then use the object name text to search in the server's audio, video, animation and other educational resource libraries, with complete overlap as the matching criterion, select the voice, animation, and video content, if the matching is successful. This voice content link is the second audio link.
第一音频链接和第二音频链接的生成,可以由识物设备自己完成,或者分别由识物设备和服务器完成,或者全部由服务器完成。The generation of the first audio link and the second audio link may be completed by the identification device itself, or completed by the identification device and the server respectively, or all completed by the server.
进一步的,所述在物品数据库中根据所述物体特征查找,获取与所述物体特征对应的物品名称具体包括:Further, the searching in the item database according to the feature of the object and obtaining the item name corresponding to the feature of the object specifically includes:
所述物品数据库设置于远程服务器中,所述物品数据库包括一特征物品映射表,根据所述物体特征获取对应的物品名称;The article database is set in a remote server, the article database includes a characteristic article mapping table, and the corresponding article name is obtained according to the characteristics of the article;
其中,所述物品数据库的数据接口与第三方知识库连接,所述数据接口用来实时更新所述特征物品映射表。Wherein, the data interface of the article database is connected with a third-party knowledge base, and the data interface is used to update the characteristic article mapping table in real time.
在一种实现方式下,可以将采集到的或者经过预处理的图像通过wifi或4G/5G网络上传至服务器,由服务器完成特征识别查找,进而得到物品名称文本,优选的,物品数据库设置于远程服务器中,还包括了一特征物品映射表,由特征直接映射物体名称,实现物品的快速识别。In one implementation mode, the collected or preprocessed images can be uploaded to the server via wifi or 4G/5G network, and the server can complete the feature recognition and search, and then obtain the article name text. Preferably, the article database is set in the remote The server also includes a feature item mapping table, which directly maps object names from features to achieve rapid identification of items.
如图2所示,本发明实施例6提供了一种智能识物装置100,所述智能识物装置包括:As shown in FIG. 2, Embodiment 6 of the present invention provides an intelligent object-recognizing device 100, and the intelligent object-recognizing device includes:
图像获取单元110,用于获取待识别物体的图像;The image acquisition unit 110 is used to acquire an image of the object to be identified;
图像预处理单元120,用于对所述待识别物体的图像进行预处理,获得图像特征数据;The image preprocessing unit 120 is configured to preprocess the image of the object to be recognized to obtain image feature data;
识别单元130,用于发送所述图像特征数据到物品数据库进行识别,获得识别结果并生成物品内容文本,根据所述物品内容文本生成第一音频链接,以及根据所述物品内容文本在资源库中进行匹配,获得与所述识别文本相对应的资源数据,并根据所述资源数据生成第二音频链接;The recognition unit 130 is configured to send the image feature data to the article database for identification, obtain the recognition result and generate article content text, generate a first audio link according to the article content text, and store the article content text in the resource database according to the article content text Performing matching, obtaining resource data corresponding to the recognized text, and generating a second audio link according to the resource data;
播放单元140,用于接收并选择播放所述第一音频链接和/或第二音频链接。The playing unit 140 is configured to receive and select to play the first audio link and/or the second audio link.
例如,所述图像获取单元包括:摄像头;处理器,拍照按钮(可以为一物理按键,也可以为触摸屏按钮),扬声器,LCD显示屏和LED指示灯,所述处理器用wifi或蓝牙模块或4G/5G网络与手机连接,摄像头和拍照按钮与处理器的输入端连接,处理器输出端与扬声器、LCD显示屏和LED指示灯相连。For example, the image acquisition unit includes: a camera; a processor, a camera button (which can be a physical button or a touch screen button), a speaker, an LCD display and an LED indicator, and the processor uses a wifi or bluetooth module or 4G The /5G network is connected to the mobile phone, the camera and the camera button are connected to the input end of the processor, and the output end of the processor is connected to the speaker, LCD display and LED indicator.
如图3所示,实施例7中,所述识别单元具体包括:As shown in FIG. 3, in Embodiment 7, the identification unit specifically includes:
发送模块1301,用于发送所述图像特征数据到物品数据库中,其中,所述物品数据库设置于远程服务器中,所述服务器根据所述图像特征数据,提取核心特征点数据对应的物体特征,并在物品数据库中根据所述物体特征查找,获取与所述物体特征对应的物品名称,输出识别结果,生成物品内容文本,将所述物品内容文本合成为第一音频链接。The sending module 1301 is configured to send the image feature data to an item database, where the item database is set in a remote server, and the server extracts the object feature corresponding to the core feature point data according to the image feature data, and Search in the article database according to the object feature, obtain the article name corresponding to the object feature, output the recognition result, generate the article content text, and synthesize the article content text into the first audio link.
在实施例7中,所述物品数据库和资源库都位于服务器中,根据所述所述物品内容文本生成检索式,按照所述检索式在资源库中进行检索,如检索到所述检索式完全匹配的资源数据,所述资源数据包括与所述物品内容文本对应的视频、动画或者音频数据,则选择所述资源数据中一种或多种,生成第二音频链接。In embodiment 7, the article database and the resource library are both located in the server, the retrieval formula is generated according to the article content text, and the retrieval is performed in the resource database according to the retrieval formula. If the retrieval formula is completely retrieved, Matching resource data, where the resource data includes video, animation, or audio data corresponding to the content text of the article, then one or more of the resource data is selected to generate a second audio link.
在实施例8中,所述资源库位于识物设备中,或者第三方手机中,根据所述所述物品内容文本生成检索式,按照所述检索式在资源库中进行检索,如检索到所述检索式完全匹配的资源数据,所述资源数据包括与所述物品内容文本对应的视频、动画或者音频数据,则选择所述资源数据中一种或多种,生成第二音频链接。In Embodiment 8, the resource library is located in an object recognition device or a third-party mobile phone, a search formula is generated according to the content text of the article, and the resource library is searched according to the search formula. If the resource data that exactly matches the search formula, the resource data includes video, animation, or audio data corresponding to the content text of the article, then one or more of the resource data is selected to generate a second audio link.
实施例9中,如图4所示,所述识别单元具体包括:In Embodiment 9, as shown in FIG. 4, the identification unit specifically includes:
发送模块1301,用于发送所述图像特征数据到物品数据库中;The sending module 1301 is used to send the image feature data to the article database;
第一音频生成模块1302,用于根据所述图像特征在物品数据库中查找并获得识别结果,生成物品内容文本,根据所述物品内容文本生成第一音频链接;The first audio generating module 1302 is configured to search and obtain a recognition result in the article database according to the image feature, generate article content text, and generate a first audio link according to the article content text;
第二音频生成模块1303,用于根据所述物品内容文本在资源库中进行匹配,获得与所述识别文本相对应的资源数据,并根据所述资源数据生成第二音频链接。The second audio generating module 1303 is configured to perform matching in the resource library according to the article content text, obtain resource data corresponding to the recognized text, and generate a second audio link according to the resource data.
如图5所示,本发明又提供了实施例10,在该实施例下的基于前述智能识物方法的智能识物设备,所述智能识物设备包括:As shown in FIG. 5, the present invention further provides Embodiment 10. In this embodiment, an intelligent object recognition device based on the foregoing intelligent object recognition method includes:
包括壳体、手柄和摄像模组,所述摄像模组包括摄像头,所述摄像头用来获取所述待识别物体的图像,所述壳体上具有多个用于安装所述摄像头的安装位,至少一个所述安装位上安装有摄像头,任意两个所述安装位所在直线与所述手柄所在直线相交,若所述安装位有2个则对称地设置在所述儿童百科识物设备的左右两侧,Comprising a housing, a handle and a camera module, the camera module includes a camera, the camera is used to obtain the image of the object to be identified, the housing has a plurality of installation positions for installing the camera, A camera is installed on at least one of the installation positions, and any two straight lines where the installation positions are located intersect the straight lines where the handles are located, and if there are two installation positions, they are symmetrically arranged on the left and right sides of the children's encyclopedia knowledge device On both sides,
其中,壳体包括前盖、后盖和前壳体、后壳体,所述安装位位于所述前壳体上,所述后盖上具有通孔,光线可穿过所述通孔到达所述摄像头的感光区域;所述后盖具有对称设置的两个圆形凸出部,所述通孔设于所述凸出部的中心位置。Wherein, the housing includes a front cover, a rear cover, a front housing, and a rear housing. The installation location is located on the front housing, and the rear cover has a through hole through which light can pass through the through hole to reach the The photosensitive area of the camera; the back cover has two symmetrically arranged circular protrusions, and the through hole is provided at the center of the protrusions.
进一步的,所述智能识物设备还包括:Further, the intelligent object recognition device further includes:
电路板,所述电路板包括容纳于所述壳体内的主体部和自所述主体部延伸至所述手柄内部的连接部,所述连接部上设置有识物触发件,所述手柄包括与所述识物触发件相配合的按键,按压所述按键可带动所述识物触发件动作,启动识物方法。A circuit board, the circuit board includes a main body portion accommodated in the housing and a connecting portion extending from the main body portion to the inside of the handle, the connecting portion is provided with an identification trigger, and the handle includes and The button matched with the object-recognizing trigger can be pressed to drive the object-recognizing trigger to act, and the object-recognizing method can be started.
优选的,所述按键部分穿过所述通孔,与所述识物触发件可脱离地连接。Preferably, the button part passes through the through hole and is detachably connected to the object triggering member.
优选的,所述智能识物设备还包括显示组件,所述显示组件包括显示面板,所述前壳体上设有用于固定所述显示面板的安装框,所述显示面板安装于所述前壳体靠近所述后壳体的一侧,所述显示组件用于播放所述第一音频或第二音频。Preferably, the intelligent object recognition device further includes a display assembly, the display assembly includes a display panel, a mounting frame for fixing the display panel is provided on the front housing, and the display panel is mounted on the front housing. The body is close to one side of the rear housing, and the display assembly is used for playing the first audio or the second audio.
如图5所示,为智能识物设备的正面结构示意图,所述智能识物设备为儿童智能百科识物设备或者儿童拍照设备,本实施例以儿童智能百科识物设备为例进行说明。As shown in FIG. 5, it is a schematic diagram of the front structure of a smart object-recognizing device. The smart object-recognizing device is a child's smart encyclopedia object-recognizing device or a child's camera device.
如图5、6、7、8所示,儿童智能百科识物设备包括壳体20、手柄10和摄像模组60,摄像模组60包括摄像头61,摄像头61可以是彩色摄像头、黑白摄像头、广角摄像头或变焦摄像头,壳体20上具有多个用于安装摄像头的安装位,至少一个安装位上安装有摄像头,任意两个安装位所在直线与手柄10所在直线相交,壳体20与手柄10形成T形。As shown in Figures 5, 6, 7, and 8, the children’s smart encyclopedia knowledge device includes a housing 20, a handle 10, and a camera module 60. The camera module 60 includes a camera 61. The camera 61 can be a color camera, a black and white camera, and a wide-angle camera. For a camera or a zoom camera, the housing 20 has a plurality of installation positions for installing the camera, at least one of the installation positions is equipped with a camera, and the line where any two installation positions are located intersects the line where the handle 10 is located, and the housing 20 and the handle 10 are formed T-shaped.
壳体20上具有两个安装位,两个安装位所在直线与手柄10所在直线垂直设置,本申请中安装位有2个且对称地设置在儿童百科识物设备的左右两侧,两个安装位的中心点连线形成直线L1,L1沿水平方向延伸,手柄10所在直线L2沿竖直方向延伸。There are two installation positions on the housing 20. The line where the two installation positions are located is perpendicular to the line where the handle 10 is located. In this application, there are two installation positions and are symmetrically arranged on the left and right sides of the children's encyclopedia object recognition device. The line connecting the center points of the position forms a straight line L1, which extends in the horizontal direction, and the straight line L2 where the handle 10 is located extends in the vertical direction.
本申请的儿童智能百科识物设备可以包括一个摄像头或者多个摄像头,但摄像头的安装位至少设置2个,可以根据实际需要在2个安装位上都安装摄像头或者只在其中的一个上安装摄像头,另一个安装位上可以安装放大镜、闪光 灯或手电筒,两个安装位在外形上形成儿童百科识物设备的两个眼睛,方便造型设计为儿童喜欢的卡通人物或者动物,两个安装位对称设置更为美观。The children’s smart encyclopedia object recognition equipment of this application may include one camera or multiple cameras, but at least two camera installation positions can be installed. Cameras can be installed in both installation positions or only one of them can be installed according to actual needs. , The other mounting position can be equipped with a magnifying glass, flashlight or flashlight. The two mounting positions form two eyes of the children’s encyclopedia knowledge device in appearance, which is convenient for the design of cartoon characters or animals that children like. The two mounting positions are symmetrically arranged More beautiful.
进一步参阅图6,图6是本发明具体实施例儿童百科识物设备的各部件分解示意图,本申请的儿童百科识物设备还包括电路板30,电路板30包括容纳于壳体内的主体部31和自主体部31延伸至手柄10内部的连接部32,连接部32上设置有识物触发件,手柄10包括与识物触发件相配合的按键11,按压按键11可带动识物触发件动作,启动智能识别装置工作。识物触发件为电子件,摄像头模组60与电路板30电性连接,具体地,摄像模组60包括摄像头61、软板62和连接器63,摄像头61安装在壳体20上,连接器63与电路板30电性连接。Further refer to FIG. 6, which is an exploded schematic diagram of the components of the children's encyclopedia identification device of a specific embodiment of the present invention. The children's encyclopedia identification device of the present application further includes a circuit board 30, which includes a main body 31 housed in a housing And the connecting portion 32 extending from the main body portion 31 to the inside of the handle 10, the connecting portion 32 is provided with an object-sensing trigger, the handle 10 includes a button 11 matched with the object-sensing trigger, pressing the button 11 can drive the object-sensing trigger to act , Start the smart identification device to work. The object recognition trigger is an electronic component. The camera module 60 is electrically connected to the circuit board 30. Specifically, the camera module 60 includes a camera 61, a soft board 62, and a connector 63. The camera 61 is mounted on the housing 20, and the connector 63 is electrically connected to the circuit board 30.
手柄10内设置有电池80,电池为电路板30提供电源。A battery 80 is provided in the handle 10, and the battery provides power to the circuit board 30.
还包括显示组件40和播放器50,显示组件40包括显示面板41、保护泡棉42、显示面板固定件43和用于保护显示面板的前盖板44,前壳体21上设有用于固定所述显示面板41的安装框,显示面板41安装于前壳体21靠近后壳体22的一侧,前盖板44安装于前壳体21远离所述后壳体22的一侧,保护泡棉42粘附于安装框的一周,对显示面板进行保护,显示面板41安装在显示面板固定件43上,然后固定到安装框上,已更好的保护显示面板不受损伤。即本申请的儿童百科识物设备还包括显示面板41、保护泡棉42和显示面板固定件43,壳体上设置有用于固定显示面板41的安装框,保护泡棉粘附于安装框的一周,对显示面板进行保护,显示面板41安装在显示面板固定件上,然后固定到安装框上,已更好的保护显示面板不受损伤,显示面板可以只具有显示功能,也可以集成显示和触控功能。显示面板可以是液晶显示面板(Liquid Crystal Display,LCD)或者有机发光二极管显示面板(Organic light-emitting  diode,OLED)。It also includes a display assembly 40 and a player 50. The display assembly 40 includes a display panel 41, a protective foam 42, a display panel fixing member 43, and a front cover 44 for protecting the display panel. The installation frame of the display panel 41, the display panel 41 is installed on the side of the front casing 21 close to the rear casing 22, and the front cover 44 is installed on the side of the front casing 21 away from the rear casing 22 to protect the foam 42 is adhered to a circumference of the mounting frame to protect the display panel. The display panel 41 is mounted on the display panel fixing member 43 and then fixed to the mounting frame to better protect the display panel from damage. That is, the children’s encyclopedia identification device of the present application further includes a display panel 41, a protective foam 42, and a display panel fixing member 43. A mounting frame for fixing the display panel 41 is provided on the housing, and the protective foam adheres to the circumference of the mounting frame. , To protect the display panel, the display panel 41 is installed on the display panel fixing member, and then fixed to the mounting frame, which better protects the display panel from damage. The display panel can only have display functions, or it can integrate display and touch Control function. The display panel may be a liquid crystal display panel (Liquid Crystal Display, LCD) or an organic light-emitting diode display panel (Organic light-emitting diode, OLED).
壳体20包括前盖24、后盖23、前壳体21和后壳体22,手柄10包括手柄前壳12、手柄后壳13和硅胶后护手14,前壳体21与手柄前壳12一体设置形成前壳,后壳体22与手柄后壳13一体设置形成后壳,前盖24、后盖23、前壳和后壳依次连接,前盖24与后盖23能够一体成型,也可以通过组装形式一体式结构,前壳与后壳通过螺丝固定连接。前壳与后壳共同围合形成一个容纳腔,电路板30、电池80、显示面板41均位于该容纳腔中,电池80与电路板电连接,为电路板30提供电源。The housing 20 includes a front cover 24, a rear cover 23, a front housing 21 and a rear housing 22. The handle 10 includes a handle front shell 12, a handle rear shell 13 and a silicone rear hand guard 14. The front shell 21 and the handle front shell 12 The front case is integrally formed, and the rear case 22 and the handle rear case 13 are integrally disposed to form a rear case. The front cover 24, the rear cover 23, the front case and the rear case are connected in sequence. The front cover 24 and the rear cover 23 can be integrally formed, or Through the integrated structure of the assembled form, the front shell and the rear shell are fixedly connected by screws. The front shell and the rear shell are jointly enclosed to form an accommodating cavity. The circuit board 30, the battery 80, and the display panel 41 are all located in the accommodating cavity. The battery 80 is electrically connected to the circuit board to provide power for the circuit board 30.
手柄10大致呈圆柱状,方便手握,手柄10还包括硅胶前护手15,硅胶前护手15和硅胶后护手14上均具有防滑结构,本实用新型中防滑结构为防滑条纹;同时,手柄10还包括用于支撑智能识物装置站立的支撑部,手柄10的底部为平面,平面底部形成支撑部The handle 10 is roughly cylindrical, which is convenient to hold. The handle 10 also includes a silicone front hand guard 15. Both the silicone front hand guard 15 and the silicone rear hand guard 14 have anti-slip structures. The anti-slip structure in the present invention is an anti-slip stripe; at the same time, The handle 10 also includes a supporting part for supporting the intelligent object recognition device to stand. The bottom of the handle 10 is a flat surface, and the flat bottom forms the supporting part.
摄像模组60与电路板30电性连接,具体地,摄像模组60还包括软板62和连接器63,连接器63与电路板30电性连接,摄像模组60用于采集图像以形成对应的图像信号,电路板30上设置有处理图像信号的芯片,芯片对图像进行预处理,通过wifi或者4G/5G将图像上传至服务器,服务器识别出图片内容后先以文本形式输出物体名称,然后再将文本内容合成语音内容输出,接下来,对物体名称与服务器的教育资源库匹配最优动画内容,如匹配成功,则输出动画内容,手持电子设备依次播放语音内容和动画内容。The camera module 60 is electrically connected to the circuit board 30. Specifically, the camera module 60 further includes a soft board 62 and a connector 63. The connector 63 is electrically connected to the circuit board 30. The camera module 60 is used to collect images to form Corresponding to the image signal, the circuit board 30 is provided with a chip for processing the image signal. The chip preprocesses the image and uploads the image to the server via WiFi or 4G/5G. The server first outputs the object name in text form after identifying the content of the image. Then, the text content is synthesized into voice content and output. Next, the object name is matched with the optimal animation content of the server's educational resource library. If the matching is successful, the animation content is output, and the handheld electronic device plays the voice content and the animation content in sequence.
进一步参阅图7和图8,图7是本发明具体实施例儿童百科识物设备的前壳结构示意图,图8是本发明具体实施例儿童百科识物设备的摄像模组结构示意图,摄像头61安装在前壳体21远离后壳体22的一侧,前壳体21安装摄像头的一侧面设置两个安装位,两个安装位的中心连线形成直线L1,直线L1沿 壳体20的长度方向延伸,本申请中其中的一个安装位上安装摄像头,具体地,此安装位设置为四周具有侧壁的容纳槽211,摄像头固定于容纳槽的底部,组装时,摄像头61固定在容纳槽211内,软板62穿过开孔212,连接器63与位于容纳腔内的电路板30电连接。Further referring to Figures 7 and 8, Figure 7 is a schematic diagram of the front shell structure of the children's encyclopedia recognition device in a specific embodiment of the present invention, and Figure 8 is a schematic diagram of the camera module structure of the children's encyclopedia recognition device in a specific embodiment of the present invention, and the camera 61 is installed On the side of the front housing 21 away from the rear housing 22, two installation positions are provided on one side of the front housing 21 where the camera is installed. The center connection of the two installation positions forms a straight line L1, which is along the length direction of the housing 20 Extend, the camera is installed on one of the installation positions in the present application. Specifically, the installation position is set as a receiving groove 211 with side walls on all sides. The camera is fixed at the bottom of the receiving groove. During assembly, the camera 61 is fixed in the receiving groove 211 , The soft board 62 passes through the opening 212, and the connector 63 is electrically connected to the circuit board 30 located in the accommodating cavity.
进一步参阅图9和图10,图9是本发明具体实施例儿童百科识物设备的前盖和后盖结构示意图,图10是本发明具体实施例儿童百科识物设备的前盖和后盖另一结构示意图,本申请中后盖23上具有通孔232,光线可透过通孔232到达摄像头61的感光区域,后盖23具有对称设置的两个圆形凸出部231,通孔232设于凸出部231的中心位置,后盖23上具有容纳麦克风的凹槽233,凹槽底部具有外部声音可穿过的孔,相应地,前盖24包括用于使圆形凸出部231穿过的圆形孔241,前盖24上与凹槽233底部孔相对应的位置处也开了小孔,以便麦克风可灵敏接受到外部声音。Further referring to Figures 9 and 10, Figure 9 is a schematic diagram of the front cover and the back cover of the children's encyclopedia recognition device in a specific embodiment of the present invention, and FIG. 10 is another front cover and back cover of the children's encyclopedia recognition device in a specific embodiment of the present invention A schematic structural diagram. In the present application, the back cover 23 has a through hole 232 through which light can reach the photosensitive area of the camera 61. The back cover 23 has two symmetrically arranged circular protrusions 231, and the through hole 232 is provided with At the center of the protrusion 231, the back cover 23 has a groove 233 for accommodating the microphone, and the bottom of the groove has a hole through which external sound can pass. Accordingly, the front cover 24 includes a circular protrusion 231 for passing through For the round hole 241 that has passed, a small hole is also opened on the front cover 24 at a position corresponding to the bottom hole of the groove 233, so that the microphone can sensitively receive external sounds.
该儿童百科识物设备具有方便手握的手柄,摄像头具有多个可选择的安装位置,方便根据不同造型而选择合适的安装位置,儿童大都喜欢动物或卡通人物造型,摄像头在造型上可以作为动物或卡通人物的眼睛,造型新颖,增强了趣味性,手柄方便儿童手持。The children’s encyclopedia knowledge device has a convenient handle, and the camera has multiple optional installation positions, which is convenient to choose the appropriate installation position according to different shapes. Most children like animals or cartoon characters, and the camera can be used as an animal in shape Or the eyes of cartoon characters are novel in shape, which enhances the fun, and the handle is convenient for children to hold.
如图11本发明实施例还提供了一种终端设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,所述处理器运行所述计算机程序时实现智能识物方法的步骤。As shown in Figure 11, the embodiment of the present invention also provides a terminal device, including a memory, a processor, and a computer program that is stored in the memory and can run on the processor. When the processor runs the computer program, Steps to realize the method of intelligent object recognition.
图11是本发明一个实施例中提供的终端设备的结构示意图,如所示,该终端设备200包括:处理器220、存储器210以及存储在存储器210中并可在处理器220上运行的计算机程序211,例如:智能识物程序。处理器220执行计算机程序211时实现上述各个智能识物方法实施例中的步骤,或者,处理器 220执行计算机程序211时实现上述各智能识物装置实施例中各模块的功能。FIG. 11 is a schematic structural diagram of a terminal device provided in an embodiment of the present invention. As shown, the terminal device 200 includes: a processor 220, a memory 210, and a computer program stored in the memory 210 and running on the processor 220 211, such as: intelligent object recognition program. When the processor 220 executes the computer program 211, the steps in the foregoing embodiments of the smart object recognition method are implemented, or when the processor 220 executes the computer program 211, the functions of the modules in the foregoing embodiments of the smart object recognition device are implemented.
终端设备200可以为笔记本、掌上电脑、平板型计算机、手机等设备。终端设备200可包括,但不仅限于处理器220、存储器210。本领域技术人员可以理解,图11仅仅是终端设备200的示例,并不构成对终端设备200的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如:终端设备200还可以包括输入输出设备、显示设备、网络接入设备、总线等。The terminal device 200 may be a notebook, a palmtop computer, a tablet computer, a mobile phone, and other devices. The terminal device 200 may include, but is not limited to, a processor 220 and a memory 210. Those skilled in the art can understand that FIG. 11 is only an example of the terminal device 200, and does not constitute a limitation on the terminal device 200. It may include more or less components than those shown in the figure, or a combination of certain components, or different components. For example, the terminal device 200 may also include input and output devices, display devices, network access devices, buses, and so on.
处理器220可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable GateArray,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器220可以是微处理器或者该处理器也可以是任何常规的处理器等。The processor 220 may be a central processing unit (Central Processing Unit, CPU), or other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), on-site Field-Programmable GateArray (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, etc. The general-purpose processor 220 may be a microprocessor or the processor may also be any conventional processor or the like.
存储器210可以是终端设备200的内部存储单元,例如:终端设备200的硬盘或内存。存储器210也可以是终端设备200的外部存储设备,例如:终端设备200上配备的插接式硬盘,智能存储卡(SmartMedia Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,存储器210还可以既包括终端设备200的内部存储单元也包括外部存储设备。存储器210用于存储计算机程序211以及终端设备200所需要的其他程序和数据。存储器210还可以用于暂时地存储已经输出或者将要输出的数据。The memory 210 may be an internal storage unit of the terminal device 200, such as a hard disk or memory of the terminal device 200. The memory 210 may also be an external storage device of the terminal device 200, such as a plug-in hard disk equipped on the terminal device 200, a smart memory card (SmartMedia Card, SMC), a Secure Digital (SD) card, and a flash memory card (Flash). Card) and so on. Further, the memory 210 may also include both an internal storage unit of the terminal device 200 and an external storage device. The memory 210 is used to store the computer program 211 and other programs and data required by the terminal device 200. The memory 210 may also be used to temporarily store data that has been output or will be output.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详细描述或记载的部分,可以参见其他实施例的相关描述。In the above-mentioned embodiments, the description of each embodiment has its own focus. For parts that are not described or recorded in detail in an embodiment, reference may be made to related descriptions of other embodiments.
本发明提供的智能识物方法及装置、智能识物设备、终端设备及计算机存 储介质,至少能够带来以下有益效果:The intelligent object recognition method and device, intelligent object recognition equipment, terminal equipment and computer storage medium provided by the present invention can at least bring the following beneficial effects:
1、通过摄像头实时获取待识别物体的图像,通过图像处理获得待识别物体的特征从而进行特征识别,获得物体识别结果,并播报,对于儿童来说趣味性十足,知识的渴望可以得到及时满足,实时性高,且识物对象不局限于手头的刊物或卡片,任意物体都在识别范围,提高了知识的广度。1. Obtain the image of the object to be recognized through the camera in real time, and obtain the feature of the object to be recognized through image processing to perform feature recognition, obtain the object recognition result, and broadcast it. It is full of fun for children, and the desire for knowledge can be satisfied in time. The real-time performance is high, and the object of recognizing is not limited to the publication or card at hand, and any object is in the recognition range, which increases the breadth of knowledge.
2.除了播放物品的名称,还提供了学习资源库,针对识别到的物体,还在资源库中检索物品相关的教育资源,例如物品的历史由来、进化过程、成分,作用及分布等等,丰富了知识的同时,还提高了孩童学习的乐趣。2. In addition to the name of the item to be played, a learning resource library is also provided. For the identified object, the educational resources related to the item are also retrieved in the resource library, such as the historical origin, evolution process, composition, function and distribution of the item, etc. While enriching knowledge, it also enhances the fun of children's learning.
3.通过服务器对识物的数据库外接第三方程序接口,进行了识物对象的扩展,将识别对象无限扩大,提高了知识库的容量,更重要的,通过知识库的强大计算能力,可以大大提高识别物体的速度,用户体验度得以大大提高。3. The third-party program interface is connected to the database of the object through the server, and the object of the object is expanded. The object of the object is expanded infinitely, and the capacity of the knowledge base is increased. More importantly, the powerful computing power of the knowledge base can greatly increase the capacity of the knowledge base. Improve the speed of recognizing objects, and the user experience can be greatly improved.
4.通过预先设置的物品特征映射表,可以快速地根据物品的主要特征来定位物品分类,继而根据其他特征来进一步识别物体名称,一方面提高了物品识别的速度,另一方面也提高了物品识别的精度。4. Through the pre-set item feature mapping table, the item classification can be quickly located according to the main features of the item, and then the object name can be further identified according to other features. On the one hand, it improves the speed of item recognition, and on the other hand, it also improves the item Accuracy of recognition.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各程序模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的程序模块完成,即将装置的内部结构划分成不同的程序单元或模块,以完成以上描述的全部或者部分功能。实施例中的各程序模块可以集成在一个处理单元中,也可是各个单元单独物理存在,也可以两个或两个以上单元集成在一个处理单元中,上述集成的单元既可以采用硬件的形式实现,也可以采用软件程序单元的形式实现。另外,各程序模块的具体名称也只是为了便于相互区分,并不用于限制本发明的保护范围。Those skilled in the art can clearly understand that for the convenience and conciseness of the description, only the division of the above-mentioned program modules is used as an example. In practical applications, the above-mentioned functions can be allocated by different program modules as needed, namely The internal structure of the device is divided into different program units or modules to complete all or part of the functions described above. The program modules in the embodiments can be integrated in one processing unit, or each unit can exist alone physically, or two or more units can be integrated in one processing unit. The above-mentioned integrated units can be implemented in the form of hardware. It can also be implemented in the form of a software program unit. In addition, the specific names of the program modules are only for the convenience of distinguishing each other, and are not used to limit the protection scope of the present invention.
本领域普通技术人员可以意识到,结合本发明中所公开的实施例描述的各 示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。A person of ordinary skill in the art may realize that the units and algorithm steps of the examples described in the embodiments disclosed in the present invention can be implemented by electronic hardware or a combination of computer software and electronic hardware. Whether these functions are performed by hardware or software depends on the specific application and design constraint conditions of the technical solution. Professionals and technicians can use different methods for each specific application to implement the described functions, but such implementation should not be considered as going beyond the scope of the present invention.
在本发明所提供的实施例中,应该理解到,所揭露终端设备和方法,可以通过其他的方式实现。例如,以上所描述的终端设备实施例仅仅是示意性的,例如,模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如,多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通讯连接可以是通过一些接口,装置或单元的间接耦合或通讯连接,可以是电性、机械或其他的形式。In the embodiments provided by the present invention, it should be understood that the disclosed terminal device and method may be implemented in other ways. For example, the terminal device embodiments described above are merely illustrative. For example, the division of modules or units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be Combined or can be integrated into another system, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
另外,在本发明各个实施例中的各功能单元可能集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, the functional units in the various embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
本发明实施例还提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现所述智能识物方法的步骤。The embodiment of the present invention also provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps of the intelligent identification method are realized.
集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读存储介质中。基于这样的理解,本发明实现上述实施例方法中的全部或部分流程,也可以通过计算机程序211发送指令给相关的硬件完成,计算机程序211可存储于一计算机可读存储介质中,该 计算机程序211在被处理器220执行时,可实现上述各个方法实施例的步骤。其中,计算机程序211包括:计算机程序代码,计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。计算机可读存储介质可以包括:能够携带计算机程序211代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质等。需要说明的是,计算机可读存储介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如:在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括电载波信号和电信信号。If the integrated module/unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the present invention implements all or part of the processes in the above-mentioned embodiment methods, and can also be completed by sending instructions to the relevant hardware through the computer program 211. The computer program 211 can be stored in a computer-readable storage medium. When executed by the processor 220, 211 may implement the steps of the foregoing method embodiments. Wherein, the computer program 211 includes: computer program code, and the computer program code may be in the form of source code, object code, executable file, or some intermediate forms. The computer-readable storage medium may include: any entity or device capable of carrying the computer program 211 code, recording medium, U disk, mobile hard disk, magnetic disk, optical disk, computer memory, read-only memory (ROM, Read-Only Memory), random Access memory (RAM, Random Access Memory), electric carrier signal, telecommunications signal, software distribution medium, etc. It should be noted that the content contained in the computer-readable storage medium can be appropriately added or deleted according to the requirements of the legislation and patent practice in the jurisdiction. For example, in some jurisdictions, according to the legislation and patent practice, the computer-readable medium Does not include electrical carrier signals and telecommunication signals.
应当说明的是,上述实施例均可根据需要自由组合。以上所述仅是本发明的优选实施例,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。It should be noted that the above embodiments can be freely combined as required. The above are only preferred embodiments of the present invention. It should be pointed out that for those of ordinary skill in the art, without departing from the principle of the present invention, several improvements and modifications can be made, and these improvements and modifications are also It should be regarded as the protection scope of the present invention.
应当说明的是,上述实施例均可根据需要自由组合。以上所述仅是本发明的优选实施例,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。It should be noted that the above embodiments can be freely combined as required. The above are only preferred embodiments of the present invention. It should be pointed out that for those of ordinary skill in the art, without departing from the principle of the present invention, several improvements and modifications can be made, and these improvements and modifications are also It should be regarded as the protection scope of the present invention.

Claims (14)

  1. 一种智能识物方法,其特征在于,包括:A method of intelligent object recognition, which is characterized in that it includes:
    获取待识别物体的图像;Obtain an image of the object to be recognized;
    对所述待识别物体的图像进行预处理,获得图像特征数据;Preprocessing the image of the object to be recognized to obtain image feature data;
    根据所述图像特征数据在物品数据库进行识别,获得识别结果并生成物品内容文本,根据所述物品内容文本生成第一音频链接;Perform recognition in an article database according to the image feature data, obtain a recognition result and generate article content text, and generate a first audio link according to the article content text;
    根据所述物品内容文本在资源库中进行匹配,获得与所述物品内容文本相对应的资源数据,并根据所述资源数据生成第二音频链接;Performing matching in the resource library according to the article content text, obtaining resource data corresponding to the article content text, and generating a second audio link according to the resource data;
    选择播放所述第一音频链接和/或第二音频链接。Select to play the first audio link and/or the second audio link.
  2. 如权利要求1所述的智能识物方法,其特征在于,对所述待识别物体的图像进行预处理,获得图像特征数据具体包括:The intelligent object recognition method of claim 1, wherein preprocessing the image of the object to be recognized to obtain image feature data specifically includes:
    对所述待识别物体的图像进行压缩、图像二值化、灰度图处理、SIFT特征提取和交点特征提取,获得图像特征数据。Compression, image binarization, gray-scale image processing, SIFT feature extraction, and intersection feature extraction are performed on the image of the object to be recognized to obtain image feature data.
  3. 如权利要求1所述的智能识物方法,其特征在于,所述根据所述图像特征数据在物品数据库进行识别,获得识别结果并生成物品内容文本,根据所述物品内容文本生成第一音频链接具体包括:The intelligent object recognition method according to claim 1, wherein the recognition is performed in the article database according to the image feature data, the recognition result is obtained and the article content text is generated, and the first audio link is generated according to the article content text Specifically:
    根据所述图像特征数据,提取核心特征点数据对应的物体特征;Extracting the object features corresponding to the core feature point data according to the image feature data;
    在物品数据库中根据所述物体特征查找,获取与所述物体特征对应的物品名称,输出识别结果,生成物品内容文本;Search in the article database according to the object feature, obtain the article name corresponding to the object feature, output the recognition result, and generate the article content text;
    将所述物品内容文本合成为第一音频链接。Synthesize the content text of the article into a first audio link.
  4. 如权利要求3所述的智能识物方法,其特征在于,所述根据所述物品内容文本在资源库中进行匹配,获得与所述识别文本相对应的资源数据,并根据所述资源数据生成第二音频链接具体包括:The intelligent object recognition method according to claim 3, wherein the article content text is matched in a resource library to obtain resource data corresponding to the recognized text, and generate according to the resource data The second audio link specifically includes:
    根据所述所述物品内容文本生成检索式,按照所述检索式在资源库中进行检索,如检索到所述检索式完全匹配的资源数据,所述资源数据包括与所 述物品内容文本对应的视频、动画或者音频数据,则选择所述资源数据中一种或多种,生成第二音频链接。A search formula is generated according to the article content text, and a search is performed in the resource library according to the search formula. For example, resource data that exactly matches the search formula is retrieved, and the resource data includes the text corresponding to the article content For video, animation or audio data, one or more of the resource data is selected to generate a second audio link.
  5. 如权利要求3所述的智能识物方法,其特征在于,所述在物品数据库中根据所述物体特征查找,获取与所述物体特征对应的物品名称具体包括:The intelligent object recognition method according to claim 3, wherein the searching in an object database according to the object feature and obtaining the object name corresponding to the object feature specifically includes:
    所述物品数据库设置于远程服务器中,所述物品数据库包括一特征物品映射表,根据所述物体特征获取对应的物品名称;The article database is set in a remote server, the article database includes a characteristic article mapping table, and the corresponding article name is obtained according to the characteristics of the article;
    其中,所述物品数据库的数据接口与第三方知识库连接,所述数据接口用来实时更新所述特征物品映射表。Wherein, the data interface of the article database is connected with a third-party knowledge base, and the data interface is used to update the characteristic article mapping table in real time.
  6. 一种智能识物装置,其特征在于,所述智能识物装置包括:An intelligent object-recognizing device, characterized in that, the intelligent object-recognizing device includes:
    图像获取单元,用于获取待识别物体的图像;The image acquisition unit is used to acquire an image of the object to be identified;
    图像预处理单元,用于对所述待识别物体的图像进行预处理,获得图像特征数据;An image preprocessing unit, configured to preprocess the image of the object to be recognized to obtain image feature data;
    识别单元,用于发送所述图像特征数据到物品数据库进行识别,获得识别结果并生成物品内容文本,根据所述物品内容文本生成第一音频链接,以及根据所述物品内容文本在资源库中进行匹配,获得与所述识别文本相对应的资源数据,并根据所述资源数据生成第二音频链接;The recognition unit is used to send the image feature data to the item database for recognition, obtain the recognition result and generate the item content text, generate the first audio link according to the item content text, and perform the process in the resource database according to the item content text Match, obtain resource data corresponding to the recognized text, and generate a second audio link according to the resource data;
    播放单元,用于接收并选择播放所述第一音频链接和/或第二音频链接。The playing unit is configured to receive and select to play the first audio link and/or the second audio link.
  7. 如权利要求6所述的智能识物装置,其特征在于,所述识别单元具体包括:The intelligent object recognition device according to claim 6, wherein the recognition unit specifically comprises:
    发送模块,用于发送所述图像特征数据到物品数据库中,其中,所述物品数据库设置于远程服务器中,所述服务器根据所述图像特征数据,提取核心特征点数据对应的物体特征,并在物品数据库中根据所述物体特征查找,获取与所述物体特征对应的物品名称,输出识别结果,生成物品内容文本,将所述物品内容文本合成为第一音频链接。The sending module is used to send the image feature data to the item database, wherein the item database is set in a remote server, and the server extracts the object feature corresponding to the core feature point data according to the image feature data, and then According to the object feature search in the article database, the article name corresponding to the object feature is obtained, the recognition result is output, the article content text is generated, and the article content text is synthesized into the first audio link.
  8. 如权利要求6所述的智能识物装置,其特征在于,所述识别单元具体包括:The intelligent object recognition device according to claim 6, wherein the recognition unit specifically comprises:
    发送模块,用于发送所述图像特征数据到物品数据库中;A sending module for sending the image feature data to the article database;
    第一音频生成模块,用于根据所述图像特征在物品数据库中查找并获得识别结果,生成物品内容文本,根据所述物品内容文本生成第一音频链接;The first audio generation module is configured to search and obtain recognition results in the article database according to the image characteristics, generate article content text, and generate a first audio link based on the article content text;
    第二音频生成模块,用于根据所述物品内容文本在资源库中进行匹配,获得与所述识别文本相对应的资源数据,并根据所述资源数据生成第二音频链接。The second audio generating module is configured to perform matching in the resource library according to the article content text, obtain resource data corresponding to the recognized text, and generate a second audio link according to the resource data.
  9. 一种基于如权利要求1-5任意一项智能识物方法的智能识物设备,其特征在于,所述智能识物设备包括:An intelligent object recognition device based on any one of claims 1-5, wherein the intelligent object recognition device comprises:
    包括壳体、手柄和摄像模组,所述摄像模组包括摄像头,所述摄像头用来获取所述待识别物体的图像,所述壳体上具有多个用于安装所述摄像头的安装位,至少一个所述安装位上安装有摄像头,任意两个所述安装位所在直线与所述手柄所在直线相交,若所述安装位有2个则对称地设置在所述儿童百科识物设备的左右两侧,Comprising a housing, a handle and a camera module, the camera module includes a camera, the camera is used to obtain the image of the object to be identified, the housing has a plurality of installation positions for installing the camera, A camera is installed on at least one of the installation positions, and any two straight lines where the installation positions are located intersect the straight lines where the handles are located, and if there are two installation positions, they are symmetrically arranged on the left and right sides of the children's encyclopedia knowledge device On both sides,
    其中,壳体包括前盖、后盖和前壳体、后壳体,所述安装位位于所述前壳体上,所述后盖上具有通孔,光线可穿过所述通孔到达所述摄像头的感光区域;所述后盖具有对称设置的两个圆形凸出部,所述通孔设于所述凸出部的中心位置。Wherein, the housing includes a front cover, a rear cover, a front housing, and a rear housing. The installation location is located on the front housing, and the rear cover has a through hole through which light can pass through the through hole to reach the The photosensitive area of the camera; the back cover has two symmetrically arranged circular protrusions, and the through hole is provided at the center of the protrusions.
  10. 如权利要求9所述的智能识物设备,其特征在于,所述智能识物设备还包括:The intelligent object recognition device of claim 9, wherein the intelligent object recognition device further comprises:
    电路板,所述电路板包括容纳于所述壳体内的主体部和自所述主体部延伸至所述手柄内部的连接部,所述连接部上设置有识物触发件,所述手柄包括与所述识物触发件相配合的按键,按压所述按键可带动所述识物触发件动作,启动识物方法。A circuit board, the circuit board includes a main body portion accommodated in the housing and a connecting portion extending from the main body portion to the inside of the handle, the connecting portion is provided with an identification trigger, and the handle includes and The button matched with the object-recognizing trigger can be pressed to drive the object-recognizing trigger to act, and the object-recognizing method can be started.
  11. 如权利要求10所述的智能识物设备,其特征在于,所述按键部分穿 过所述通孔,与所述识物触发件可脱离地连接。The intelligent object recognition device of claim 10, wherein the button part passes through the through hole and is detachably connected to the object recognition trigger.
  12. 如权利要求11所述的智能识物设备,其特征在于,所述智能识物设备还包括显示组件,所述显示组件包括显示面板,所述前壳体上设有用于固定所述显示面板的安装框,所述显示面板安装于所述前壳体靠近所述后壳体的一侧,所述显示组件用于播放所述第一音频或第二音频。The intelligent object recognition device of claim 11, wherein the intelligent object recognition device further comprises a display component, the display component comprises a display panel, and the front housing is provided with a fixing device for fixing the display panel. A mounting frame, the display panel is mounted on a side of the front housing close to the rear housing, and the display assembly is used for playing the first audio or the second audio.
  13. 一种终端设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,其特征在于,所述处理器运行所述计算机程序时实现如权利要求1-5中任一项所述智能识物方法的步骤。A terminal device, comprising a memory, a processor, and a computer program stored in the memory and running on the processor, wherein the processor executes the computer program as claimed in claim 1- Steps of any one of the intelligent object recognition methods described in 5.
  14. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1-5中任一项所述智能识物方法的步骤。A computer-readable storage medium, the computer-readable storage medium stores a computer program, wherein the computer program is executed by a processor to implement the intelligent identification method according to any one of claims 1-5 A step of.
PCT/CN2020/138696 2019-11-06 2020-12-23 Method and apparatus for smart object recognition, object recognition device, terminal device, and storage medium WO2021089059A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201911075917.7 2019-11-06
CN201911075917.7A CN110781861A (en) 2019-11-06 2019-11-06 Electronic equipment and method for universal object recognition

Publications (1)

Publication Number Publication Date
WO2021089059A1 true WO2021089059A1 (en) 2021-05-14

Family

ID=69389468

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/138696 WO2021089059A1 (en) 2019-11-06 2020-12-23 Method and apparatus for smart object recognition, object recognition device, terminal device, and storage medium

Country Status (2)

Country Link
CN (1) CN110781861A (en)
WO (1) WO2021089059A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110781861A (en) * 2019-11-06 2020-02-11 上海谛闲工业设计有限公司 Electronic equipment and method for universal object recognition
CN112699743A (en) * 2020-12-15 2021-04-23 黄冈格罗夫氢能汽车有限公司 System for identifying hydrogen energy fuel cell vehicle indicator lamp and button switch by one key

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7563099B1 (en) * 1999-05-21 2009-07-21 Elizabeth Iftikhar Multi-media method and apparatus for teaching language
CN103646571A (en) * 2013-12-11 2014-03-19 步步高教育电子有限公司 Object information identification displaying method and device
CN106097793A (en) * 2016-07-21 2016-11-09 北京光年无限科技有限公司 A kind of child teaching method and apparatus towards intelligent robot
CN108986566A (en) * 2018-08-06 2018-12-11 南京南奕亭文化传媒有限公司 A kind of intelligent infant educational system and its operating method
CN109559576A (en) * 2018-11-16 2019-04-02 中南大学 A kind of children companion robot and its early teaching system self-learning method
CN109766914A (en) * 2018-12-14 2019-05-17 深圳壹账通智能科技有限公司 Item identification method, device, equipment and storage medium based on image recognition
CN110781861A (en) * 2019-11-06 2020-02-11 上海谛闲工业设计有限公司 Electronic equipment and method for universal object recognition

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7563099B1 (en) * 1999-05-21 2009-07-21 Elizabeth Iftikhar Multi-media method and apparatus for teaching language
CN103646571A (en) * 2013-12-11 2014-03-19 步步高教育电子有限公司 Object information identification displaying method and device
CN106097793A (en) * 2016-07-21 2016-11-09 北京光年无限科技有限公司 A kind of child teaching method and apparatus towards intelligent robot
CN108986566A (en) * 2018-08-06 2018-12-11 南京南奕亭文化传媒有限公司 A kind of intelligent infant educational system and its operating method
CN109559576A (en) * 2018-11-16 2019-04-02 中南大学 A kind of children companion robot and its early teaching system self-learning method
CN109766914A (en) * 2018-12-14 2019-05-17 深圳壹账通智能科技有限公司 Item identification method, device, equipment and storage medium based on image recognition
CN110781861A (en) * 2019-11-06 2020-02-11 上海谛闲工业设计有限公司 Electronic equipment and method for universal object recognition

Also Published As

Publication number Publication date
CN110781861A (en) 2020-02-11

Similar Documents

Publication Publication Date Title
CN111652678B (en) Method, device, terminal, server and readable storage medium for displaying article information
WO2021089059A1 (en) Method and apparatus for smart object recognition, object recognition device, terminal device, and storage medium
CN107798932A (en) A kind of early education training system based on AR technologies
US20230274471A1 (en) Virtual object display method, storage medium and electronic device
CN109918669A (en) Entity determines method, apparatus and storage medium
CN110322760B (en) Voice data generation method, device, terminal and storage medium
CN109040297A (en) User's portrait generation method and device
CN111524501A (en) Voice playing method and device, computer equipment and computer readable storage medium
CN108334498A (en) Method and apparatus for handling voice request
CN110807325A (en) Predicate identification method and device and storage medium
CN113297843B (en) Reference resolution method and device and electronic equipment
CN111339938A (en) Information interaction method, device, equipment and storage medium
WO2021232875A1 (en) Method and apparatus for driving digital person, and electronic device
CN112115282A (en) Question answering method, device, equipment and storage medium based on search
CN109819167A (en) A kind of image processing method, device and mobile terminal
WO2022193911A1 (en) Instruction information acquisition method and apparatus, readable storage medium, and electronic device
CN113593608A (en) Object recognition-based voice beautifying method, electronic device and storage medium
CN111835621A (en) Session message processing method and device, computer equipment and readable storage medium
CN111428079A (en) Text content processing method and device, computer equipment and storage medium
CN109189978B (en) Method, device and storage medium for audio search based on voice message
CN112887654B (en) Conference equipment, conference system and data processing method
CN112119372B (en) Electronic apparatus and control method thereof
CN113593521B (en) Speech synthesis method, device, equipment and readable storage medium
CN113709364B (en) Camera identifying equipment and object identifying method
CN115206305A (en) Semantic text generation method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20884347

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20884347

Country of ref document: EP

Kind code of ref document: A1