CN110659676A

CN110659676A - Information processing method, device and storage medium

Info

Publication number: CN110659676A
Application number: CN201910848845.9A
Authority: CN
Inventors: 王栀子
Original assignee: Zhuo Erzhi Lian Wuhan Research Institute Co Ltd
Current assignee: Zhuo Erzhi Lian Wuhan Research Institute Co Ltd
Priority date: 2019-09-09
Filing date: 2019-09-09
Publication date: 2020-01-07

Abstract

The invention discloses an information processing method, which comprises the following steps: acquiring sound data aiming at a target object, and determining a first state of the target object according to the sound data; acquiring image data of the target object, and determining a second state of the target object according to the image data; and determining a state result of the target object according to the first state and the second state. The invention also discloses an information processing device and a storage medium.

Description

Information processing method, device and storage medium

Technical Field

The present invention relates to prediction technologies, and in particular, to an information processing method, apparatus, and computer-readable storage medium.

Background

In daily life, the user generally checks the external form by experience according to the judging method for judging whether the melon and fruit are mature. Taking watermelon as an example, the mature watermelon juice is delicious and sweet and is popular with the majority of the national people, and whether the watermelon is mature or not is judged by the traditional method mainly depending on the color of the watermelon skin, the color of the watermelon flesh, the density of the watermelon and the reverberation of the watermelon during knocking. However, because the varieties of the watermelons are different and the experiences of each person are different, the user still easily makes mistakes when the watermelon ripeness degree is judged by adopting the mode, and the accuracy is low.

Disclosure of Invention

In view of the above, the present invention provides an information processing method, an information processing apparatus, and a computer-readable storage medium.

In order to achieve the purpose, the technical scheme of the invention is realized as follows:

the embodiment of the invention provides an information processing method, which comprises the following steps:

acquiring sound data aiming at a target object, and determining a first state of the target object according to the sound data;

acquiring image data of the target object, and determining a second state of the target object according to the image data;

and determining a state result of the target object according to the first state and the second state.

In the foregoing solution, the sound data for the target object includes: sound data generated by vibration of the target object;

the determining a first state of the target object from the acoustic data includes:

converting sound data generated by the vibration of the target object into an electric signal;

carrying out digital processing on the electric signal to obtain a peak frequency interval of the electric signal;

and inquiring a preset corresponding relation between the frequency and the state according to the peak frequency interval, and determining a first state corresponding to the peak frequency interval.

In the foregoing solution, the image data of the target object includes: appearance image data of the target object;

the determining a second state of the target object from the image data includes:

and identifying appearance image data of the target object by using a preset image identification model, and determining a second state of the target object.

In the above scheme, the method further comprises: generating the image recognition model; the generating of the image recognition model comprises:

acquiring a training image data set; the training image dataset comprising: at least one training image data and label data corresponding to each training image data;

fine-tuning a preset neural network by using the training image data set;

extracting image features of the training image data, the image features including at least one of: network characteristics of a neural network, red, green and blue RGB histogram characteristics and local directional texture mode LDTP characteristics;

training a classifier according to the image features and the label data corresponding to the image features to obtain a trained classifier; and taking the trimmed neural network and the trained classifier as the image recognition model.

In the foregoing solution, the determining the state result of the target object according to the first state and the second state includes any one of:

selecting a target state from the first state and the second state as the state result according to a preset rule;

determining weight values corresponding to the first state and the second state respectively; and performing weighting processing according to the first state, the second state, the weight value corresponding to the first state and the weight value corresponding to the second state, and taking a weighting processing result as the state result.

In the above scheme, the method further comprises:

acquiring state reference parameters for the target object; the state reference parameters comprise: hardness data;

inquiring the corresponding relation between the preset state reference parameter and the state result according to the state reference parameter, and determining a third state;

correspondingly, the determining the state result of the target object according to the first state and the second state includes:

and determining a state result of the target object according to the first state, the second state and the third state.

In the foregoing solution, the selecting a target state from the first state and the second state according to a preset rule, as the state result, includes:

comparing the first state with the second state to determine the difference between the two states;

judging whether the difference value of the two states exceeds a difference value threshold value or not, and selecting any one state from the first state and the second state as the state result if the difference value of the two states does not exceed the difference value threshold value;

and when the difference value of the two states exceeds a difference threshold value, determining the corresponding credibility of the first state and the second state respectively, and selecting a target state from the first state and the second state as the state result according to the corresponding credibility of the first state and the second state respectively.

An embodiment of the present invention provides an information processing apparatus, including: the system comprises a first processing module, a second processing module and a third processing module; wherein,

the first processing module is used for acquiring sound data of a target object and determining a first state of the target object according to the sound data;

the second processing module is used for acquiring image data of the target object and determining a second state of the target object according to the image data;

and the third processing module is used for determining a state result of the target object according to the first state and the second state.

An embodiment of the present invention provides an information processing apparatus, including: a processor and a memory for storing a computer program capable of running on the processor; wherein,

the processor is configured to execute the steps of any one of the information processing methods when the computer program is executed.

An embodiment of the present invention further provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of any one of the information processing methods described above.

The information processing method, the information processing device and the computer readable storage medium provided by the embodiment of the invention are used for acquiring sound data aiming at a target object and determining a first state of the target object according to the sound data; acquiring image data of the target object, and determining a second state of the target object according to the image data; and determining a state result of the target object according to the first state and the second state. In the embodiment of the invention, the state (such as judging the maturity degree of watermelon) is judged according to the sound data and the image data of the target object, and the state identification accuracy can be improved by combining the two modes.

Drawings

Fig. 1 is a schematic flowchart of an information processing method according to an embodiment of the present invention;

FIG. 2 is a flow chart illustrating another information processing method according to an embodiment of the present invention;

fig. 3 is a schematic structural diagram of an identification apparatus according to an embodiment of the present invention;

fig. 4 is a schematic structural diagram of an information processing apparatus according to an embodiment of the present invention;

fig. 5 is a schematic structural diagram of another information processing apparatus according to an embodiment of the present invention.

Detailed Description

In various embodiments of the present invention, acoustic data for a target object is acquired, and a first state of the target object is determined from the acoustic data; acquiring image data of the target object, and determining a second state of the target object according to the image data; and determining a state result of the target object according to the first state and the second state.

The present invention will be described in further detail with reference to examples.

Fig. 1 is a schematic flowchart of an information processing method according to an embodiment of the present invention; the method can be applied to intelligent electronic equipment; as shown in fig. 1, the method includes:

step 101, sound data of a target object is acquired, and a first state of the target object is determined according to the sound data.

In this embodiment, the sound data for the target object includes: sound data generated by vibration of the target object;

Further, the sound data generated by the vibration of the target object may specifically be: and after the preset position of the target object is knocked by preset force, the preset position vibrates to generate sound data.

Here, in order to improve the recognition accuracy, it may be to acquire sound data generated by tapping a specific portion (i.e., the predetermined position) of the target object with the same force (i.e., the predetermined force) to determine the first state of the target object, considering that the sounds may be different when tapping with different forces and/or tapping with different positions.

Here, the preset correspondence relationship between the frequency and the state may be obtained in advance through a large number of experiments. Specifically, for a plurality of sample objects (specifically, a plurality of sample objects of the same kind), knocking a predetermined position of the sample object according to a preset force, collecting sample sound data generated by vibration of the knocked sample object, converting the sample sound data into a sample electric signal, and performing digital processing on the sample electric signal to obtain a sample peak frequency interval of the sample electric signal; the state of the sample object is determined. And analyzing the states of the plurality of sample objects and the sample peak frequency interval corresponding to each sample object to obtain the preset corresponding relation between the frequency and the states.

It should be noted that, the state of the sample object may be examined by a developer according to a certain rule, and the state of the sample object is determined. Here, the state may be of various types. For example: the state characterizing maturity may then comprise: immature, relatively mature, etc. The different recording can be performed by numbers for various states, such as immature recording by number 1, mature recording by number 2, mature recording by number 3, etc.

In this embodiment, the intelligent electronic device may include or be connected to an assembly (such as a manipulator, and the manipulator may set a force standard) capable of knocking the target object with a certain force, so as to generate sound data; the intelligent electronic equipment further comprises or is connected with a module for collecting the sound data.

And 102, acquiring image data of the target object, and determining a second state of the target object according to the image data.

In this embodiment, the image data of the target object includes: appearance image data of the target object. The intelligent electronic device may include or be connected to a module for collecting the image data, such as a camera or other module having a shooting function.

In this embodiment, the method further includes: generating the image recognition model; the generating of the image recognition model comprises:

fine-tuning a preset neural network by using the training image data set;

extracting image features of the training image data, the image features including at least one of: network characteristics of a neural network, red-green-blue (RGB) histogram characteristics, Local Directional Texture Pattern (LDTP) characteristics;

Here, the neural network may employ an Alexnet neural network. The network characteristics of the neural network specifically include an output result output by the neural network according to the training image data. The tag data correspond to the state, and if the state represents the maturity, the tag data is the maturity; if the state represents quality, the tag data is quality, which is not described herein.

And 103, determining a state result of the target object according to the first state and the second state.

In this embodiment, the determining the state result of the target object according to the first state and the second state includes any one of:

Specifically, the selecting a target state from the first state and the second state according to a preset rule, as the state result, includes:

The above-mentioned difference threshold value may be preset and saved by a developer.

In this embodiment, the method further includes:

Here, the hardness data may refer to a pressure value that the skin of the target object can bear. In particular, the intelligent electronic device may comprise or be connected to a pressure monitor.

In this embodiment, the determining the state result of the target object according to the first state, the second state and the third state includes any one of:

selecting a target state from the first state, the second state and the third state as a state result according to a preset rule;

determining weight values corresponding to the first state, the second state and the third state respectively; and performing weighting processing according to the first state, the second state, the third state, the weight value corresponding to the first state, the weight value corresponding to the second state and the weight value corresponding to the third state, and taking a weighting processing result as the state result.

It should be noted that the first state, the second state, and the third state may respectively correspond to different values, that is, different states are represented by different values. Therefore, when the weighting processing is carried out, the weighting processing is carried out by using the numerical value of each state and the weight value corresponding to each state, and the weighting processing result is obtained.

In this embodiment, the selecting a target state from the first state, the second state, and the third state according to a preset rule, as the state result, includes:

comparing the first state, the second state and the third state, and determining the difference value of any two states;

judging whether the difference value of any two states exceeds a difference value threshold value, if the difference value of any two states does not exceed the difference value threshold value, selecting any one state from the first state, the second state and the third state as a target state, and taking the target state as the state result;

and when the situation that the difference value of any two states exceeds a difference value threshold value is determined, determining the corresponding credibility of the first state, the second state and the third state respectively, and selecting a target state as the state result according to the corresponding credibility of the first state, the second state and the third state respectively.

It should be noted that in this embodiment, different states are represented by different values, so that a difference between any two states can be compared, and a difference threshold is compared according to the difference, and the target state is determined according to the comparison result by using the above method, and is used as a state result.

And when the difference value of any two states does not exceed the difference value threshold value and the characterization error range is small, selecting any one state as a target state. When the difference value of any two states exceeds the difference value threshold value, a large error may exist in the representation, a concept of reliability can be introduced at the moment, and the state corresponding to the highest reliability is selected as the target state. Here, the reliability of each state may be preset and stored, and the reliability may be set by a large number of experiments in advance, that is, for the same sample object, the first state, the second state, and the third state of the sample object are compared with the tag (that is, the set state result) by using three methods, that is, the first state of the sample object is determined from the sound data, the second state of the sample object is determined from the image data, and the third state of the sample object is determined from the state reference parameter, and the error rate is increased from low to high, and the reliability is increased from high to low.

For example, the first state of the sample object is determined according to the sound data, the second state of the sample object is determined according to the image data, and the third state of the sample object is determined according to the state reference parameter, and the error rates of the three methods are sequentially from small to large, so that the credibility of the three methods is from high to low, that is, the credibility of the first state, the second state and the third state is from high to low.

In this embodiment, different types of target objects correspond to different types of state results, that is, the determined state results are different for different types of target objects; for example, for melons and fruits, the status results may characterize the degree of maturity; for solid wood furniture, the status result may be indicative of the Quality (Quality) of the furniture (or of the wood of the furniture).

The method provided by the embodiment can be used for judging any object of which the state needs to refer to parameters such as sound, appearance image, appearance hardness and the like aiming at the target object; for example: can be watermelon, Hami melon and other melons.

The following provides an example of detecting the maturity of watermelon to explain the above method, and the information processing method provided by the present embodiment can be used to detect different types of watermelon, and can improve the accuracy of detection relative to the judgment result of the personal experience of the user.

FIG. 2 is a flow chart illustrating another information processing method according to an embodiment of the present invention; as shown in FIG. 2, the method is applied to the identification of the watermelon maturity, and is combined with the method shown in FIG. 1, and the method comprises the following steps:

step 201, sound data and image data for watermelon are acquired.

Here, the sound data for watermelon can be obtained by tapping the watermelon peel to generate a sound and collecting the sound.

And obtaining the image data aiming at the watermelon by taking a picture of the watermelon. Here, the picture may specifically be a color picture.

Step 202, converting the sound data into an electric signal, analyzing the electric signal, and determining the maturity degree of the watermelon as a first identification result.

In particular, the method may be applied to an identification device, which may adopt the structure shown in fig. 3, and may include: the device comprises a sound acquisition module, a sound processing module, an image acquisition module and a result display module. After the watermelon is knocked, the watermelon shakes to generate sound, the sound collection module collects the sound and converts the sound into an electric signal, and the sound processing module analyzes and processes the electric signal to obtain a conclusion and transmits the conclusion to the result display module to display a detection result; the image acquisition module may acquire image data for a watermelon.

Specifically, the sound processing module performs digital processing on the electric signal to obtain a peak frequency interval of the electric signal; and acquiring data partition areas obtained in advance according to a large number of experiments, and inquiring the data partition areas according to the obtained peak frequency area to obtain the corresponding maturity of the watermelon.

And step 203, identifying the image data, and determining the maturity of the watermelon as a second identification result.

Here, step 203 specifically includes: and (3) extracting the features of the image data of the watermelon, and determining the maturity of the watermelon according to the extracted features. Here, the method may specifically include: classifying the watermelon according to the extracted features, finding out the variety of the watermelon, and determining the maturity of the watermelon by using the trained watermelon classifier of the variety.

Specifically, the method further comprises: generating an image recognition model; the image recognition model is used for recognizing the image data. Specifically, the method for generating the image recognition model in the method shown in fig. 1 may be used.

For the image recognition model for the watermelon maturity in this embodiment, the generating the image recognition model includes:

step 031: preprocessing the acquired watermelon image, inputting the watermelon image with a label into an Alexnet network, finely adjusting the Alexnet network by adopting the label to obtain the Alexnet network for extracting characteristics, and taking an output result of the Alexnet network as the extracted network characteristics of the Alexnet network;

step 032: extracting image direction and texture information features by adopting LDTP, and combining the network features, RGB histogram features and LDTP features of the Alexnet network in the step 031 to obtain image features;

step 033: and (3) repeating the watermelon image acquired in the step 001, labeling watermelons with different maturity degrees with different digital labels to form training set data, processing the training set data to obtain a training set characteristic set, and training an SVM classifier by using the set data to obtain an SVM classification model.

And obtaining an image recognition model capable of recognizing the watermelon maturity according to the Alexnet network and the SVM classifier.

The watermelon image can be collected through an image collecting module, and the image recognition model is applied to the image processing module.

In the process of generating the image recognition model, the watermelon ripeness degree recognition model can be obtained by respectively training different types of watermelons

And step 204, determining the maturity degree of the watermelon according to the first identification result and the second identification result.

Here, the ripeness degree of the watermelon can be displayed to the user through the result display module for the user to refer to.

Specifically, in the step 204, the maturity degree of watermelon is determined according to the first recognition result and the second recognition result, which may adopt the same method as that of determining the state result of the target object according to the first state and the second state in the method shown in fig. 1, that is, the first recognition result may be used as the first state, the second recognition result may be used as the second state, and the obtained maturity degree of watermelon is the state result of the target object; and will not be described in detail herein.

In this embodiment, in addition to identifying the ripeness of watermelon by sound data and image data, auxiliary detection may be performed by detecting the hardness of watermelon peel.

Specifically, the method may further include: and reading the hardness data of the watermelon epidermis, and comparing the hardness data with the corresponding relation table of the hardness and the maturity degree so as to determine the maturity degree of the watermelon as a third identification result.

When the third identification result is obtained, the step 204 of determining the maturity degree of watermelon may refer to the first identification result, the second identification result and the third identification result. The specific reference method may adopt the same method as that in the method shown in fig. 1, in which the state result of the target object is determined according to the first state, the second state, and the third state, that is, the first recognition result may be used as the first state, the second recognition result may be used as the second state, the third recognition result may be used as the third state, and the obtained ripeness degree of the watermelon is the state result of the target object, which is not described herein again.

According to the scheme of the embodiment, the maturity of the watermelon is detected based on the image and the data in a bimodal manner, the image of the watermelon and the sound generated by beating the watermelon can be fused to judge whether the watermelon is mature, the two modes are combined to detect the maturity of the watermelon, and the identification accuracy of the maturity of the watermelon is further improved.

Fig. 4 is a schematic structural diagram of an information processing apparatus according to an embodiment of the present invention; the device can be applied to intelligent electronic equipment; as shown in fig. 4, the apparatus includes: the device comprises a first processing module, a second processing module and a third processing module.

Specifically, the sound data for the target object includes: sound data generated by vibration of the target object; the first processing module is used for converting sound data generated by vibration of the target object into an electric signal; carrying out digital processing on the electric signal to obtain a peak frequency interval of the electric signal; and inquiring a preset corresponding relation between the frequency and the state according to the peak frequency interval, and determining a first state corresponding to the peak frequency interval.

Specifically, the image data of the target object includes: appearance image data of the target object;

the second processing module is configured to identify appearance image data of the target object by using a preset image identification model, and determine a second state of the target object.

In particular, the apparatus further comprises a generation module for generating the image recognition model. Specifically, the generating module is specifically configured to acquire a training image dataset; the training image dataset comprising: at least one training image data and label data corresponding to each training image data; fine-tuning a preset neural network by using the training image data set; extracting image features of the training image data, the image features including at least one of: network characteristics, RGB histogram characteristics, LDTP characteristics of the neural network; training a classifier according to the image features and the label data corresponding to the image features to obtain a trained classifier; and taking the trimmed neural network and the trained classifier as the image recognition model.

Specifically, the third processing module is configured to determine a state result of the target object according to the first state and the second state by using any one of the following methods:

In this embodiment, the apparatus further includes: the fourth processing module is used for acquiring state reference parameters aiming at the target object; the state reference parameters comprise: hardness data; inquiring the corresponding relation between the preset state reference parameter and the state result according to the state reference parameter, and determining a third state;

correspondingly, the third processing module is configured to determine a state result of the target object according to the first state, the second state, and the third state.

Specifically, the third processing module is specifically configured to determine a state result of the target object according to the first state, the second state, and the third state by using any one of the following methods:

Specifically, the third processing module is configured to compare the first state and the second state, and determine a difference between the two states; judging whether the difference value of the two states exceeds a difference value threshold value or not, and selecting any one state from the first state and the second state as the state result if the difference value of the two states does not exceed the difference value threshold value; and when the difference value of the two states exceeds a difference threshold value, determining the corresponding credibility of the first state and the second state respectively, and selecting a target state from the first state and the second state as the state result according to the corresponding credibility of the first state and the second state respectively.

Specifically, the third processing module is configured to compare the first state, the second state, and the third state, and determine a difference between any two states; judging whether the difference value of any two states exceeds a difference value threshold value, determining that the difference value of any two states does not exceed the difference value threshold value, and selecting any one state from the first state, the second state and the third state as the state result; and when the situation that the difference value of any two states exceeds a difference value threshold value is determined, determining the corresponding credibility of the first state, the second state and the third state respectively, and selecting a target state as the state result according to the corresponding credibility of the first state, the second state and the third state respectively.

It should be noted that: in the information processing apparatus provided in the above embodiment, when performing information processing, only the division of each program module is exemplified, and in practical applications, the processing may be distributed to different program modules according to needs, that is, the internal structure of the apparatus may be divided into different program modules to complete all or part of the processing described above. In addition, the information processing apparatus and the information processing method provided by the above embodiments belong to the same concept, and specific implementation processes thereof are described in the method embodiments and are not described herein again.

Fig. 5 is a schematic structural diagram of another information processing apparatus according to an embodiment of the present invention. The apparatus 50 comprises: a processor 501 and a memory 502 for storing computer programs executable on the processor; wherein, the processor 501 is configured to execute, when running the computer program, the following steps: acquiring sound data aiming at a target object, and determining a first state of the target object according to the sound data; acquiring image data of the target object, and determining a second state of the target object according to the image data; and determining a state result of the target object according to the first state and the second state. The sound data for the target object includes: sound data generated by vibration of the target object; image data of the target object, comprising: appearance image data of the target object.

In an embodiment, the processor 501 is further configured to execute, when running the computer program, the following: converting sound data generated by the vibration of the target object into an electric signal; carrying out digital processing on the electric signal to obtain a peak frequency interval of the electric signal; and inquiring a preset corresponding relation between the frequency and the state according to the peak frequency interval, and determining a first state corresponding to the peak frequency interval.

In an embodiment, the processor 501 is further configured to execute, when running the computer program, the following: and identifying appearance image data of the target object by using a preset image identification model, and determining a second state of the target object.

In an embodiment, the processor 501 is further configured to execute, when running the computer program, the following: acquiring a training image data set; the training image dataset comprising: at least one training image data and label data corresponding to each training image data; fine-tuning a preset neural network by using the training image data set; extracting image features of the training image data, the image features including at least one of: network characteristics, RGB histogram characteristics, LDTP characteristics of the neural network; training a classifier according to the image features and the label data corresponding to the image features to obtain a trained classifier; and taking the trimmed neural network and the trained classifier as the image recognition model.

In an embodiment, the processor 501 is further configured to execute, when running the computer program, the following: selecting a target state from the first state and the second state as the state result according to a preset rule; or determining the weight values corresponding to the first state and the second state respectively; and performing weighting processing according to the first state, the second state, the weight value corresponding to the first state and the weight value corresponding to the second state, and taking a weighting processing result as the state result.

In an embodiment, the processor 501 is further configured to execute, when running the computer program, the following: acquiring state reference parameters for the target object; the state reference parameters comprise: hardness data; inquiring the corresponding relation between the preset state reference parameter and the state result according to the state reference parameter, and determining a third state; correspondingly, the determining the state result of the target object according to the first state and the second state includes: and determining a state result of the target object according to the first state, the second state and the third state.

In an embodiment, the processor 501 is further configured to execute, when running the computer program, the following: selecting a target state from the first state, the second state and the third state as a state result according to a preset rule; or determining weight values corresponding to the first state, the second state and the third state respectively; and performing weighting processing according to the first state, the second state, the third state, the weight value corresponding to the first state, the weight value corresponding to the second state and the weight value corresponding to the third state, and taking a weighting processing result as the state result.

In an embodiment, the processor 501 is further configured to execute, when running the computer program, the following: comparing the first state with the second state to determine the difference between the two states; judging whether the difference value of the two states exceeds a difference value threshold value or not, and selecting any one state from the first state and the second state as the state result if the difference value of the two states does not exceed the difference value threshold value; and when the difference value of the two states exceeds a difference threshold value, determining the corresponding credibility of the first state and the second state respectively, and selecting a target state from the first state and the second state as the state result according to the corresponding credibility of the first state and the second state respectively.

In an embodiment, the processor 501 is further configured to execute, when running the computer program, the following: comparing the first state, the second state and the third state, and determining the difference value of any two states; judging whether the difference value of any two states exceeds a difference value threshold value, determining that the difference value of any two states does not exceed the difference value threshold value, and selecting any one state from the first state, the second state and the third state as the state result; and when the situation that the difference value of any two states exceeds a difference value threshold value is determined, determining the corresponding credibility of the first state, the second state and the third state respectively, and selecting a target state as the state result according to the corresponding credibility of the first state, the second state and the third state respectively.

It should be noted that: the information processing apparatus and the information processing method provided by the above embodiments belong to the same concept, and specific implementation processes thereof are described in the method embodiments and are not described herein again.

In practical applications, the apparatus 50 may further include: at least one network interface 503. Various components within information handling device 50 are coupled together by bus system 504. It is understood that the bus system 504 is used to enable communications among the components. The bus system 504 includes a power bus, a control bus, and a status signal bus in addition to a data bus. For clarity of illustration, however, the various buses are labeled as bus system 504 in fig. 5. The number of the processors 501 may be at least one. The network interface 503 is used for communication between the information processing apparatus 50 and other devices in a wired or wireless manner.

The memory 502 in the embodiment of the present invention is used to store various types of data to support the operation of the information processing apparatus 50.

The method disclosed by the above-mentioned embodiments of the present invention may be applied to the processor 501, or implemented by the processor 501. The processor 501 may be an integrated circuit chip having signal processing capabilities. In implementation, the steps of the above method may be performed by integrated logic circuits of hardware or instructions in the form of software in the processor 501. The Processor 501 may be a general purpose Processor, a DiGital Signal Processor (DSP), or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, etc. Processor 501 may implement or perform the methods, steps, and logic blocks disclosed in embodiments of the present invention. A general purpose processor may be a microprocessor or any conventional processor or the like. The steps of the method disclosed by the embodiment of the invention can be directly implemented by a hardware decoding processor, or can be implemented by combining hardware and software modules in the decoding processor. The software modules may be located in a storage medium located in the memory 502, and the processor 501 reads the information in the memory 502 and performs the steps of the aforementioned methods in conjunction with its hardware.

In an exemplary embodiment, the information processing apparatus 50 may be implemented by one or more Application Specific Integrated Circuits (ASICs), DSPs, Programmable Logic Devices (PLDs), Complex Programmable Logic Devices (CPLDs), Field Programmable Gate Arrays (FPGAs), general purpose processors, controllers, Micro Controllers (MCUs), microprocessors (microprocessors), or other electronic components for performing the foregoing methods.

An embodiment of the present invention further provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, performs: acquiring sound data aiming at a target object, and determining a first state of the target object according to the sound data; acquiring image data of the target object, and determining a second state of the target object according to the image data; and determining a state result of the target object according to the first state and the second state. The sound data for the target object includes: sound data generated by vibration of the target object; image data of the target object, comprising: appearance image data of the target object.

In one embodiment, the computer program, when executed by the processor, performs: converting sound data generated by the vibration of the target object into an electric signal; carrying out digital processing on the electric signal to obtain a peak frequency interval of the electric signal; and inquiring a preset corresponding relation between the frequency and the state according to the peak frequency interval, and determining a first state corresponding to the peak frequency interval.

In one embodiment, the computer program, when executed by the processor, performs: and identifying appearance image data of the target object by using a preset image identification model, and determining a second state of the target object.

In one embodiment, the computer program, when executed by the processor, performs: acquiring a training image data set; the training image dataset comprising: at least one training image data and label data corresponding to each training image data; fine-tuning a preset neural network by using the training image data set; extracting image features of the training image data, the image features including at least one of: network characteristics, RGB histogram characteristics, LDTP characteristics of the neural network; training a classifier according to the image features and the label data corresponding to the image features to obtain a trained classifier; and taking the trimmed neural network and the trained classifier as the image recognition model.

In one embodiment, the computer program, when executed by the processor, performs: selecting a target state from the first state and the second state as the state result according to a preset rule; or determining the weight values corresponding to the first state and the second state respectively; and performing weighting processing according to the first state, the second state, the weight value corresponding to the first state and the weight value corresponding to the second state, and taking a weighting processing result as the state result.

In one embodiment, the computer program, when executed by the processor, performs: acquiring state reference parameters for the target object; the state reference parameters comprise: hardness data; inquiring the corresponding relation between the preset state reference parameter and the state result according to the state reference parameter, and determining a third state; correspondingly, the determining the state result of the target object according to the first state and the second state includes: and determining a state result of the target object according to the first state, the second state and the third state.

In one embodiment, the computer program, when executed by the processor, performs: selecting a target state from the first state, the second state and the third state as a state result according to a preset rule; or determining weight values corresponding to the first state, the second state and the third state respectively; and performing weighting processing according to the first state, the second state, the third state, the weight value corresponding to the first state, the weight value corresponding to the second state and the weight value corresponding to the third state, and taking a weighting processing result as the state result.

In one embodiment, the computer program, when executed by the processor, performs: comparing the first state with the second state to determine the difference between the two states; judging whether the difference value of the two states exceeds a difference value threshold value or not, and selecting any one state from the first state and the second state as the state result if the difference value of the two states does not exceed the difference value threshold value; and when the difference value of the two states exceeds a difference threshold value, determining the corresponding credibility of the first state and the second state respectively, and selecting a target state from the first state and the second state as the state result according to the corresponding credibility of the first state and the second state respectively.

In one embodiment, the computer program, when executed by the processor, performs: comparing the first state, the second state and the third state, and determining the difference value of any two states; judging whether the difference value of any two states exceeds a difference value threshold value, determining that the difference value of any two states does not exceed the difference value threshold value, and selecting any one state from the first state, the second state and the third state as the state result; and when the situation that the difference value of any two states exceeds a difference value threshold value is determined, determining the corresponding credibility of the first state, the second state and the third state respectively, and selecting a target state as the state result according to the corresponding credibility of the first state, the second state and the third state respectively.

In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. The above-described device embodiments are merely illustrative, for example, the division of the unit is only a logical functional division, and there may be other division ways in actual implementation, such as: multiple units or components may be combined, or may be integrated into another system, or some features may be omitted, or not implemented. In addition, the coupling, direct coupling or communication connection between the components shown or discussed may be through some interfaces, and the indirect coupling or communication connection between the devices or units may be electrical, mechanical or other forms.

The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed on a plurality of network units; some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.

In addition, all the functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may be separately regarded as one unit, or two or more units may be integrated into one unit; the integrated unit can be realized in a form of hardware, or in a form of hardware plus a software functional unit.

Those of ordinary skill in the art will understand that: all or part of the steps for implementing the method embodiments may be implemented by hardware related to program instructions, and the program may be stored in a computer readable storage medium, and when executed, the program performs the steps including the method embodiments; and the aforementioned storage medium includes: a mobile storage device, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.

Alternatively, the integrated unit of the present invention may be stored in a computer-readable storage medium if it is implemented in the form of a software functional module and sold or used as a separate product. Based on such understanding, the technical solutions of the embodiments of the present invention may be essentially implemented or a part contributing to the prior art may be embodied in the form of a software product, which is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the methods described in the embodiments of the present invention. And the aforementioned storage medium includes: a removable storage device, a ROM, a RAM, a magnetic or optical disk, or various other media that can store program code.

The above description is only exemplary of the present invention and should not be taken as limiting the scope of the present invention, and any modifications, equivalents, improvements, etc. that are within the spirit and principle of the present invention should be included in the present invention.

Claims

1. An information processing method, characterized in that the method comprises:

2. The method of claim 1, wherein the acoustic data for the target object comprises: sound data generated by vibration of the target object;

3. The method of claim 1, wherein the image data of the target object comprises: appearance image data of the target object;

4. The method of claim 3, further comprising: generating the image recognition model; the generating of the image recognition model comprises:

fine-tuning a preset neural network by using the training image data set;

5. The method of claim 1, wherein determining the state of the target object based on the first state and the second state comprises any one of:

6. The method of claim 1, further comprising:

7. The method according to claim 5, wherein said selecting a target state from said first state and said second state as said state result according to a preset rule comprises:

8. An information processing apparatus characterized in that the apparatus comprises: the system comprises a first processing module, a second processing module and a third processing module; wherein,

9. An information processing apparatus characterized in that the apparatus comprises: a processor and a memory for storing a computer program capable of running on the processor; wherein,

the processor is adapted to perform the steps of the method of any one of claims 1 to 7 when running the computer program.

10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 7.