CN110941737B

CN110941737B - Single-machine voice storage method, device and equipment and readable storage medium

Info

Publication number: CN110941737B
Application number: CN201911247024.6A
Authority: CN
Inventors: 彭辉; 黎智勇; 许敏强
Original assignee: Guangzhou Speakin Intelligent Technology Co ltd
Current assignee: Guangzhou Speakin Intelligent Technology Co ltd
Priority date: 2019-12-06
Filing date: 2019-12-06
Publication date: 2023-01-20
Anticipated expiration: 2039-12-06
Also published as: CN110941737A

Abstract

The application discloses a stand-alone voice storage method, a stand-alone voice storage device, stand-alone voice storage equipment and a readable storage medium, wherein the method comprises the steps of collecting voice information and judging whether uploading conditions are met; if the uploading condition is not met, acquiring dimension information of the voice information; storing the voice information to a target storage partition pointed by the dimension information in the acquisition terminal storage partition, and adding an index mark of the voice information to the target storage partition; and when the voice information is detected to meet the preset uploading condition, determining the target storage partition based on the index mark, and uploading the voice information and the corresponding dimension information in the target storage partition. After voice information is collected, whether uploading conditions are met or not is judged, the uploading conditions are not met, a target storage partition in the collection terminal is found according to dimension information, the voice information is stored, an index mark is built, if the uploading conditions are met, the target storage partition is rapidly determined according to the index mark, and the voice information and the dimension information are uploaded.

Description

Single-machine voice storage method, device and equipment and readable storage medium

Technical Field

The invention relates to the technical field of data storage, in particular to a stand-alone voice storage method, a stand-alone voice storage device, stand-alone voice storage equipment and a readable medium.

Background

After the public security department collects the voice information, the voice information needs to be uploaded to a public security voiceprint library for voiceprint comparison, but sometimes the voiceprint comparison is limited by the fact that the current condition and situation cannot be directly uploaded, the collected voiceprint information needs to be stored at the moment, and then the stored voiceprint information is uploaded to the public security voiceprint library for comparison.

The existing storage method is to store the collected voice information into a local memory at random, and when the uploading condition is met, the collected voice information is uploaded to a public security voiceprint library again.

Disclosure of Invention

The application mainly aims to provide a single-computer voice storage method, a single-computer voice storage device, single-computer voice storage equipment and a readable storage medium, and aims to solve the problem that collected voice information is stored and then uploaded inconveniently.

In order to achieve the above object, the present application provides a stand-alone speech storage method, where the incremental data clustering method includes the following steps:

collecting voice information and judging whether uploading conditions are met or not;

if the uploading condition is not met, acquiring dimension information of the voice information;

storing the voice information to a target storage partition pointed by the dimension information in a storage partition of an acquisition terminal, and adding an index mark of the voice information to the storage partition;

when the voice information is detected to meet a preset uploading condition, the target storage partition is determined based on the index mark, and the voice information and the corresponding dimension information in the target storage partition are uploaded.

Optionally, the step of collecting voice information includes:

collecting voice information;

analyzing the number of speakers of the voice information;

and separating the voice information according to the number of the speakers to obtain the voice information of each speaker.

Optionally, the step of determining whether the uploading condition can be met includes:

establishing connection with a public security voiceprint library;

acquiring the connection result;

if the connection is successful, the uploading condition is met;

if the connection fails, the uploading condition is not met.

Optionally, the step of obtaining the dimension information of the voice information includes:

determining voice characteristic information of the voice information;

determining the acquisition place information of the voice information;

determining the acquisition time information of the voice information;

and forming the dimension information according to the voice feature information, the acquisition place information and the acquisition time information.

Optionally, the step of storing the voice information to a target storage partition pointed by the dimension information in a storage partition of a collection terminal, and adding an index mark of the voice information to the target storage partition includes:

finding the target storage partition pointed in the acquisition terminal according to the acquisition place information;

according to the voice feature information, carrying out similarity comparison on the voice information and the storage information in the target storage partition;

if the similarity is larger than a preset value, storing the voice information into the target storage partition;

and adding the index mark to the target storage partition according to the acquisition time information.

Optionally, the step of comparing the similarity between the voice information and the storage information in the target storage partition is followed by:

if the similarity is not greater than a preset value, establishing a temporary storage partition according to the acquisition place information;

and storing the voice information into the temporary storage subarea according to the acquisition place information.

When the number of the voice messages in the temporary storage partition exceeds a preset number, the temporary storage partition is changed into a fixed storage partition;

and adding index marks to the fixed storage partition according to the acquisition time information of the voice information in the fixed storage partition.

Optionally, the determining the target storage partition based on the index mark, and the uploading the voice information and the corresponding dimension information in the target storage partition includes:

acquiring the uploading time of the last uploading voice information;

acquiring index marks of all storage partitions, and determining the target storage partition according to the uploading time;

and uploading the voice information and the corresponding dimension information in the target storage partition according to the acquisition time information of the voice information in the target storage partition.

The present application further provides a stand-alone voice storage device, the stand-alone voice storage device includes:

the judging module is used for collecting voice information and judging whether uploading conditions are met or not;

the acquisition module is used for acquiring the dimension information of the voice information if the uploading condition is not met;

the storage module is used for storing the voice information to a target storage partition pointed by the dimension information in a storage partition of the acquisition terminal and adding an index mark of the voice information to the target storage partition;

and the uploading module is used for determining the target storage partition based on the index mark and uploading the voice information and the corresponding dimension information in the target storage partition when the voice information is detected to meet a preset uploading condition.

The present application further provides a stand-alone voice storage device, which comprises: the system comprises a memory, a processor and a stand-alone voice storage program which is stored on the memory and can run on the processor, wherein the stand-alone voice storage program realizes the steps of the stand-alone voice storage method when being executed by the processor.

The present application also provides a readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the stand-alone voice storage method as described above.

The method comprises the steps of judging whether uploading conditions are met or not by collecting voice information; if the uploading condition is not met, acquiring dimension information of the voice information; storing the voice information to a target storage partition pointed by the dimension information in a storage partition of an acquisition terminal, and adding an index mark of the voice information to the target storage partition; when the voice information is detected to meet a preset uploading condition, the target storage partition is determined based on the index mark, and the voice information and the corresponding dimension information in the target storage partition are uploaded. After voice information is collected, whether the voice information can be directly uploaded to a public security voiceprint library or not is judged, the voice information which cannot be uploaded can be found out, a target storage partition in the collection terminal is found according to dimension information of the voice information, the voice information is stored in the target storage partition, an index mark is established, and when uploading conditions are met, the target storage partition is quickly determined and the voice information and the dimension information are uploaded according to the index mark, so that comparison with voiceprint information in the public security voiceprint library is facilitated.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present application and together with the description, serve to explain the principles of the application.

In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly described below, and it is obvious for those skilled in the art to obtain other drawings without inventive exercise.

Fig. 1 is a schematic device structure diagram of a hardware operating environment according to an embodiment of the present application;

FIG. 2 is a flowchart illustrating a stand-alone voice storage method according to a first embodiment of the present application;

FIG. 3 is a detailed flowchart of step S10 of FIG. 2 in a second embodiment of the stand-alone speech storage method of the present application;

FIG. 4 is a flowchart illustrating a detailed procedure of step S20 of FIG. 2 in a third embodiment of the stand-alone speech storage method of the present application;

FIG. 5 is a flowchart illustrating a detailed process of step S30 of FIG. 2 in a fourth embodiment of the stand-alone speech storage method of the present application;

FIG. 6 is a flowchart illustrating a detailed process of step S40 of FIG. 2 in a fourth embodiment of the stand-alone speech storage method of the present application;

FIG. 7 is a system diagram of a stand-alone voice storage device according to an embodiment of the present invention.

The implementation, functional features and advantages of the objectives of the present application will be further explained with reference to the accompanying drawings.

Detailed Description

It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.

In the following description, suffixes such as "module", "component", or "unit" used to denote elements are used only for facilitating the explanation of the present invention, and have no specific meaning in itself. Thus, "module", "component" or "unit" may be used mixedly.

As shown in fig. 1, fig. 1 is a schematic terminal structure diagram of a hardware operating environment according to an embodiment of the present application.

The terminal in the embodiment of the application is a stand-alone voice storage device.

As shown in fig. 1, the terminal may include: a processor 1001, such as a CPU, a network interface 1004, a user interface 1003, a memory 1005, a communication bus 1002. Wherein a communication bus 1002 is used to enable connective communication between these components. The user interface 1003 may include a Display screen (Display), an input unit such as a Keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface, a wireless interface. The network interface 1004 may optionally include a standard wired interface, a wireless interface (e.g., WI-FI interface). The memory 1005 may be a high-speed RAM memory or a non-volatile memory such as a disk memory. The memory 1005 may alternatively be a storage device separate from the processor 1001.

Optionally, the terminal may further include a camera, a Radio Frequency (RF) circuit, a sensor, an audio circuit, a WiFi module, and the like. Such as light sensors, motion sensors, and other sensors, among others. Specifically, the light sensor may include an ambient light sensor that adjusts the brightness of the display screen according to the brightness of ambient light, and a proximity sensor that turns off the display screen and/or the backlight when the terminal device is moved to the ear. Of course, the terminal device may also be configured with other sensors such as a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor, which are not described herein again.

Those skilled in the art will appreciate that the terminal structure shown in fig. 1 is not intended to be limiting and may include more or fewer components than those shown, or some components may be combined, or a different arrangement of components.

As shown in fig. 1, the memory 1005, which is a kind of computer storage medium, may include therein an operating system, a network communication module, a user interface module, and a stand-alone voice storage program.

In the terminal shown in fig. 1, the network interface 1004 is mainly used for connecting to a backend server and performing data communication with the backend server; the user interface 1003 is mainly used for connecting a client (user side) and performing data communication with the client; and processor 1001 may be configured to invoke a stand-alone voice storage program stored in memory 1005 and perform the following operations:

storing the voice information to a target storage partition pointed by the dimension information in a storage partition of an acquisition terminal, and adding an index mark of the voice information to the target storage partition;

Based on the above terminal hardware structure, various embodiments of the present application are provided.

The application provides a stand-alone voice storage method.

Referring to fig. 2, in a first embodiment of the stand-alone voice storage method, the method comprises:

step S10, collecting voice information and judging whether uploading conditions are met;

when the public security department carries out case investigation, voice information needs to be collected, the collected voice information is uploaded to a public security voiceprint library to be matched and compared with voiceprint information in the voiceprint library, and if similar voiceprint information is matched, the fact that the source of the collected voice information is suspected to be criminal is shown. However, in order to ensure the information security of the public security voiceprint library, the uploading condition is relatively harsh, the uploaded information needs to be subjected to strict security evaluation, the uploading environment, such as the uploading place and the traffic condition of the network, needs to be detected when the uploading voice information is uploaded, if the connection with the public security voiceprint library is successfully established, the voice information is uploaded, and if the connection with the public security voiceprint library cannot be established, the voice information needs to be stored in the local memory.

Step S20, if the uploading condition is not met, acquiring dimension information of the voice information;

and when the uploading condition is not met, acquiring dimension information of the voice information, wherein the dimension information of the voice information comprises voice characteristic information, acquisition place information and acquisition time information, the voice characteristic information comprises characteristic information which can analyze and compare voice such as pitch and tone of the voice, the acquisition place information is a place where the voice information is acquired, and the acquisition time information is time for acquiring the voice information.

Step S30, according to the dimension information, storing the voice information to a target storage partition pointed by the dimension information in a storage partition corresponding to an acquisition terminal, and adding an index mark of the voice information to the target storage partition;

the terminal stores part of voice information derived from the public security voiceprint library, the voice information derived from the public security voiceprint library is classified according to the location information during collection, meanwhile, the voice information is sequenced according to the sequence of the collection time, for newly collected voice information, a possible target storage partition is preliminarily determined according to the collection location information in the dimension information, then, voice characteristic information is used for matching with the voice information in the previously stored collection terminal, if voice information with high similarity exists, the collected voice information is stored in the corresponding target storage partition, an index mark is added to the target storage partition according to the dimension information, if no proper existing storage partition is found, a new temporary storage partition is established according to the collection time information of the voice information, and the collected voice information is stored in the new temporary storage partition.

Step S40, when uploading is possible and the voice information meets the preset uploading condition, the voice information and the corresponding dimension information in the target storage partition are uploaded from the marked target storage partition determined based on the index according to the dimension information;

and when the current environment is detected to meet the preset uploading condition again, determining that a target storage partition subjected to data updating exists according to the index mark of the storage partition, and uploading the newly acquired voice information and the corresponding dimension information to the public security voiceprint library according to the acquisition time information in the dimension information of the voice information.

In this embodiment, whether the uploading condition is met is judged by collecting voice information; if the uploading condition is not met, acquiring dimension information of the voice information; storing the voice information to a target storage partition pointed by the dimension information in a storage partition of an acquisition terminal, and adding an index mark of the voice information to the target storage partition; when the voice information is detected to meet a preset uploading condition, the target storage partition is determined based on the index mark, and the voice information and the corresponding dimension information in the target storage partition are uploaded. After voice information is collected, whether the voice information can be directly uploaded to a public security voiceprint library or not is judged, the voice information which cannot be uploaded can be found out, a target storage partition in the collection terminal is found according to dimension information of the voice information, the voice information is stored in the target storage partition, an index mark is established, and when uploading conditions are met, the target storage partition is quickly determined and the voice information and the dimension information are uploaded according to the index mark, so that comparison with voiceprint information in the public security voiceprint library is facilitated.

Further, referring to fig. 2 and 3, on the basis of the first embodiment of the stand-alone voice storage method of the present application, there is provided a second embodiment of the stand-alone voice storage method, in which,

the step S10 includes:

s11, collecting voice information;

the voice information of a site is collected through the specified authenticated voice collecting equipment, so that the voice information collected by the authenticated collecting equipment can be uploaded to a public security voiceprint library, and meanwhile, the number of the collecting equipment, the number of collecting personnel, the collecting place and other information are recorded during collection.

Step S12, analyzing the number of speakers of the voice information;

because the environment for collecting the voice information is complex, the voice information mixed by a plurality of speakers can be collected usually, the collected voice information is identified through the voice identification function of the collection equipment, the collected voice information is clustered according to the voice characteristic information, and the clustering number of the clustering is the number of the speakers.

S13, separating the voice information according to the number of the speakers to obtain the voice information of each speaker;

according to the number of recognized speakers, the collected voice information is separated, the voice information corresponding to each speaker is extracted respectively, and meanwhile, the information such as the collection time, the collection place and the like is added to each separated voice information.

Step S14, establishing connection with a public security voiceprint library;

the public security voiceprint library stores voiceprint information of criminal suspects collected by a public security department, belongs to relatively confidential information, needs to be subjected to security verification when connected with the public security voiceprint library, and judges whether similar suspects exist through comparison with information in the public security voiceprint library by connecting the public security voiceprint library after voice information is collected.

Step S15, obtaining the connection result;

because the information stored in the public security voiceprint library is confidential, a stricter requirement is imposed on the environment connected with the public security voiceprint library, for example, in the case of an environment in which the transmission of a network is not limited well or a network environment in which information may be stolen such as being out of country, in order to ensure the information security in the public security voiceprint library, when the environmental condition cannot meet the connection requirement, the connection result cannot be obtained after the connection attempt is made to the public security voiceprint library.

Step S16, if the connection is successful, the uploading condition is met;

step S17, if the connection fails, the uploading condition is not met;

when the connection is successful, the voice information is automatically uploaded, and when the connection is failed, the voice information cannot be uploaded, the voice information needs to be stored in a local memory to wait for uploading.

In this embodiment, voice information is collected, the number of speakers included in the voice information is identified, the voice information of each speaker is separated, meanwhile, connection to the public security voiceprint library is attempted, a connection result with the public security voiceprint library is judged, the next operation is determined, the connection is successful, the voice information is automatically uploaded to the public security voiceprint library, and if the connection is failed, the voice information is stored in the local memory.

Further, referring to fig. 2 and 4, on the basis of the second embodiment of the stand-alone voice storing method of the present application, there is provided a third embodiment of the stand-alone voice storing method, in which,

step S20 includes:

step S21, determining voice characteristic information of the voice information;

step S22, determining the acquisition place information of the voice information;

step S23, determining the acquisition time information of the voice information;

step S24, forming the dimension information according to the voice characteristic information, the acquisition place information and the acquisition time information;

the voice characteristic information is obtained by analyzing the voice information, the voice characteristic information comprises information such as pitch, amplitude, frequency and the like of the voice, the acquisition place refers to a place where the voice information is acquired, and the acquisition time refers to time when the voice information is acquired. The voice information is collected to judge whether a suspect exists in the site, and the collected voice information is compared with information in a public security voiceprint library. Under the general condition, the range of motion of a suspect is relatively fixed and is easy to appear in some fixed places, and the acquisition place information and the acquisition time information can form the case dimension of voice information, namely, existing cases which are possibly similar to cases are judged through time and place, and then the voice information acquired by the cases which are possibly similar to each other in a public security voiceprint library is compared according to the voice characteristic information to determine whether the cases can be the same person or not.

In this embodiment, the acquired voice information is analyzed to obtain dimension information of the voice information, and the dimension information may be used for matching with the voice information in the public security voiceprint library and may also be used for storing the voice information in a local storage.

Further, referring to fig. 2 and 5, on the basis of the third embodiment of the stand-alone voice storing method of the present application, there is provided a fourth embodiment of the stand-alone voice storing method, in which,

step S30 includes:

step S31, finding the target storage partition pointed in the acquisition terminal according to the acquisition place information;

the terminal stores partial voice information which is already derived from the public security voiceprint library, the voice information derived from the public security voiceprint library is stored in a partition mode according to the location information of the acquired voice information, the city can be used as a partition range for dividing the acquisition location, the province can be used as a partition range, namely, the partition of the storage partition can be adjusted according to actual conditions, and after new voice information is acquired, a target storage partition is determined preliminarily according to the acquisition location information of the voice information.

Step S32, according to the voice characteristic information, comparing the similarity of the voice information and the storage information in the target storage partition;

after a possible target storage partition of the voice information is preliminarily determined according to the acquisition place information, whether existing voice information similar to the acquired voice information exists is determined according to the voice characteristic information, if the existing voice information similar to the acquired voice information exists, the voice information is stored in the corresponding target storage partition, and then the voice information can be uploaded to the corresponding storage partition in the public security voiceprint library according to the storage partition where the existing voice information exists during uploading.

Step S33, if the similarity is larger than a preset value, storing the voice information into the target storage partition;

and comparing the similarity of the collected voice information with the stored voice information, and if the similarity is higher, storing the voice information into a corresponding target storage partition.

Step S34, adding the index mark to the target storage partition according to the acquisition time information;

and for the storage partition subjected to data updating, adding an index mark to the storage partition according to the acquisition time information of the voice information stored in the storage partition, wherein the index mark can record the acquisition time of the voice information updated in the storage partition.

Step S35, if the similarity is not greater than a preset value, a temporary storage partition is established according to the acquisition place information;

and if the similarity is low, the correlation between the voice information and the currently stored voice information is low, and the voice information cannot be directly placed into the corresponding storage partition, establishing a new temporary storage partition for storing the voice information again according to the acquisition place information, wherein for the new temporary storage partition, the acquisition time information of the voice information is different from the existing voice information in the storage partition.

Step S36, storing the voice information into the temporary storage subarea according to the acquisition place information;

if the collected voice information has no corresponding temporary storage partition, establishing a new temporary storage partition and storing the voice information into the new temporary storage partition, and if the temporary storage partition with the same collected place information exists, directly storing the voice information into the corresponding existing storage partition.

Step S37, when the quantity of the voice information in the temporary storage partition exceeds a preset quantity, the temporary storage partition is changed into a fixed storage partition;

when the voice information in the temporary storage partition is more, the voice information in the temporary storage partition possibly forms the related voice information of a new case, the temporary storage partition is changed into a fixed storage partition, and then the collected voice information can be stored in the new fixed storage partition if the collected voice information meets the conditions.

Step S38, adding index marks to the fixed storage partition according to the acquisition time information of the voice information in the fixed storage partition;

similarly, an index mark is added to the new fixed storage partition, and like the previous index mark, the index mark of the new fixed storage partition contains the acquisition time information of the voice information updated last time by the storage partition.

In the embodiment, for the collected voice information, if the collected voice information can be stored in the existing target storage partition, the collected voice information is stored in the target storage partition and an index mark is added; and if the voice information cannot be stored in the existing storage partition, establishing a new temporary storage partition and storing the acquired voice information into the temporary storage partition.

Further, referring to fig. 2 and 6, on the basis of the fourth embodiment of the stand-alone voice storage method of the present application, there is provided a fifth embodiment of the stand-alone voice storage method, in which,

step S40 includes:

step S41, acquiring the uploading time of the last voice information uploading;

the collection terminal can automatically record each uploading operation, when uploading voice information each time, the newly collected voice information in the twice uploading interval can be uploaded to the public security voiceprint library, and therefore the uploading time of uploading the voice information last time needs to be acquired during each uploading.

Step S42, acquiring index marks of all storage partitions, and determining the target storage partition according to the uploading time;

and when the data updating time in the index marks of the storage partitions is later than the last uploading time, determining that the target storage partition needs to upload the voice information.

Step S43, uploading the voice information and the corresponding dimension information in the target storage partition according to the acquisition time information of the voice information in the target storage partition;

and after the target storage partition is determined, acquiring acquisition time information of each voice message in the target storage partition, if the acquisition time is later than the uploading time of the last uploaded data, uploading the voice message and the corresponding dimension information to a public security voiceprint library, and simultaneously storing the voice message in an area corresponding to the public security voiceprint library according to the dimension information.

In this embodiment, the target storage partition is quickly determined according to the index mark of the storage partition, the corresponding uploaded voice information is determined according to the dimension information of the voice information, and the voice information and the corresponding dimension information are uploaded.

In addition, referring to fig. 7, an embodiment of the present application further provides a stand-alone voice storage apparatus, where the stand-alone voice storage apparatus includes:

Optionally, the determining module is further configured to:

collecting voice information;

analyzing the number of speakers of the voice information;

Optionally, the determining module is further configured to:

establishing connection with a public security voiceprint library;

acquiring the connection result;

if the connection is successful, the uploading condition is met;

if the connection fails, the uploading condition is not satisfied.

Optionally, the obtaining module is further configured to:

determining voice characteristic information of the voice information;

determining the acquisition place information of the voice information;

determining the acquisition time information of the voice information;

Optionally, the storage module is further configured to:

according to the voice characteristic information, carrying out similarity comparison on the voice information and the storage information in the target storage partition;

Optionally, the storage module is further configured to:

storing the voice information into the temporary storage subarea according to the acquisition place information;

Optionally, the upload module is further configured to:

acquiring the uploading time of the last uploading voice information;

The specific implementation of the apparatus and the readable storage medium (i.e., the computer readable storage medium) of the present application has substantially the same extension as that of each embodiment of the stand-alone voice storage method, which is not described herein again.

It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a … …" does not exclude the presence of another identical element in a process, method, article, or apparatus that comprises the element.

The above-mentioned serial numbers of the embodiments of the present invention are only for description, and do not represent the advantages and disadvantages of the embodiments.

Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal (such as a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method according to the embodiments of the present invention.

While the present invention has been described with reference to the embodiments shown in the drawings, the present invention is not limited to the embodiments, which are illustrative and not restrictive, and it will be apparent to those skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope of the invention as defined in the appended claims.

Claims

1. A stand-alone voice storage method is characterized by comprising the following steps:

when the voice information is detected to meet a preset uploading condition, determining the target storage partition based on the index mark, and uploading the voice information and the corresponding dimension information in the target storage partition;

wherein the step of obtaining the dimension information of the voice information comprises:

determining voice characteristic information of the voice information;

determining the acquisition place information of the voice information;

determining the acquisition time information of the voice information;

and forming the dimension information according to the voice characteristic information, the acquisition place information and the acquisition time information.

2. A stand-alone voice storage method as claimed in claim 1, wherein said step of collecting voice information comprises:

collecting voice information;

analyzing the number of speakers of the voice information;

3. The stand-alone voice storage method according to claim 2, wherein the step of determining whether the upload condition is satisfied comprises:

establishing connection with a public security voiceprint library;

acquiring a connection result;

if the connection is successful, the uploading condition is met;

if the connection fails, the uploading condition is not satisfied.

4. The stand-alone voice storage method according to claim 1, wherein the step of storing the voice information into a target storage partition to which the dimensional information points in a collection terminal storage partition, and adding an index mark of the voice information to the target storage partition comprises:

5. A stand-alone voice storage method as claimed in claim 4 wherein said step of comparing said voice information with the stored information in said target memory partition is followed by the steps of:

6. The stand-alone voice storage method as claimed in claim 5, wherein said determining said target storage partition based on said index tag, said uploading said voice information and corresponding said dimension information in said target storage partition comprises:

acquiring the uploading time of the last uploading voice information;

7. A stand-alone voice storage apparatus, said stand-alone voice storage apparatus comprising:

the judging module is used for acquiring voice information and judging whether uploading conditions are met;

the uploading module is used for determining the target storage partition based on the index mark and uploading the voice information and the corresponding dimension information in the target storage partition when the voice information is detected to meet a preset uploading condition;

the acquisition module is further used for determining voice characteristic information of the voice information; determining the acquisition place information of the voice information; determining the acquisition time information of the voice information; and forming the dimension information according to the voice characteristic information, the acquisition place information and the acquisition time information.

8. A stand-alone voice storage device, the device comprising: memory, processor and a stand-alone speech storage program stored on said memory and executable on said processor, said stand-alone speech storage program when executed by said processor implementing the steps of the stand-alone speech storage method according to any of claims 1 to 6.

9. A readable storage medium, characterized in that the readable storage medium has stored thereon a computer program which, when being executed by a processor, carries out the steps of the stand-alone speech storage method according to any one of claims 1 to 6.