CN113378579A

CN113378579A - Method, system and electronic equipment for voice input of structured data

Info

Publication number: CN113378579A
Application number: CN202110606081.XA
Authority: CN
Inventors: 胡斌
Original assignee: May 8 Home Co ltd
Current assignee: May 8 Home Co ltd
Priority date: 2021-05-31
Filing date: 2021-05-31
Publication date: 2021-09-10

Abstract

The invention belongs to the technical field of computer applications and provides a method, a system and electronic equipment for voice input of structured data. The method comprises: obtaining voice information entered by a user; converting the voice information into text information; performing word segmentation on the text information; identifying fields and field contents in the segmented text information to generate structured data; providing an interface to display the structured data to the user; receiving user input to edit the structured data; and storing the structured data. The invention converts the voice information into text information and automatically corrects the text using a professional word-segmentation lexicon, which improves the user's input efficiency, reduces tedious manual input, and saves input time.

Description

Method, system and electronic equipment for voice input of structured data

Technical Field

The invention belongs to the technical field of computer application, and particularly relates to a method, a system and electronic equipment for voice input of structured data.

Background

In the current housekeeping (domestic service) industry, when a housekeeping practitioner publishes job-seeking information on a housekeeping platform or otherwise needs to interact with the platform, the practitioner generally has to enter various information manually, such as requirements, leads and orders. The items to be entered are numerous and tedious, and each must be typed in one by one, so input efficiency is low, the process takes a long time, and practitioners find the experience poor.

Disclosure of Invention

The present invention is directed to a method, a system and an electronic device for voice entry of structured data, so as to at least partially solve the above technical problems.

In order to solve the above technical problem, a first aspect of the present invention provides a method for voice entry of structured data, comprising: obtaining voice information entered by a user; converting the voice information into text information; performing word segmentation on the text information; identifying fields and field contents in the segmented text information to generate structured data; providing an interface to display the structured data to the user; receiving user input to edit the structured data; and storing the structured data.

As a second aspect of the present invention, there is provided a system for voice entry of structured data, in which:

a user inputs voice information through a client;

the server obtains the voice information input by the user at the client and converts the voice information into text information;

the server performs word segmentation on the text information;

the server identifies the fields and the field contents in the segmented text information and generates structured data;

the server provides an interface to display the structured data to the user at the client, and receives the user's input at the client to edit the structured data, after which the structured data is uploaded to the server again;

and the server stores the structured data.

A third aspect of the invention proposes a server comprising a processor and a memory for storing a computer-executable program, wherein the processor performs the method for voice entry of structured data when the computer-executable program is executed by the processor.

A fourth aspect of the present invention also provides a computer-readable medium storing a computer-executable program which, when executed, implements the method for voice entry of structured data.

Based on the above technical scheme, and compared with the prior art, the method for voice entry of structured data has at least one of the following beneficial effects:

The invention performs recognition according to an industry lexicon and core words and automatically corrects the voice text, which reduces content errors caused by pronunciation when the user enters information by voice.

The invention recognizes core vocabulary by semantic analysis, so that several items of information can be entered at one time, which effectively reduces the time cost of typing.

Drawings

FIG. 1 is a flow chart of a method of voice entry of structured data of the present invention.

FIG. 2 is a schematic flow chart diagram of a method for voice entry of structured data in accordance with one embodiment of the present invention.

Fig. 3 is a structural relationship diagram of a client/server according to an embodiment of the invention.

FIG. 4 is a block diagram of an application of the method for voice entry of structured data shown in accordance with an exemplary embodiment of the present invention.

Fig. 5 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.

Fig. 6 is a schematic diagram of a computer-readable recording medium of an embodiment of the present invention.

Detailed Description

In describing particular embodiments, specific details of structures, properties, effects, or other features are set forth in order to provide a thorough understanding of the embodiments by one skilled in the art. However, it is not excluded that a person skilled in the art may implement the invention in a specific case without the above-described structures, performances, effects or other features.

The flow chart in the drawings is only an exemplary flow demonstration, and does not represent that all the contents, operations and steps in the flow chart are necessarily included in the scheme of the invention, nor does it represent that the execution is necessarily performed in the order shown in the drawings. For example, some operations/steps in the flowcharts may be divided, some operations/steps may be combined or partially combined, and the like, and the execution order shown in the flowcharts may be changed according to actual situations without departing from the gist of the present invention.

The block diagrams in the figures generally represent functional entities and do not necessarily correspond to physically separate entities. I.e. these functional entities may be implemented in the form of software, or in one or more hardware modules or integrated circuits, or in different network and/or processing unit devices and/or microcontroller devices.

The same reference numerals denote the same or similar elements, components, or parts throughout the drawings, and thus a repetitive description thereof may be omitted hereinafter. It will be further understood that, although the terms first, second, third, etc. may be used herein to describe various elements, components, or sections, these elements, components, or sections should not be limited by these terms. That is, these terms are used only to distinguish one from another. For example, a first device may also be referred to as a second device without departing from the spirit of the present invention. Furthermore, the term "and/or" is intended to include all combinations of any one or more of the listed items.

In order to solve the technical problem, the invention provides a method, a system and an electronic device for voice entry of structured data.

In the present invention, the client refers to any electronic device capable of performing data transmission with other electronic devices, including desktop computers, notebook computers, tablet computers, mobile phones, and other devices.

The client is usually provided with a human-machine interaction interface, for example a display screen, a mouse and a keyboard, and this interface is usually visual. On mobile intelligent clients such as mobile phones, the human-machine interaction interface is usually a touch screen and buttons. In any case, the user may interact with the client through the human-machine interaction interface.

The client of the invention is mainly a client that stores data in the form of files, such as computers, mobile phones and servers running Windows, iOS, Android and other operating systems.

FIG. 1 is a flow chart of a method of voice entry of structured data of the present invention. As shown in fig. 1, the method for voice entry of structured data comprises: obtaining voice information entered by a user; converting the voice information into text information; performing word segmentation on the text information; identifying fields and field contents in the segmented text information to generate structured data; providing an interface to display the structured data to the user; receiving user input to edit the structured data; and storing the structured data.
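The five steps of fig. 1 can be sketched as a small pipeline. This is only an illustrative sketch, not the implementation of the invention: the class `EntryPipeline`, the toy helpers `fake_transcribe`, `whitespace_segment` and `keyword_identify`, and the sample utterance are all hypothetical, and a real system would call a speech recognition engine and a proper word segmenter in their place.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class EntryPipeline:
    """Chains the steps of fig. 1; every stage is an injected callable."""
    transcribe: Callable[[bytes], str]                # voice -> text
    segment: Callable[[str], List[str]]               # text -> tokens
    identify: Callable[[List[str]], Dict[str, str]]   # tokens -> fields
    store: Dict[str, Dict[str, str]] = field(default_factory=dict)

    def draft(self, audio: bytes) -> Dict[str, str]:
        """Convert, segment and identify: the data shown to the user."""
        text = self.transcribe(audio)
        tokens = self.segment(text)
        return self.identify(tokens)

    def confirm(self, record_id: str, draft: Dict[str, str],
                edits: Dict[str, str]) -> Dict[str, str]:
        """Apply the user's edits on top of the draft, then store it."""
        final = {**draft, **edits}
        self.store[record_id] = final
        return final


# Hypothetical stand-ins so the sketch runs without a speech engine.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def whitespace_segment(text: str) -> List[str]:
    return text.split()


def keyword_identify(tokens: List[str]) -> Dict[str, str]:
    """Content following a field keyword is filled into that field."""
    fields: Dict[str, str] = {}
    key: Optional[str] = None
    for token in tokens:
        if token in ("name", "telephone", "address"):
            key = token
            fields[key] = ""
        elif key is not None:
            fields[key] = (fields[key] + " " + token).strip()
    return fields


pipeline = EntryPipeline(fake_transcribe, whitespace_segment, keyword_identify)
draft = pipeline.draft(b"name Li telephone 158 address Beijing Changping")
final = pipeline.confirm("r1", draft, {"name": "Li Hua"})
```

Keeping the draft and the confirmation as separate calls mirrors the description: the structured data is first shown to the user and is stored only after the user's edits have been received.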

In a preferred embodiment, the method further comprises: providing an operation interface, wherein the operation interface comprises a voice entry operation area and an information display revision area, wherein

the voice entry operation area is used by the user to start or terminate a voice entry task;

the information display revision area is used to receive user input to edit the structured data.

In a preferred embodiment, the voice information entered by the user comprises: voice audio recorded from the moment the user clicks voice entry in the voice entry operation area until the voice entry is terminated.

In a preferred embodiment, the voice information is uploaded to a server and converted into text information;

the method further comprises sending the text information generated from the user's voice to the server for word segmentation, wherein the word segmentation automatically intercepts content according to features of the voice text.

Preferably, the structured data generated by identifying the fields and field contents in the segmented text information comprises: necessary text information and target text information of the user.

Preferably, 11 consecutive digits are intercepted as the user's contact information;

province and city address information is intercepted as the user's contact address;

and other voice data of the user is matched according to core vocabulary.

Preferably, the method further comprises: the server receives the voice information uploaded by a plurality of clients and feeds the structured data back to the plurality of clients; after being edited by the user, the structured data is uploaded to the server again and stored.

Preferably, managing the voice information entered by a plurality of users comprises:

adding the voice information entered by different users to a voice entry task queue;

and distributing the tasks in the voice entry task queue to different processors or processing devices for processing.

The invention also discloses a system for voice entry of structured data, in which:

a user inputs voice information through a client;

the server performs word segmentation on the text information;

and the server stores the structured data.

The invention also discloses a server for voice entry of structured data, comprising a processor and a memory, the memory being configured to store a computer-executable program; when the computer-executable program is executed by the processor, the processor performs the method, the processor being configured to perform the steps of:

according to a preferred embodiment of the present invention, the task server and the structured data server are the same server, and the task server is further configured to perform: receiving the voice entry task from a client;

according to a preferred embodiment of the present invention, the task server is further configured to perform: sending the structured data to the client;

according to a preferred embodiment of the present invention, the task server is not the same server as the structured data server, and the structured data server is further configured to perform: uploading the structured data to the task server.

In a preferred embodiment, the invention also proposes a computer-readable medium storing a computer-executable program which, when executed, implements the method.

To make the objects, technical solutions and advantages of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings and the following specific embodiments.

Step S101, acquiring voice information input by a user, and converting the voice information into text information.

FIG. 2 is a schematic flow chart of a method for voice entry of structured data in accordance with one embodiment of the present invention. As shown in fig. 2, a user opens the voice entry information page at a client and clicks a function key to start voice entry, and the client records the audio produced by the current user as voice information. The voice information entered by the user comprises the voice audio recorded from the moment the user clicks voice entry in the voice entry operation area until the voice entry is terminated. The voice audio is converted into text information, and the text information is uploaded to the server.

As a preferred embodiment, the voice information is obtained mainly through an input device of the client, and the input device may include a microphone and/or an earphone. While the user is entering voice information, the client pops up a voice entry dialog box and prompts the user to speak key words such as name, address, work history, desired post, etc.

In an embodiment, the client provides an operation interface through which the user operates the client and interacts with the server. The operation interface comprises a voice entry operation area and an information display revision area.

The voice entry operation area is used by the user to start or terminate a voice entry task and includes a voice entry start function key, a voice entry termination function key, a re-entry function key, and the like. The start function key may begin continuous recording on a click or on a long press; the termination function key may end voice entry on a click; re-entry may be triggered by sliding upward partway through a long press of the start function key, or by a separate function key that restarts voice entry. The information display revision area is used by the user to edit the structured data: the user can modify the text information fed back by the server after voice entry, and the area supports any data editing function, such as text input, stroke input and/or voice input.
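The start, terminate and re-entry keys described above behave like a small state machine. The sketch below is one possible reading under stated assumptions: the class `VoiceEntrySession` and its method names are hypothetical, and audio is modelled as byte chunks handed in by the caller rather than captured from a real microphone.

```python
from typing import List


class VoiceEntrySession:
    """Tracks one voice entry task driven by the three function keys."""

    def __init__(self) -> None:
        self.recording = False
        self._chunks: List[bytes] = []

    def start(self) -> None:
        """Start key: begin continuous recording from an empty buffer."""
        self._chunks = []
        self.recording = True

    def feed(self, chunk: bytes) -> None:
        """Audio arriving while not recording is ignored."""
        if self.recording:
            self._chunks.append(chunk)

    def re_record(self) -> None:
        """Re-entry key: discard what was recorded and start over."""
        self.start()

    def stop(self) -> bytes:
        """Termination key: end recording and return the audio so far."""
        self.recording = False
        return b"".join(self._chunks)


session = VoiceEntrySession()
session.start()
session.feed(b"first attempt")
session.re_record()             # user slides up to re-enter
session.feed(b"second ")
session.feed(b"attempt")
audio = session.stop()
session.feed(b"ignored")        # arrives after termination
```

Only the audio recorded between the last start (or re-entry) and the termination reaches the next step, which matches the definition of the entered voice information given earlier.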

Step S102, performing word segmentation on the text information.

The client sends the text information to the server, and the server performs word segmentation on it. The segmentation automatically intercepts content according to features of the voice text, which include the distinction between digits and words. For example, 11 consecutive digits are intercepted as the user's contact information, and province and city address information is intercepted as the user's contact address. Other voice data of the user is matched according to core vocabulary, including but not limited to professional vocabulary of the housekeeping industry such as infant nurse, nurse and accompanying carer. Core vocabulary is recognized by semantic analysis; for example, the content following "address" is filled in as the address, and the content following "contact information" is filled in as the contact information.
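The interception rules in this step can be illustrated with regular expressions. The sketch below makes several assumptions that are not in the source: the text is an English rendering of the utterance, an address is taken to look like "X Province Y City" or "Y City", the phone number `13800000000` is made up, and the names `PHONE_RE`, `ADDRESS_RE`, `CORE_VOCABULARY` and `intercept` are hypothetical.

```python
import re
from typing import Dict, Optional

# Exactly 11 consecutive digits, not embedded in a longer digit run.
PHONE_RE = re.compile(r"(?<!\d)\d{11}(?!\d)")

# Assumed address shape: optional "<name> Province", then "<name> City".
ADDRESS_RE = re.compile(r"(?:\w+ Province )?\w+ City")

# Hypothetical core vocabulary; longer terms come first so that
# "infant nurse" is preferred over the shorter "nurse".
CORE_VOCABULARY = ("infant nurse", "accompanying carer", "nurse")


def intercept(text: str) -> Dict[str, Optional[str]]:
    """Pull contact, address and post out of one recognized utterance."""
    phone = PHONE_RE.search(text)
    address = ADDRESS_RE.search(text)
    post = next((term for term in CORE_VOCABULARY if term in text), None)
    return {
        "contact": phone.group() if phone else None,
        "address": address.group() if address else None,
        "post": post,
    }


result = intercept(
    "telephone 13800000000 address Hebei Province Shijiazhuang City "
    "post infant nurse"
)
```

The lookarounds in `PHONE_RE` are what enforce "11 consecutive digits": a 12-digit run is rejected rather than truncated to its first 11 digits.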

In this embodiment, the process further includes automatically correcting the recognized text according to an industry word-segmentation lexicon; for example, a phrase misrecognized because of pronunciation is corrected to the intended industry term, such as "infant nurse". As a preferred embodiment, the user may enter a single item of information in one utterance, such as a name, contact information or an address. Furthermore, several items may be entered at one time in one continuous utterance, for example a name, the telephone number 158 and the address Beijing Changping 406; core words such as "name", "contact information" and "address" are recognized by semantic analysis, and each item of information is filled into its corresponding information area.
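Lexicon-based correction can be sketched with approximate string matching. Note that this swaps in a different technique from the one the source implies: the source corrects pronunciation errors in Chinese text and does not name an algorithm, whereas the sketch below uses spelling similarity from Python's standard `difflib` module on English tokens. The lexicon entries, the cutoff of 0.8 and the function name `correct_tokens` are assumptions.

```python
import difflib
from typing import List, Sequence

# Hypothetical industry lexicon; a real one would hold housekeeping terms.
INDUSTRY_LEXICON = ["nanny", "caregiver", "housekeeper"]


def correct_tokens(tokens: List[str],
                   lexicon: Sequence[str] = INDUSTRY_LEXICON,
                   cutoff: float = 0.8) -> List[str]:
    """Replace each token by its closest lexicon term, if close enough."""
    corrected = []
    for token in tokens:
        if token in lexicon:
            corrected.append(token)       # already a known term
            continue
        # get_close_matches returns at most n terms scoring >= cutoff.
        match = difflib.get_close_matches(token, lexicon, n=1, cutoff=cutoff)
        corrected.append(match[0] if match else token)
    return corrected


fixed = correct_tokens(["post", "nany", "caregivr", "address"])
```

A fairly high cutoff keeps ordinary words such as "post" and "address" untouched while still catching near misses of lexicon terms; the right value would have to be tuned on real recognition output.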

Step S103, identifying the fields and the field contents in the text information after word segmentation to generate structured data.

The structured data generated by identifying the fields and field contents in the segmented text information comprises necessary text information and target text information of the user. The necessary text information includes, for example, name, contact information, age and place of residence; the target text information includes but is not limited to work post, target salary and work experience. For example, when the user publishes job-seeking information for an infant nurse post, the necessary text information is: aunt, 158, 43 years old; and the target text information is: infant nurse, salary 8K, 5 years of rich work experience … ….

Further, the server distinguishes the necessary text information from the target text information in the structured data and feeds the structured data back to the user interface of the client.
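The split into necessary and target text information can be shown as a simple partition of the identified fields. The field names below are hypothetical English labels for the items listed in this step, and the sample values are the ones given in the example above.

```python
from typing import Dict

# Assumed field names for the two groups described in step S103.
NECESSARY_FIELDS = ("name", "contact", "age", "residence")
TARGET_FIELDS = ("post", "salary", "experience")


def split_structured_data(fields: Dict[str, str]) -> Dict[str, Dict[str, str]]:
    """Group identified fields into necessary and target information."""
    return {
        "necessary": {k: v for k, v in fields.items() if k in NECESSARY_FIELDS},
        "target": {k: v for k, v in fields.items() if k in TARGET_FIELDS},
    }


record = split_structured_data({
    "name": "aunt",
    "contact": "158",
    "age": "43",
    "post": "infant nurse",
    "salary": "8K",
    "experience": "5 years",
})
```

Grouping the fields this way lets the client show the two kinds of information in separate parts of the information display revision area, although the source does not prescribe a particular layout.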

Step S104, providing an interface to display the structured data for the user, and receiving user input to edit the structured data.

The server side feeds back the structured data of the user to the client side, and the user can preview and edit the data in the information display revision area at the client side.

Step S105, storing the structured data.

The server receives the voice information uploaded by a plurality of clients and feeds the structured data back to those clients; after the user edits the structured data, it is uploaded to the server again and stored. When the user confirms that the structured data fed back by the server is correct, either as received or after editing, the final structured data can be uploaded to the server through a function key of the client and stored.

In an embodiment, the server receives voice information uploaded by a plurality of clients and feeds the resulting structured data back to those clients. To improve the efficiency of voice entry of structured data, the server manages the voice information entered by the plurality of users by adding the voice information entered by different users to a voice entry task queue and distributing the tasks in the queue to different processors or processing devices for processing.
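The task queue described above can be sketched with Python's standard `queue` and `threading` modules. This is a simplified illustration: worker threads stand in for the "different processors or processing devices", each task is assumed to be a `(user_id, text)` pair, and `process_entry_tasks` is a hypothetical name.

```python
import queue
import threading
from typing import Callable, Dict, List, Tuple


def process_entry_tasks(tasks: List[Tuple[str, str]],
                        handler: Callable[[str], List[str]],
                        worker_count: int = 2) -> Dict[str, List[str]]:
    """Queue every user's entry task and let several workers drain it."""
    task_queue: "queue.Queue[Tuple[str, str]]" = queue.Queue()
    for task in tasks:
        task_queue.put(task)

    results: Dict[str, List[str]] = {}
    lock = threading.Lock()

    def worker() -> None:
        while True:
            try:
                user_id, text = task_queue.get_nowait()
            except queue.Empty:
                return                    # queue drained, worker exits
            output = handler(text)
            with lock:                    # guard the shared result map
                results[user_id] = output

    threads = [threading.Thread(target=worker) for _ in range(worker_count)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results


results = process_entry_tasks(
    [("user-1", "name Li"), ("user-2", "address Beijing City")],
    str.split,
)
```

Results are keyed by user, so the order in which workers finish does not affect what is fed back to each client.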

Fig. 3 is a structural relationship diagram of a client/server according to an embodiment of the invention. As shown in fig. 3, a user opens the voice entry information page at a client and clicks a function key to start voice entry, and the client records the audio produced by the current user as voice information. The server obtains the voice information entered by the user at the client and converts it into text information. The server then performs word segmentation on the text information and identifies the fields and field contents in the segmented text information to generate structured data. The server provides an interface to display the structured data to the user at the client and receives the user's input at the client to edit the structured data, after which the edited structured data is uploaded to the server again and the server stores it.

As shown in fig. 4, the system architecture 400 may include client devices 401, 402, 403, a network 404, and a server 405. Network 404 serves as a medium for providing communication links between client devices 401, 402, 403 and server 405. Network 404 may include various types of connections, such as wired or wireless communication links, or fiber optic cables.

A user may use client devices 401, 402, 403 to interact with the server 405 over the network 404 to receive or send messages, etc. Various APPs for the housekeeping industry may be installed on the client devices 401, 402, 403.

The client devices 401, 402, 403 may be various electronic devices that have a display screen, support APP browsing, and run an iOS or Android operating system, including but not limited to smart phones, tablets, laptop and desktop computers, and the like.

The client devices 401, 402, 403 may, for example, send and/or receive housekeeping industry information that the user uploads and/or downloads over the network. The client devices 401, 402, 403 may be used by the user for operations such as browsing and/or uploading information, that is, browsing and/or publishing information through the housekeeping industry platform on the client. The client devices 401, 402, 403 may also perform forwarding and/or commenting operations, for example, on content that a user has uploaded to the housekeeping industry platform.

The server 405 may be a server that provides various services, such as a background management server that supports the housekeeping content browsed by the user on the client devices 401, 402, 403. The server 405 may manage the downloading, uploading, receiving, browsing, editing, forwarding and/or commenting operations performed by the user on the client devices 401, 402, 403. The management may cover personal information, structured data, the authenticity of comments, and the like, and may require control by staff.

The server 405 may also be a server that supports the structured data that users upload and/or download over the network using the client devices 401, 402, 403. For example, a user uploads job information to the server 405 through the client device 403 (which may also be the client device 401 or 402), and the server 405 obtains the job information over the network. The server 405 may process the voice entry task, for example using a standardized structured data service, to generate structured data. The server 405 may then send the structured data over the network to a resource location of the client device 403 (which may also be the client device 401 or 402) on which the user is logged in, where the resource location is a page used by the housekeeping platform to display the user's job information on the client.

The server 405 may be a single physical server or may be composed of a plurality of servers. It should be noted that the method for voice entry of structured data provided by the embodiments of the present disclosure may be executed by the server 405 and/or the client devices 401, 402, 403; accordingly, a system for voice entry of structured data may be disposed in the server 405 and/or the client devices 401, 402, 403.

Fig. 4 is a structural relationship diagram of a client/server according to another embodiment of the present invention. In an embodiment, the server comprises a task server and a structured data server, which may be the same server or two different servers. The structured data server loads tasks from the task server, obtains the voice information entered by a user at a client according to the loaded tasks, converts the voice information into text information, performs word segmentation and error correction to generate structured data, receives edits to the structured data, and stores the final structured data.

In this embodiment, the task server and the structured data server are the same server, and the server obtains a voice entry task, where the voice entry task includes voice information, obtains voice information entered by a user, converts the voice information into text information, performs word segmentation on the text information, identifies fields and field contents in the text information after word segmentation to generate structured data, provides an interface to display the structured data to the user, receives user input to edit the structured data, and finally stores the structured data.

When the task server is not the same server as the structured data server, the structured data server is further configured to upload the structured data to the task server.

In this embodiment, the task server and the structured data server are different servers. A voice entry task, which includes the voice information entered by a user, is acquired from the task server; the voice information is converted into text information, word segmentation is performed on the text information, and the fields and field contents in the segmented text information are identified to generate structured data, which is sent to the structured data server. The client provides an interface to show the structured data to the user, the task server receives user input to edit the structured data, and the structured data server stores the structured data.

It will be appreciated by those skilled in the art that this embodiment is merely an example and that the architecture and settings of the task server and the structured data server may be chosen as needed. For example, the task server and the structured data server may be any devices with information processing capability; if they are not the same device, communication between the two must be guaranteed so that they can exchange information. Either of the two may also be implemented by something other than a single device, such as a cloud server based on distributed computing.

Fig. 5 is a schematic structural diagram of an electronic device according to an embodiment of the present invention. The electronic device includes a processor and a memory for storing a computer-executable program; when the program is executed by the processor, the processor performs the method for voice entry of structured data of the present invention. As previously mentioned, the electronic device of the present invention may be a server that receives entered voice information from a client and provides structured data to the client.

The electronic device implements the functions of the task server and the structured data server described above and the methods they execute.

As shown in fig. 5, the electronic device is in the form of a general purpose computing device. There may be one or more processors, and multiple processors can work together. The invention does not exclude distributed processing, i.e. the processors may be distributed over different physical devices. The electronic device of the present invention is not limited to a single entity and may be a combination of a plurality of physical devices.

The memory stores a computer executable program, typically machine readable code. The computer readable program may be executed by the processor to enable an electronic device to perform the method of the invention, or at least some of the steps of the method.

The memory may include volatile memory, such as Random Access Memory (RAM) and/or cache memory, and may also be non-volatile memory, such as read-only memory (ROM).

Optionally, in this embodiment, the electronic device further includes an I/O interface, which is used for data exchange between the electronic device and an external device. The I/O interface may be one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures.

It should be understood that the electronic device shown in fig. 5 is only one example of the present invention, and elements or components not shown in the above example may be further included in the electronic device of the present invention. For example, some electronic devices further include a display unit such as a display screen, and some electronic devices further include a human-computer interaction element such as a button, a keyboard, and the like. Electronic devices are considered to be covered by the present invention as long as the electronic devices are capable of executing a computer-readable program in a memory to implement the method of the present invention or at least a part of the steps of the method.

Fig. 6 is a schematic diagram of a computer-readable recording medium of an embodiment of the present invention. As shown in fig. 6, the computer-readable recording medium stores a computer-executable program which, when executed, implements the above-described method for voice entry of structured data of the present invention. A computer-readable signal medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A readable signal medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.

Program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C++ or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).

From the above description of the embodiments, those skilled in the art will readily understand that the present invention may be implemented by hardware capable of executing a specific computer program, such as the system of the present invention, and the electronic processing unit, the server, the client, the mobile phone, the control unit, the processor, etc. included in the system, and that the present invention may also be implemented by software executed by the client, the server, etc. implemented by computer software executing the method of the present invention. It should be noted, however, that the computer software for executing the method of the present invention is not limited to being executed by one or more specific hardware entities, but may also be executed by one or more specific hardware entities in a distributed manner, for example, the method steps of voice recording structured data executed by a computer program may be executed on a client. For computer software, the software product may be stored in a computer readable storage medium (which may be a CD-ROM, a usb disk, a removable hard disk, etc.) or may be distributed over a network, as long as it enables the electronic device to perform the method according to the present invention.

While the foregoing embodiments have described the objects, aspects and advantages of the present invention in further detail, it should be understood that the present invention is not inherently related to any particular computer, virtual machine or electronic device, and various general-purpose machines may be used to implement it. The invention is not limited to the specific embodiments described; all modifications, changes and equivalents that come within the spirit and scope of the invention are intended to be covered.

Claims

1. A method of voice entry of structured data, comprising:

acquiring voice information input by a user, and converting the voice information into text information;

performing word segmentation on the text information;

identifying fields and field contents in the text information after word segmentation so as to generate structured data;

providing an interface to present the structured data to the user and receiving user input to edit the structured data;

and storing the edited structured data.
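
By way of illustration only, the steps of claim 1 after speech-to-text conversion may be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the names `FIELD_KEYWORDS`, `segment`, `identify_fields`, `apply_edits`, `STORE` and `enter` are hypothetical, the whitespace split stands in for a real word segmenter, and the speech recognition step is assumed to have already produced the input text.

```python
# Hypothetical field vocabulary (keyword -> field name); not taken from the source.
FIELD_KEYWORDS = {"name": "name", "age": "age", "city": "city"}

# Hypothetical in-memory store standing in for server-side storage.
STORE: list[dict[str, str]] = []


def segment(text: str) -> list[str]:
    """Stand-in for word segmentation: drop commas and split on whitespace."""
    return text.replace(",", " ").split()


def identify_fields(tokens: list[str]) -> dict[str, str]:
    """A known keyword opens a field; following tokens become its content."""
    record: dict[str, str] = {}
    current = None
    for token in tokens:
        key = FIELD_KEYWORDS.get(token.lower())
        if key is not None:
            current = key
            record[current] = ""
        elif current is not None:
            record[current] = (record[current] + " " + token).strip()
    return record


def apply_edits(record: dict[str, str], edits: dict[str, str]) -> dict[str, str]:
    """Overlay the user's corrections on the generated structured data."""
    return {**record, **edits}


def enter(text: str, edits: dict[str, str]) -> dict[str, str]:
    """Segment, identify fields, apply user edits, then store the result."""
    record = apply_edits(identify_fields(segment(text)), edits)
    STORE.append(record)
    return record
```

In this sketch the edits dictionary plays the role of the user's revisions made through the interface; only the edited record is stored.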

2. The method of voice entry of structured data as claimed in claim 1, further comprising:

providing an operation interface, wherein the operation interface comprises: a voice input operation area, an information display revision area, wherein,

3. The method for voice entry of structured data according to claim 1 or 2, characterized in that: the voice information input by the user comprises: the voice audio recorded from the time the user clicks voice input in the voice input operation area until the voice input is terminated.

4. A method for voice entry of structured data according to claim 3, wherein: uploading the voice information to a server to convert the voice information into text information;

5. The method for voice entry of structured data according to any of claims 1 to 4, characterized in that: the structured data generated by identifying the fields and the field contents in the text information after word segmentation comprises: necessary text information and target text information of the user.

6. The method for voice entry of structured data according to claim 5, characterized in that: providing an interface to present the structured data to the user, wherein the structured data is obtained by:

extracting 11 consecutive digits as the user's contact number;

extracting province and city address information as the user's contact address;

and matching the user's other voice data against the core vocabulary.
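
By way of illustration only, the three extraction rules of claim 6 may be sketched as follows. This is a minimal sketch under stated assumptions: the gazetteer `PROVINCE_CITIES`, the vocabulary `CORE_VOCAB`, the output field names and the function `extract` are hypothetical, and a real system would load complete word stocks rather than the few entries shown.

```python
import re

# Exactly 11 consecutive digits, not embedded in a longer run of digits.
PHONE_RE = re.compile(r"(?<!\d)\d{11}(?!\d)")

# Hypothetical gazetteer: province -> cities (Guangdong, Hebei as samples).
PROVINCE_CITIES = {"广东省": ["深圳市", "广州市"], "河北省": ["石家庄市"]}

# Hypothetical core vocabulary: word -> field name
# (保姆 = nanny, 月嫂 = maternity matron, 住家 = live-in).
CORE_VOCAB = {"保姆": "job_type", "月嫂": "job_type", "住家": "work_mode"}


def extract(text: str) -> dict[str, str]:
    """Apply the three rules to already-recognized text."""
    record: dict[str, str] = {}

    # Rule 1: 11 consecutive digits become the contact number.
    phone = PHONE_RE.search(text)
    if phone:
        record["contact_number"] = phone.group()

    # Rule 2: a province immediately followed by one of its cities
    # becomes the contact address.
    for province, cities in PROVINCE_CITIES.items():
        for city in cities:
            if province + city in text:
                record["contact_address"] = province + city

    # Rule 3: remaining content is matched against the core vocabulary;
    # the first matching word for each field is kept.
    for word, field_name in CORE_VOCAB.items():
        if word in text and field_name not in record:
            record[field_name] = word

    return record
```

The lookarounds in `PHONE_RE` keep a 12-digit run from being misread as a contact number; the dictionary lookups are a simple substitute for whatever matching the core vocabulary performs in practice.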

7. The method for voice entry of structured data according to claim 6, characterized in that: the server receives the voice information uploaded by a plurality of clients, feeds back the structured data to the plurality of clients, and stores the structured data that is uploaded to the server again after being edited by the users.

8. The method for voice entry of structured data according to claim 7, wherein: managing voice information entered by a plurality of users, comprising:

9. A system for voice entry of structured data, comprising:

a user inputs voice information through a client;

the server performs word segmentation on the text information;

and the server stores the structured data.

10. A server for voice entry of structured data, comprising a processor, wherein the processor is configured to implement the method of any one of claims 1-9.

11. A computer-readable medium storing a computer-executable program, wherein the computer-executable program, when executed, implements the method of any of claims 1-9.