CN114461170A - Page reading method and system for mobile banking application program - Google Patents

Page reading method and system for mobile banking application program Download PDF

Info

Publication number
CN114461170A
CN114461170A CN202210105397.5A CN202210105397A CN114461170A CN 114461170 A CN114461170 A CN 114461170A CN 202210105397 A CN202210105397 A CN 202210105397A CN 114461170 A CN114461170 A CN 114461170A
Authority
CN
China
Prior art keywords
reading
page
dom
mobile banking
character string
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210105397.5A
Other languages
Chinese (zh)
Inventor
杨建福
王向东
李扬
王月亭
张帅
袁明杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong City Commercial Banks Alliance Co ltd
Original Assignee
Shandong City Commercial Banks Alliance Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong City Commercial Banks Alliance Co ltd filed Critical Shandong City Commercial Banks Alliance Co ltd
Priority to CN202210105397.5A priority Critical patent/CN114461170A/en
Publication of CN114461170A publication Critical patent/CN114461170A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination

Abstract

The invention provides a page reading method and a page reading system for a mobile banking application program, wherein the method comprises the steps of monitoring a click event of a page element; when the element is clicked for the first time, increasing the frame selection style, and determining a reading area; acquiring page text information in the region, and preprocessing the text information to form a character string; and when the same element is clicked for the second time, activating the element and reading the corresponding character string. According to the method, the page operation events of the collected bank APP are monitored in a unified mode, page elements are captured automatically, the reading area is determined according to the evidence clicking event, the reading area is preprocessed, the reading character string is obtained, the whole process does not need to depend on three adaptive parties, the applicability of the product is improved, a large amount of modification operation which depends on a system screen reading technology is avoided, the working efficiency is improved, and the marketing process of the product is accelerated.

Description

Page reading method and system for mobile banking application program
Technical Field
The invention relates to the technical field of artificial intelligence, in particular to a method and a system for reading pages aloud of application programs of a mobile phone bank.
Background
With the popularization of electronic products and businesses, requirements are put forward for barrier-free electronic banks, and particularly for visually impaired groups, page element reading assistance functions are required to be achieved.
Currently, page reading is realized by means of customized development of screen reading software of three parties in the market, or the page reading function (the blank function of an IOS (input operating system) and the talk back of an android system) of a mobile phone system is adapted.
The screen reading technology of the adaptive three-party or system needs to modify the existing pages one by one, the appointed tags are added, the whole page needs to be adjusted for adaptation, the influence on the system modification of the produced stable operation is large, the number of the pages of the mobile phone bank reaches thousands of pages, and the workload of modifying one by one is also very large.
Disclosure of Invention
The invention provides a page reading method and system for a mobile banking application program, which are used for solving the problem of high workload when the page reading function of a mobile banking APP is realized by means of the conventional mode.
In order to realize the purpose, the invention adopts the following technical scheme:
the invention provides a page reading method of a mobile banking application program in a first aspect, which comprises the following steps:
monitoring click events of page elements;
when the element is clicked for the first time, increasing the frame selection style, and determining a reading area;
acquiring page text information in the region, and preprocessing the text information to form a character string;
and when the same element is clicked for the second time, activating the element and reading the corresponding character string.
Further, the click event of the monitoring page element is specifically:
and traversing a page Document Object Model (DOM), adding a touch event to the element node page body, and monitoring the touch event to obtain a click event.
Further, the determining the reading area specifically includes:
searching a parent DOM element of the current click element, and if the parent DOM element is configured with a preset tag, taking the current click element and the parent DOM element as an integral reading area;
if the parent DOM element is not configured with a preset tag, searching a child DOM element of the current click element, and if the child DOM element is configured with the preset tag, taking the current click element and the child DOM element as an integral reading area;
and if the preset tag is not configured on the sub DOM element, the current click element is a reading area.
Further, the preprocessing the text information to form the character string specifically includes:
traversing the child nodes of the elements in the reading area layer by layer;
and splicing the traversed reading information into character strings.
Further, the reading information comprises text information and the value of the input box.
Further, after the forming of the character string, the method further comprises the steps of:
and replacing polyphone characters in the character string with characters with the same pronunciation, and sending the processed character string to a voice synthesis interface.
The invention provides a page reading system of a mobile banking application program in a second aspect, which comprises:
the information acquisition unit is used for monitoring click events of the page elements;
the area selection unit is used for increasing frame selection styles and determining a reading area when the elements are clicked for the first time;
the text processing unit is used for acquiring the page text information in the region and carrying out text information preprocessing to form a character string;
and the reading unit is used for activating the elements and reading the corresponding character strings when the same elements are clicked for the second time.
Further, the system further comprises a preprocessing unit for traversing the page document object model DOM, adding touch events to the element node page bodies,
further, the area selection unit includes:
the parent DOM element searching sub-unit is used for searching a parent DOM element of the current click element, and if the parent DOM element is configured with a preset tag, the current click element and the parent DOM element are used as an integral reading area;
the child DOM element searching child unit is used for searching a child DOM element of the current click element when the parent DOM element is not configured with a preset tag, and taking the current click element and the child DOM element as an integral reading area if the child DOM element is configured with the preset tag; and if the preset tag is not configured on the sub DOM element, the current click element is a reading area.
A third aspect of the present invention provides a computer storage medium having stored thereon computer instructions which, when executed on the speakable system, cause the speakable system to perform the steps of the speakable method.
The page reading system of the second aspect of the present invention can implement the methods of the first aspect and the implementation manners of the first aspect, and achieve the same effects.
The effect provided in the summary of the invention is only the effect of the embodiment, not all the effects of the invention, and one of the above technical solutions has the following advantages or beneficial effects:
according to the invention, the page operation events of the collected bank APP are monitored uniformly, the page elements are captured automatically, the evidence click event is used for determining the reading area, the reading area is preprocessed, and the reading character string is obtained.
Drawings
In order to more clearly illustrate the embodiments or technical solutions in the prior art of the present invention, the drawings used in the description of the embodiments or prior art will be briefly described below, and it is obvious for those skilled in the art that other drawings can be obtained based on these drawings without creative efforts.
FIG. 1 is a schematic flow diagram of an embodiment of the method of the present invention;
fig. 2 is a schematic structural diagram of an embodiment of the system of the present invention.
Detailed Description
In order to clearly explain the technical features of the present invention, the following detailed description of the present invention is provided with reference to the accompanying drawings. The following disclosure provides many different embodiments, or examples, for implementing different features of the invention. To simplify the disclosure of the present invention, specific example components and arrangements are described below. Furthermore, the present invention may repeat reference numerals and/or letters in the various examples. This repetition is for the purpose of simplicity and clarity and does not in itself dictate a relationship between the various embodiments and/or configurations discussed. It should be noted that the components illustrated in the figures are not necessarily drawn to scale. Descriptions of well-known components and processing techniques and procedures are omitted so as to not unnecessarily limit the invention.
The invention considers that the influence range of the general modification scheme is larger, and the system screen reading technology is relied on, and the voice reading link is not controllable. Unified monitoring and interception aiming at page operation events, automatic capturing and automatic label adding of page elements are designed, voice synthesis services which are already possessed by the alliance are combined, a voice synthesis engine is called to realize voice synthesis after page reading text information is captured, and then playing is returned, so that the page reading function is integrally realized.
As shown in fig. 1, a page reading method for a mobile banking application provided in the embodiment of the present invention includes the following steps:
s1, monitoring click events of the page elements;
s2, when the element is clicked for the first time, frame selection patterns are added, and a reading area is determined;
s3, acquiring page text information in the region, and preprocessing the text information to form a character string;
and S4, when the same element is clicked for the second time, activating the element and reading the corresponding character string.
In step S1, the click event of the monitoring page element is specifically:
traversing the Document Object Model DOM (Document Object Model) of the page, which allows the script (js) to control the Web pages, windows and documents, this embodiment relates to the standard HTML DOM-a standard Model for HTML documents, in other words, it is the Object connecting the page and the script or programming language), adding touch events to the element node page body, acquiring click events by monitoring the touch events, and triggering the click events earlier at the mobile terminal than the click events.
In step S2, when an element is clicked for the first time, the element is selected, the frame selection style is added, and bubbling and default events are prevented, so that the clicked element is not activated (i.e., not activated, i.e., not read out). Determining when the clicked element should be framed for overall reading, the parent DOM element is searched for whether a configured aria-atomic tag exists, and if so, the tag is regarded as a block for overall reading, that is, the current clicked element and the parent DOM element are taken as an overall reading area. If the DOM child element is not searched for whether the aria-atomic ═ true exists, if yes, the DOM child element is selected, namely the current clicked element and the child DOM element are taken as the whole reading area, and if not, the clicked element is selected, namely the current clicked element is taken as the reading area.
Wherein the DOM has different nodes: the document nodes, the element nodes, the attribute nodes and the text nodes are arranged from top to bottom in sequence. Elements have standard tag definitions, such as body, div, ul, li, text, etc., and the hierarchy can be obtained according to standard API methods.
Asia is an abbreviation for "acceptable Rich Internet Applications". It is a set of development standard specifications issued by W3C (Web barrier-free promotion organization) for disabled people to access internet applications.
The "true" is a tag attribute in the barrier-free standard specification, and is used to indicate whether the area content is completely broadcasted. Fingers may be true and false. When true, this indicates that the auxiliary device needs to notify the user of the entire area content.
In step S3, the preprocessing the text information to form a character string specifically includes:
traversing all child nodes (child nodes) of the elements in the reading area layer by layer;
and splicing the traversed reading information into character strings.
For the child node, taking a transfer transaction page in a mobile banking as an example, the whole page is a dom object top-level tag which is HTML, and the page has elements such as a title, a collection bank, an account number input box, an amount input box and the like. In code implementation, the child nodes are all page child nodes, and when the whole external style is selected, the contents of the child nodes can be sequentially read from top to bottom by selecting reading.
Wherein the reading information comprises text information and the value of the input box.
After the character string is formed, the method further comprises the following steps:
and replacing polyphone characters in the character string with characters with the same pronunciation, and sending the processed character string to a voice synthesis interface. Homophone substitution, for example: the 'line' in the three words of bank, in-line transfer and cross-line transfer is that polyphone should be read as 'hang', and certain error exists in the speech synthesis engine for processing the polyphone, so that the front end replaces the 'bank' with 'silver navigation' by using homophone before calling, and accurate reading speech is ensured to be used as a reading transition scheme.
The special symbols commonly used by banks include: and c, the speech synthesis engine cannot read aloud due to sentence break of special characters, commonly used characters such as RMB, dollars and percentile are reserved, and the rest of processing is not read aloud.
In step S2, when the same element is clicked a second time, it is checked whether the same element has been clicked once, and if the click once does not prevent the event, the activation element performs the reading of the character string formed in step S3.
As shown in fig. 2, the embodiment of the present invention further provides a page reading system for a mobile banking application, where the system includes an information acquisition unit 1, an area selection unit 2, a text processing unit 3, and a reading unit 4.
The information acquisition unit 1 is used for monitoring click events of page elements; the area selection unit 2 is used for increasing frame selection styles and determining a reading area when the elements are clicked for the first time; the text processing unit 3 is used for acquiring the page text information in the region and carrying out text information preprocessing to form a character string; and the reading unit 4 is used for activating the elements and reading the corresponding character strings when the same elements are clicked for the second time.
In an implementation manner of the embodiment of the system of the present invention, the system further includes a preprocessing unit 5, where the preprocessing unit 5 is configured to traverse the page document object model DOM and add a touch event to the element node page body.
The area selection unit comprises a parent DOM element searching sub-unit 21 and a child DOM element searching sub-unit 22.
The parent DOM element searching subunit 21 is configured to search a parent DOM element of the currently clicked element, and if the parent DOM element is configured with a preset tag, the currently clicked element and the parent DOM element are used as an overall reading area;
the child DOM element searching subunit 22 is configured to search, when the parent DOM element is not configured with the preset tag, a child DOM element of the currently clicked element, and if the child DOM element is configured with the preset tag, use the currently clicked element and the child DOM element as an overall reading area; and if the preset tag is not configured on the sub DOM element, the current click element is a reading area.
The embodiment of the present invention further provides a computer storage medium, where a computer instruction is stored in the computer storage medium, and when the computer instruction runs on the reading system, the reading system executes the steps of the reading method.
Although the embodiments of the present invention have been described with reference to the accompanying drawings, it is not intended to limit the scope of the present invention, and it should be understood by those skilled in the art that various modifications and variations can be made without inventive efforts by those skilled in the art based on the technical solution of the present invention.

Claims (10)

1. The page reading method of the mobile banking application program is characterized by comprising the following steps of:
monitoring click events of page elements;
when the element is clicked for the first time, increasing the frame selection style, and determining a reading area;
acquiring page text information in the region, and preprocessing the text information to form character strings;
and when the same element is clicked for the second time, activating the element and reading the corresponding character string.
2. The page reading method of the mobile banking application program as claimed in claim 1, wherein the click event of the monitoring page element is specifically:
and traversing a page Document Object Model (DOM), adding a touch event to the element node page body, and monitoring the touch event to obtain a click event.
3. The page reading method of the mobile banking application program according to claim 1, wherein the determining of the reading area specifically comprises:
searching a parent DOM element of the current click element, and if the parent DOM element is configured with a preset tag, taking the current click element and the parent DOM element as an integral reading area;
if the parent DOM element is not configured with a preset tag, searching a child DOM element of the current click element, and if the child DOM element is configured with the preset tag, taking the current click element and the child DOM element as an integral reading area;
and if the preset tag is not configured on the sub DOM element, the current click element is a reading area.
4. The page reading method of the mobile banking application program as claimed in claim 1, wherein the step of preprocessing the text message to form a character string specifically comprises:
traversing the child nodes of the elements in the reading area layer by layer;
and splicing the traversed reading information into character strings.
5. The method for reading aloud pages in mobile banking applications as claimed in claim 4, wherein said aloud information includes text information and input box values.
6. The method for reading aloud pages in an application program of a mobile banking system as claimed in claim 4, wherein after said character string is formed, the method further comprises the steps of:
and replacing polyphone characters in the character string with characters with the same pronunciation, and sending the processed character string to a voice synthesis interface.
7. The page reading system of the mobile banking application program is characterized by comprising the following components:
the information acquisition unit is used for monitoring click events of the page elements;
the area selection unit is used for increasing frame selection styles and determining a reading area when the elements are clicked for the first time;
the text processing unit is used for acquiring the page text information in the region and carrying out text information preprocessing to form a character string;
and the reading unit is used for activating the elements and reading the corresponding character strings when the same elements are clicked for the second time.
8. The system for page reading of a mobile banking application as claimed in claim 7, further comprising a preprocessing unit for traversing the page document object model DOM and adding touch events to the element node page body.
9. The page reading system of the mobile banking application of claim 7, wherein said area selection unit comprises:
the parent DOM element searching sub-unit is used for searching a parent DOM element of the current click element, and if the parent DOM element is configured with a preset tag, the current click element and the parent DOM element are used as an integral reading area;
the child DOM element searching subunit is used for searching a child DOM element of the current click element when the parent DOM element is not configured with the preset tag, and taking the current click element and the child DOM element as an integral reading area if the child DOM element is configured with the preset tag; and if the preset tag is not configured on the sub DOM element, the current click element is a reading area.
10. A computer storage medium having computer instructions stored thereon, which, when run on the speakable system of any of claims 7-9, cause the speakable system to perform the steps of the speakable method of any of claims 1-6.
CN202210105397.5A 2022-01-27 2022-01-27 Page reading method and system for mobile banking application program Pending CN114461170A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210105397.5A CN114461170A (en) 2022-01-27 2022-01-27 Page reading method and system for mobile banking application program

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210105397.5A CN114461170A (en) 2022-01-27 2022-01-27 Page reading method and system for mobile banking application program

Publications (1)

Publication Number Publication Date
CN114461170A true CN114461170A (en) 2022-05-10

Family

ID=81412024

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210105397.5A Pending CN114461170A (en) 2022-01-27 2022-01-27 Page reading method and system for mobile banking application program

Country Status (1)

Country Link
CN (1) CN114461170A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115248650A (en) * 2022-06-24 2022-10-28 南京伟柏软件技术有限公司 Screen reading method and device

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102937872A (en) * 2012-09-28 2013-02-20 北京航空航天大学 Interactive system and interactive method of human and touch screen device
CN102968461A (en) * 2012-11-05 2013-03-13 王逸竹 Gathering, editing and reading web page browser and realization method thereof
CN104050962A (en) * 2013-03-16 2014-09-17 李军 Multifunctional reader based on voice synthesis technology
CN104078038A (en) * 2013-03-28 2014-10-01 腾讯科技(深圳)有限公司 Page content aloud-reading method and device
CN106710326A (en) * 2015-07-31 2017-05-24 曾晓敏 Reading textbook system based on Chinese
CN111883134A (en) * 2020-07-24 2020-11-03 北京贝塔科技股份有限公司 Voice input method and device, electronic equipment and storage medium
CN112882679A (en) * 2020-12-21 2021-06-01 广州橙行智动汽车科技有限公司 Voice interaction method and device
CN113450762A (en) * 2021-06-23 2021-09-28 Oppo广东移动通信有限公司 Character reading method, device, terminal and storage medium

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102937872A (en) * 2012-09-28 2013-02-20 北京航空航天大学 Interactive system and interactive method of human and touch screen device
CN102968461A (en) * 2012-11-05 2013-03-13 王逸竹 Gathering, editing and reading web page browser and realization method thereof
CN104050962A (en) * 2013-03-16 2014-09-17 李军 Multifunctional reader based on voice synthesis technology
CN104078038A (en) * 2013-03-28 2014-10-01 腾讯科技(深圳)有限公司 Page content aloud-reading method and device
CN106710326A (en) * 2015-07-31 2017-05-24 曾晓敏 Reading textbook system based on Chinese
CN111883134A (en) * 2020-07-24 2020-11-03 北京贝塔科技股份有限公司 Voice input method and device, electronic equipment and storage medium
CN112882679A (en) * 2020-12-21 2021-06-01 广州橙行智动汽车科技有限公司 Voice interaction method and device
CN113450762A (en) * 2021-06-23 2021-09-28 Oppo广东移动通信有限公司 Character reading method, device, terminal and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
伽蓝: "用听的,不用看的", 《少儿科技》, no. 4, pages 10 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115248650A (en) * 2022-06-24 2022-10-28 南京伟柏软件技术有限公司 Screen reading method and device

Similar Documents

Publication Publication Date Title
US10878191B2 (en) Iterative ontology discovery
US8429099B1 (en) Dynamic gazetteers for entity recognition and fact association
US8831950B2 (en) Automated voice enablement of a web page
US8656274B2 (en) Automatic identification and storage of context information associated with phone numbers in computer documents
CN100524295C (en) An arrangement and a method relating to access to internet content
US8543404B2 (en) Proactive completion of input fields for automated voice enablement of a web page
US9047869B2 (en) Free form input field support for automated voice enablement of a web page
US10192544B2 (en) Method and system for constructing a language model
US20150046493A1 (en) Access and management of entity-augmented content
CN106372130B (en) A kind of static resource management method
WO2023065629A1 (en) Dialogue management method and system, and terminal and storage medium
CN109657121A (en) A kind of Web page information acquisition method and device based on web crawlers
CN106547511A (en) A kind of voice broadcasts method, browser client and the server of reading web page information
CN116955674B (en) Method and web device for generating graph database statement through LLM
CN103389972A (en) Method and device for obtaining text based on really simple syndication (RSS)
CN108090069A (en) A kind of method and apparatus for showing web page resources in a browser
CN114461170A (en) Page reading method and system for mobile banking application program
CN106055721A (en) Accessible webpage processing method and relevant equipment
US20080282150A1 (en) Finding important elements in pages that have changed
US20130073944A1 (en) Method and system for dynamically providing contextually relevant posts on an article
CN110472126A (en) A kind of acquisition methods of page data, device and equipment
CN109062890B (en) Label switching method and device and computer readable storage medium
CN104778232A (en) Searching result optimizing method and device based on long query
US11630805B2 (en) Method and device to automatically identify themes and based thereon derive path designator proxy indicia
CN113157351B (en) Voice plug-in construction method for quickly calling form tool

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination