CN102254550A - Method and system for reading characters on webpage - Google Patents

Method and system for reading characters on webpage Download PDF

Info

Publication number
CN102254550A
CN102254550A CN2010101795421A CN201010179542A CN102254550A CN 102254550 A CN102254550 A CN 102254550A CN 2010101795421 A CN2010101795421 A CN 2010101795421A CN 201010179542 A CN201010179542 A CN 201010179542A CN 102254550 A CN102254550 A CN 102254550A
Authority
CN
China
Prior art keywords
web page
title
plain text
server
text data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2010101795421A
Other languages
Chinese (zh)
Other versions
CN102254550B (en
Inventor
王新亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201010179542.1A priority Critical patent/CN102254550B/en
Publication of CN102254550A publication Critical patent/CN102254550A/en
Application granted granted Critical
Publication of CN102254550B publication Critical patent/CN102254550B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention relates to a method for reading characters on a webpage, comprising the following steps: acquiring a webpage file according to a webpage address in an access command; extracting plain text data from the webpage file; and converting the plain text data into voice data and displaying the voice data. According to the method and system for reading characters on a webpage, users can browse webpage information through hearing by the way of obtaining the webpage file, extracting the plain text data from the webpage file, converting the plain text data into the voice data and displaying the voice data, thus relieving visual fatigue of the users.

Description

Method and system read aloud in the webpage literal
Technical field
The present invention relates to communication technical field, relate in particular to the webpage literal and read aloud method and system.
Background technology
Mobile Internet is to be produced by the fusion of mobile communication and internet, and the open telecommunications basic network of a new generation of high-quality services such as speech, fax, data, image, multimedia can be provided simultaneously, is the important component part that national information is built.Mobile Internet can allow the user in moving by mobile device such as mobile phone, personal digital assistant (Personal Digital Assistant, PDA) wait portable terminal at any time, access internet (Internet) everywhere.
Along with the high development of mobile Internet, people's every day, all a large amount of terminals such as mobile phone, personal digital assistant, net book of passing through were passed through the mobile communications network browsing page, and the quantity of information of being browsed presents the gesture of continuous rapid growth.But because terminal screen sizes such as mobile phone, personal digital assistant, net book are little, thereby the little visual fatigue that very easily causes of font, and because the little displaying contents that makes of screen size is limited, need continuous roll screen or page turn over operation, cumbersome when making user's browsing page, user experience is very poor.
Summary of the invention
The object of the present invention is to provide a kind of webpage literal to read aloud method and system, is speech data with the webpage text conversion, makes that the user can be by sense of hearing browsing page information.
The invention provides a kind of webpage literal and read aloud method, comprising: obtain web page files according to the web page address in the access instruction; Extract plain text data from web page files; Plain text data is converted to speech data and broadcast.
Preferably, the above-mentioned step of obtaining web page files according to the web page address in the access instruction specifically comprises: terminal receives the access instruction that comprises web page address, sends webpage according to web page address to web page server and obtains request; Web page server obtains the web page files corresponding with web page address, and sends to terminal.
Preferably, above-mentioned step from web page files extraction plain text data specifically comprises: terminal is extracted web page title or Web page text from web page files; With web page title or Web page text, be converted to title plain text data or text plain text data.
Preferably, the above-mentioned step that plain text data is converted to speech data and plays specifically comprises: terminal is analyzed title plain text data or text plain text data, and produces rhythm running parameter; According to synthetic title speech data of rhythm running parameter or text speech data, and play.
Preferably, the above-mentioned step of obtaining web page files according to the web page address in the access instruction specifically comprises: terminal receives the access instruction that comprises web page address, sends conversion request according to web page address to web page server; Web page server obtains the web page files corresponding with web page address.
Preferably, above-mentioned step from web page files extraction plain text data specifically comprises: web page server extracts web page title or Web page text from web page files; With web page title or Web page text, be converted to title plain text data or text plain text data.
Preferably, the above-mentioned step that plain text data is converted to speech data and plays specifically comprises: web page server is analyzed title plain text data or text plain text data, and produces rhythm running parameter; According to synthetic title speech data of rhythm running parameter or text speech data, and send to terminal; Terminal plays title speech data or text speech data.
Preferably, the above-mentioned step of obtaining web page files according to the web page address in the access instruction specifically comprises: terminal receives the access instruction that comprises web page address, sends conversion request according to web page address to change server; Change server sends webpage to web page server and obtains request according to web page address; Web page server obtains the web page files corresponding with web page address, and sends to change server.
Preferably, above-mentioned step from web page files extraction plain text data specifically comprises: change server extracts web page title or Web page text from web page files; With web page title or Web page text, be converted to title plain text data or text plain text data.
Preferably, the above-mentioned step that plain text data is converted to speech data and plays specifically comprises: change server is analyzed title plain text data or text plain text data, and produces rhythm running parameter; According to synthetic title speech data of rhythm running parameter or text speech data, and send to terminal; Terminal plays title speech data or text speech data.
The present invention also provides a kind of terminal, comprising: control module, be used for obtaining web page files according to the web page address of access instruction, and extract plain text data from web page files; The phonetic synthesis module is used for plain text data is converted to speech data; The speech play module is used to play speech data.
Preferably, above-mentioned control module comprises obtains submodule, extraction submodule, and the conversion submodule; Obtain submodule, be used for sending webpage to web page server and obtain request, receive the web page files corresponding that web page server sends with web page address according to web page address; Extract submodule, be used for extracting web page title or Web page text from web page files; The conversion submodule is used for web page title or Web page text, is converted to title plain text data or text plain text data.
Preferably, above-mentioned phonetic synthesis module also is used for title plain text data or text plain text data are analyzed, and produces rhythm running parameter; According to synthetic title speech data of rhythm running parameter or text speech data.
Preferably, above-mentioned terminal also comprises interface module, is used to receive the access instruction that comprises web page address.
The present invention also provides a kind of webpage literal bright read apparatus, comprises web page server and above-mentioned terminal.
The present invention also provides a kind of server, comprising: control module, be used for obtaining web page files according to the web page address of access instruction, and extract plain text data from web page files; The phonetic synthesis module is used for plain text data is converted to speech data.
Preferably, above-mentioned server is a web page server, and above-mentioned control module comprises: obtain submodule, be used for the conversion request that comprises web page address that receiving terminal sends, and obtain the web page files corresponding with web page address; Extract submodule, be used for extracting web page title or Web page text from web page files; The conversion submodule is used for web page title or Web page text, is converted to title plain text data or text plain text data.
Preferably, above-mentioned server is a change server, and above-mentioned control module comprises: obtain submodule, the conversion request that comprises web page address that receiving terminal sends sends webpage according to web page address to web page server and obtains request; And the web page files corresponding that receives the web page server transmission with web page address; Extract submodule, be used for extracting web page title or Web page text from web page files; The conversion submodule is used for web page title or Web page text, is converted to title plain text data or text plain text data.
Preferably, above-mentioned phonetic synthesis module also is used for title plain text data or text plain text data are analyzed, and produces rhythm running parameter; According to synthetic title speech data of rhythm running parameter or text speech data.
The present invention also provides a kind of webpage literal bright read apparatus, comprises terminal and web page server.
The present invention also provides a kind of webpage literal bright read apparatus, comprises terminal, web page server and change server.
Method and system read aloud in webpage literal provided by the invention, by obtaining web page files, extracts the plain text data in the web page files, and plain text data is converted to speech data, and the broadcast speech data, make the user to alleviate user's visual fatigue by sense of hearing browsing page information.Also can help simultaneously disabled person's browsing page visually impaired.
Description of drawings
Fig. 1 reads aloud the schematic flow sheet of method one embodiment for webpage literal of the present invention;
Fig. 2 reads aloud the schematic flow sheet of another embodiment of method for webpage literal of the present invention;
Fig. 3 reads aloud the schematic flow sheet of the another embodiment of method for webpage literal of the present invention;
Fig. 4 is the structural representation of bright read apparatus one embodiment of webpage literal of the present invention;
Fig. 5 is the structural representation of bright another embodiment of read apparatus of webpage literal of the present invention;
Fig. 6 is the structural representation of the another embodiment of the bright read apparatus of webpage literal of the present invention;
Fig. 7 is the structural representation of control module.
Embodiment
Literal of the present invention can be character string or the single character that comprises punctuate, it for example can be single Chinese character, or the word of forming by a plurality of Chinese character, the domain names that can also be single English alphabet, English word, form by English alphabet and punctuation mark, or the network address of forming by arabic numeral and punctuation mark; Certainly, can also comprise other form, the present invention does not limit to literal definition.
Generally speaking, the total technical scheme of the present invention is: at first obtain web page files according to the web page address in the access instruction, extract plain text data from web page files then, at last plain text data is converted to speech data and broadcast.
After the present invention can obtain web page files from web page server by terminal, extract plain text data, then plain text data is converted to speech data and broadcast from web page files; After also can obtaining web page files, extract text data, and plain text data is converted to speech data, send terminal plays then to from web page files by web page server; Can also after web page server obtains web page files, extract text data by change server, and plain text data is converted to speech data, send terminal plays then to.Plain text data among the present invention comprises title plain text data and text plain text data.Terminal can be by existing speech player play title speech data or text speech data, perhaps the playback interface play title speech data or the text speech data that provide of calling system.
Below in conjunction with the drawings and specific embodiments technical solution of the present invention is described in further detail, can be implemented so that those skilled in the art can better understand the present invention also, but illustrated embodiment is not as a limitation of the invention.
Fig. 1 reads aloud the schematic flow sheet of method one embodiment for webpage literal of the present invention.
In the present embodiment, finish obtaining of web page files by terminal, the extraction of plain text data and speech data synthetic, and play speech data.Terminal is the equipment with computing, storage and information exchange functions, can be smart mobile phone, and the personal digital assistant (Personal Digital Assistant, PDA).Particularly, method read aloud in the webpage literal in the present embodiment, comprises the steps:
Step S101, terminal receives the access instruction that comprises web page address, sends webpage according to web page address to web page server and obtains request.
In the present embodiment, the user can be by the user interface input web page address in terminal, or the mode by clickable hyperlinks, browser on terminal sends access instruction, start browser loading of plug-in or calling system interface startup service routine by this access instruction, send webpage to web page server and obtain request, to obtain the web page files corresponding with web page address.This user interface can be interface menu or button.Web page server can be Web server or Wap (Wireless ApplicationProtocol, Wireless Application Protocol) server, corresponding browser can be Web browser or Wap browser, the former is by HTTP (Hypertext Transfer Protocol, HTML (Hypertext Markup Language)) agreement is communicated by letter with Web server, and the latter communicates by letter with the Wap browser by the Wap agreement.
Step S102, web page server obtain the web page files corresponding with web page address, and send to terminal.
In the present embodiment, web page server extracts and obtains the corresponding web page files of web page address that carries in the request with webpage, and send to terminal by inquiry local page database.Such as when web page server is Web server, Web server is with HTML (Hypertext Markup Language, Hypertext Markup Language) or the web page files of WML (Wireless Markup Language, WAP Markup Language) form send to terminal.
Step S103, terminal is extracted web page title or Web page text from web page files.
In the present embodiment, after terminal receives web page files, at first remove irrelevant informations such as note in the page, script, style sheet.And then be text block, chained block, image block etc. with page division, remove other pieces except text block again.From text block, distinguish non-critical information pieces such as advertisement at last according to semanteme, only keep web page title and text.
Step S104, terminal is converted to title plain text data or text plain text data with web page title or Web page text.
In the present embodiment, terminal is carried out different conversions according to web page format, such as webpage, need remove HTML or WML label in web page title or the Web page text respectively, a retain header plain text data and text plain text data for html format or WML form.
Step S105, terminal is analyzed title plain text data or text plain text data, and produces rhythm running parameter.
In the present embodiment, at first, to title plain text data or the standardization of text plain text data.Such as, search misspelling, and the character that maybe can't pronounce lack of standardization is filtered out.Then, analyze the border of speech in title plain text data or the text plain text data or phrase, determine the pronunciation of literal, determine the pronunciation mode of numeral, surname, special character, proprietary word and various polyphones simultaneously.At last, the punctuation mark that occurs on structure, composition and the diverse location according to title plain text data or text plain text data, the conversion of the tone and the weight mode of unisonance not when determining pronunciation, and produce rhythm running parameter.
Step S106, terminal is according to synthetic title speech data of rhythm running parameter or text speech data, and broadcast.
In the present embodiment, terminal is at first selected parameters,acoustic from the pronunciation storehouse, then according to rhythm running parameter, produces title speech data or text speech data by composition algorithm, and plays.Terminal can be play speech data by existing speech player, and perhaps the playback interface that provides of calling system is play speech data.
Fig. 2 reads aloud the schematic flow sheet of another embodiment of method for webpage literal of the present invention.
In the present embodiment, finish obtaining of web page files by web page server, the extraction of plain text data and speech data synthetic, and send to terminal, by the terminal plays speech data.Terminal is the equipment with computing, storage and information exchange functions, can be smart mobile phone, and the personal digital assistant (Personal DigitalAssistant, PDA).Particularly, method read aloud in the webpage literal in the present embodiment, comprises the steps:
Step S201, terminal receives the access instruction that comprises web page address, sends conversion request according to web page address to web page server.
In the present embodiment, the user can be by the user interface input web page address in terminal, or the mode by clickable hyperlinks, sending access instruction to the browser of terminal, the browser that starts on terminal by this access instruction sends conversion request to web page server.This user interface can be interface menu or button.Web page server can be Web server or Wap server, and corresponding browser can be Web browser or Wap browser, and the former communicates by letter with Web server by http protocol, and the latter communicates by letter with the Wap browser by the Wap agreement.
Step S202, web page server obtain the web page files corresponding with web page address.
In the present embodiment, web page server starts service routine by loading of plug-in or calling system interface, and inquiry local page database extracts and obtains the corresponding web page files of web page address that carries in the request with webpage.When being Web server or Wap server, then obtain the web page files of HTML corresponding or WML form with web page address when web page server.
Step S203, web page server extracts web page title or Web page text from web page files.
In the present embodiment, web page server will at first be removed irrelevant informations such as note in the page, script, style sheet.And then be text block, chained block, image block etc. with page division, remove other pieces except text block again.From text block, distinguish non-critical information pieces such as advertisement at last according to semanteme, only keep web page title and text.
Step S204, web page server are converted to title plain text data or text plain text data with web page title or Web page text.
In the present embodiment, web page server carries out different conversions according to web page format, such as webpage, need remove HTML or WML label in web page title or the Web page text respectively, a retain header plain text data and text plain text data for html format or WML form.
Step S205, web page server is analyzed title plain text data or text plain text data, and produces rhythm running parameter.
In the present embodiment, at first, to title plain text data or the standardization of text plain text data.Such as, search misspelling, and the character that maybe can't pronounce lack of standardization is filtered out.Then, analyze the border of speech in title plain text data or the text plain text data or phrase, determine the pronunciation of literal, determine the pronunciation mode of numeral, surname, special character, proprietary word and various polyphones simultaneously.At last, the punctuation mark that occurs on structure, composition and the diverse location according to title plain text data or text plain text data, the conversion of the tone and the weight mode of unisonance not when determining pronunciation, and produce rhythm running parameter.
Step S206, web page server synthesizes title speech data or text speech data according to rhythm running parameter, and sends to terminal.
In the present embodiment, web page server is at first selected parameters,acoustic from the pronunciation storehouse, then according to rhythm running parameter, produces title speech data or text speech data by composition algorithm, and sends to terminal.Web page server can pass through RTP, and (Real-time Transport Protocol RTP) sends to terminal with title speech data or text speech data.
Step S207, terminal plays title speech data or text speech data.
In the present embodiment, terminal can be by existing speech player play title speech data or text speech data, perhaps the playback interface play title speech data or the text speech data that provide of calling system.
Fig. 3 reads aloud the schematic flow sheet of the another embodiment of method for webpage literal of the present invention.
In the present embodiment, be provided with change server, finish obtaining of web page files by change server, the extraction of plain text data and speech data synthetic, and send to terminal, by the terminal plays speech data.Terminal is the equipment with computing, storage and information exchange functions, can be smart mobile phone, the personal digital assistant.Particularly, method read aloud in the webpage literal in the present embodiment, comprises the steps:
Step S301, terminal receives the access instruction that comprises web page address, sends conversion request according to web page address to change server.
In the present embodiment, the user can be by the user interface input web page address in terminal, or the mode by clickable hyperlinks, sending access instruction to the browser of terminal, the browser that starts on terminal by this access instruction sends conversion request to change server.This user interface can be interface menu or button.
Step S302, change server send webpage to web page server and obtain request according to web page address.
In the present embodiment, web page server can be Web server or Wap server, and corresponding browser can be Web browser or Wap browser, and the former communicates by letter with Web server by http protocol, and the latter communicates by letter with the Wap browser by the Wap agreement.Change server is the computer equipment with computing, storage and information exchange functions.
Step S303, web page server obtain the web page files corresponding with web page address, and send to change server.
In the present embodiment, web page server extracts and obtains the corresponding web page files of web page address that carries in the request with webpage, and send to terminal by inquiry local page database.Such as when web page server is Web server, Web is by inquiry Web database, and the web page files of HTML or WML form is sent to change server.
Step S304, change server extracts web page title or Web page text from web page files.
In the present embodiment, change server will at first be removed irrelevant informations such as note in the page, script, style sheet.And then be text block, chained block, image block etc. with page division, remove other pieces except text block again.From text block, distinguish non-critical information pieces such as advertisement at last according to semanteme, only keep web page title and text.
Step S305, change server are converted to title plain text data or text plain text data with web page title or Web page text.
In the present embodiment, change server carries out different conversions according to web page format, such as webpage, need remove HTML or WML label in web page title or the Web page text respectively, a retain header plain text data and text plain text data for html format or WML form.
Step S306, change server is analyzed title plain text data or text plain text data, and produces rhythm running parameter.
In the present embodiment, at first, to title plain text data or the standardization of text plain text data.Such as, search misspelling, and the character that maybe can't pronounce lack of standardization is filtered out.Then, analyze the border of speech in title plain text data or the text plain text data or phrase, determine the pronunciation of literal, determine the pronunciation mode of numeral, surname, special character, proprietary word and various polyphones simultaneously.At last, the punctuation mark that occurs on structure, composition and the diverse location according to title plain text data or text plain text data, the conversion of the tone and the weight mode of unisonance not when determining pronunciation, and produce rhythm running parameter.
Step S307, change server synthesizes title speech data or text speech data according to rhythm running parameter, and sends to terminal.
In the present embodiment, change server is at first selected parameters,acoustic from the pronunciation storehouse, then according to rhythm running parameter, produces title speech data or text speech data by composition algorithm, and sends to terminal.Web page server can send to terminal with title speech data or text speech data by RTP.
Step S308, terminal plays title speech data or text speech data.
In the present embodiment, terminal can be by existing speech player play title speech data or text speech data, perhaps the playback interface play title speech data or the text speech data that provide of calling system.
Fig. 4 is the structural representation of bright read apparatus one embodiment of webpage literal of the present invention.
In the present embodiment, the bright read apparatus of webpage literal comprises terminal 10a and web page server 20a, and terminal 10a comprises control module 410, phonetic synthesis module 420, speech play module 430, and interface module 440.
Control module 410 is used for obtaining web page files according to the web page address of access instruction, extracts plain text data from web page files; Phonetic synthesis module 420 is used for plain text data is converted to speech data; Speech play module 430 is used to play speech data.Interface module 440 is used to receive the access instruction that comprises web page address.
Fig. 5 is the structural representation of bright another embodiment of read apparatus of webpage literal of the present invention.
In the present embodiment, the bright read apparatus of webpage literal comprises terminal 10b and web page server 20b, and web page server 20b comprises control module 410 and phonetic synthesis module 420, and terminal 10b comprises speech play module 430 and interface module 440.
Control module 410 is used for obtaining web page files according to the web page address of access instruction, extracts plain text data from web page files; Phonetic synthesis module 420 is used for plain text data is converted to speech data; Speech play module 430 is used to play speech data.Interface module 440 is used to receive the access instruction that comprises web page address.
Fig. 6 is the structural representation of the another embodiment of the bright read apparatus of webpage literal of the present invention.
In the present embodiment, the bright read apparatus of webpage literal comprises terminal 10b, web page server 20a, and change server 30.Change server 30 comprises control module 410 and phonetic synthesis module 420, and terminal 10b comprises speech play module 430 and interface module 440.
Control module 410 is used for obtaining web page files according to the web page address of access instruction, extracts plain text data from web page files; Phonetic synthesis module 420 is used for plain text data is converted to speech data; Speech play module 430 is used to play speech data.Interface module 440 is used to receive the access instruction that comprises web page address.
Further, as shown in Figure 7, control module 410 comprises obtains submodule 411, extraction submodule 412, and conversion submodule 413.
When obtaining submodule 411 and be arranged in the terminal 10a shown in Figure 4, be used for sending webpage to web page server 20a and obtain request according to web page address, receive the web page files corresponding that web page server 20a sends with web page address;
When obtaining submodule 411 and be arranged in the web page server 20b shown in Figure 5, be used for the conversion request that comprises web page address that receiving terminal 10b sends, and obtain the web page files corresponding with web page address;
When obtaining submodule 411 and be arranged in the change server shown in Figure 6 30, be used for the conversion request that comprises web page address that receiving terminal 10b sends, send webpage according to web page address to web page server 20a and obtain request; And the web page files corresponding that receives web page server 20a transmission with web page address.
Extract submodule 412, be used for extracting web page title or Web page text from web page files; Conversion submodule 413 is used for web page title or Web page text, is converted to title plain text data or text plain text data.
Further, phonetic synthesis module 420 also is used for title plain text data or text plain text data are analyzed, and produces rhythm running parameter; According to synthetic title speech data of rhythm running parameter or text speech data.
Method and system read aloud in webpage literal provided by the invention, by obtaining web page files, extracts the plain text data in the web page files, and plain text data is converted to speech data, and the broadcast speech data, make the user to alleviate user's visual fatigue by sense of hearing browsing page information.Also can help simultaneously disabled person's browsing page visually impaired.
Below only be the preferred embodiments of the present invention; be not so limit claim of the present invention; every equivalent structure or equivalent flow process conversion that utilizes instructions of the present invention and accompanying drawing content to be done; or directly or indirectly be used in other relevant technical fields, all in like manner be included in the scope of patent protection of the present invention.

Claims (21)

1. method read aloud in a webpage literal, it is characterized in that, comprising:
Obtain web page files according to the web page address in the access instruction;
Extract plain text data from web page files;
Plain text data is converted to speech data and broadcast.
2. method read aloud in webpage literal as claimed in claim 1, it is characterized in that, the described step of obtaining web page files according to the web page address in the access instruction specifically comprises:
Terminal receives the access instruction that comprises web page address, sends webpage according to web page address to web page server and obtains request;
Web page server obtains the web page files corresponding with web page address, and sends to terminal.
3. method read aloud in webpage literal as claimed in claim 2, it is characterized in that, described step from web page files extraction plain text data specifically comprises:
Terminal is extracted web page title or Web page text from web page files;
With web page title or Web page text, be converted to title plain text data or text plain text data.
4. method read aloud in webpage literal as claimed in claim 3, it is characterized in that, the described step that plain text data is converted to speech data and plays specifically comprises:
Terminal is analyzed title plain text data or text plain text data, and produces rhythm running parameter;
According to synthetic title speech data of rhythm running parameter or text speech data, and play.
5. method read aloud in webpage literal as claimed in claim 1, it is characterized in that, the described step of obtaining web page files according to the web page address in the access instruction specifically comprises:
Terminal receives the access instruction that comprises web page address, sends conversion request according to web page address to web page server;
Web page server obtains the web page files corresponding with web page address.
6. method read aloud in webpage literal as claimed in claim 5, it is characterized in that, described step from web page files extraction plain text data specifically comprises:
Web page server extracts web page title or Web page text from web page files;
With web page title or Web page text, be converted to title plain text data or text plain text data.
7. method read aloud in webpage literal as claimed in claim 6, it is characterized in that, the described step that plain text data is converted to speech data and plays specifically comprises:
Web page server is analyzed title plain text data or text plain text data, and produces rhythm running parameter;
According to synthetic title speech data of rhythm running parameter or text speech data, and send to terminal;
Terminal plays title speech data or text speech data.
8. method read aloud in webpage literal as claimed in claim 1, it is characterized in that, the described step of obtaining web page files according to the web page address in the access instruction specifically comprises:
Terminal receives the access instruction that comprises web page address, sends conversion request according to web page address to change server;
Change server sends webpage to web page server and obtains request according to web page address;
Web page server obtains the web page files corresponding with web page address, and sends to change server.
9. method read aloud in webpage literal as claimed in claim 8, it is characterized in that, described step from web page files extraction plain text data specifically comprises:
Change server extracts web page title or Web page text from web page files;
With web page title or Web page text, be converted to title plain text data or text plain text data.
10. method read aloud in webpage literal as claimed in claim 9, it is characterized in that, the described step that plain text data is converted to speech data and plays specifically comprises:
Change server is analyzed title plain text data or text plain text data, and produces rhythm running parameter;
According to synthetic title speech data of rhythm running parameter or text speech data, and send to terminal;
Terminal plays title speech data or text speech data.
11. a terminal is characterized in that, comprising:
Control module is used for obtaining web page files according to the web page address of access instruction, extracts plain text data from web page files;
The phonetic synthesis module is used for plain text data is converted to speech data;
The speech play module is used to play speech data.
12. terminal as claimed in claim 11 is characterized in that, described control module comprises obtains submodule, extraction submodule, and the conversion submodule:
The described submodule that obtains is used for sending webpage according to web page address to web page server and obtains request, receives the web page files corresponding with web page address that web page server sends;
Described extraction submodule is used for extracting web page title or Web page text from web page files;
Described conversion submodule is used for web page title or Web page text, is converted to title plain text data or text plain text data.
13. terminal as claimed in claim 12 is characterized in that, described phonetic synthesis module also is used for title plain text data or text plain text data are analyzed, and produces rhythm running parameter; According to synthetic title speech data of rhythm running parameter or text speech data.
14. terminal as claimed in claim 12 is characterized in that, also comprises interface module, is used to receive the access instruction that comprises web page address.
15. the bright read apparatus of webpage literal is characterized in that, comprises each described terminal of web page server and claim 11 to 14.
16. a server is characterized in that, comprising:
Control module is used for obtaining web page files according to the web page address of access instruction, extracts plain text data from web page files;
The phonetic synthesis module is used for plain text data is converted to speech data.
17. server as claimed in claim 16 is characterized in that, described server is a web page server, and described control module comprises:
Obtain submodule, be used for the conversion request that comprises web page address that receiving terminal sends, and obtain the web page files corresponding with web page address;
Extract submodule, be used for extracting web page title or Web page text from web page files;
The conversion submodule is used for web page title or Web page text, is converted to title plain text data or text plain text data.
18. server as claimed in claim 16 is characterized in that, described server is a change server, and described control module comprises:
Obtain submodule, the conversion request that comprises web page address that receiving terminal sends sends webpage according to web page address to web page server and obtains request; And the web page files corresponding that receives the web page server transmission with web page address;
Extract submodule, be used for extracting web page title or Web page text from web page files;
The conversion submodule is used for web page title or Web page text, is converted to title plain text data or text plain text data.
19., it is characterized in that described phonetic synthesis module also is used for title plain text data or text plain text data are analyzed as claim 17 or 18 described servers, and produce rhythm running parameter; According to synthetic title speech data of rhythm running parameter or text speech data.
20. the bright read apparatus of webpage literal is characterized in that, comprises the described server of terminal and claim 17.
21. the bright read apparatus of webpage literal is characterized in that, comprises the described server of terminal, web page server and claim 18.
CN201010179542.1A 2010-05-21 2010-05-21 Method and system for reading characters on webpage Active CN102254550B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010179542.1A CN102254550B (en) 2010-05-21 2010-05-21 Method and system for reading characters on webpage

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010179542.1A CN102254550B (en) 2010-05-21 2010-05-21 Method and system for reading characters on webpage

Publications (2)

Publication Number Publication Date
CN102254550A true CN102254550A (en) 2011-11-23
CN102254550B CN102254550B (en) 2015-06-17

Family

ID=44981762

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010179542.1A Active CN102254550B (en) 2010-05-21 2010-05-21 Method and system for reading characters on webpage

Country Status (1)

Country Link
CN (1) CN102254550B (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102629936A (en) * 2012-03-12 2012-08-08 华为终端有限公司 Method for mobile terminal to process text, related device and system
CN102968461A (en) * 2012-11-05 2013-03-13 王逸竹 Gathering, editing and reading web page browser and realization method thereof
CN103377238A (en) * 2012-04-26 2013-10-30 腾讯科技(深圳)有限公司 Method and browser for processing webpage information
CN103871399A (en) * 2012-12-10 2014-06-18 腾讯科技(深圳)有限公司 Method and device for text message playing
WO2014101378A1 (en) * 2012-12-28 2014-07-03 深圳创维数字技术股份有限公司 Page control method and browser
CN104078038A (en) * 2013-03-28 2014-10-01 腾讯科技(深圳)有限公司 Page content aloud-reading method and device
CN105427855A (en) * 2015-11-09 2016-03-23 上海语知义信息技术有限公司 Voice broadcast system and voice broadcast method of intelligent software
CN105975469A (en) * 2015-12-01 2016-09-28 乐视致新电子科技(天津)有限公司 Method and device for browsing web page of browser
CN106547806A (en) * 2015-09-23 2017-03-29 阿里巴巴集团控股有限公司 Page loading method and device
CN106547511A (en) * 2015-09-16 2017-03-29 广州市动景计算机科技有限公司 A kind of voice broadcasts method, browser client and the server of reading web page information
CN106936908A (en) * 2017-03-10 2017-07-07 广州华多网络科技有限公司 A kind of phonic warning method and relevant apparatus based on web
CN108763500A (en) * 2018-05-30 2018-11-06 深圳壹账通智能科技有限公司 Voice-based Web browser method, device, equipment and storage medium
CN109788127A (en) * 2018-12-20 2019-05-21 努比亚技术有限公司 A kind of acquisition methods of text information, mobile terminal and storage medium
CN111367490A (en) * 2020-02-28 2020-07-03 广州华多网络科技有限公司 Voice playing method and device and electronic equipment
CN114461171A (en) * 2022-01-27 2022-05-10 山东省城市商业银行合作联盟有限公司 Method and system for reading web bank pages

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1337817A (en) * 2000-08-16 2002-02-27 庄华 Interactive speech polling of radio web page content in telephone
CN1666199A (en) * 2002-07-02 2005-09-07 艾利森电话股份有限公司 An arrangement and a method relating to access to internet content
CN101055575A (en) * 2006-04-13 2007-10-17 北京闻言科技有限公司 Method for listening web page
CN101398839A (en) * 2008-10-23 2009-04-01 浙江大学 Personalized push method for vocal web page news

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1337817A (en) * 2000-08-16 2002-02-27 庄华 Interactive speech polling of radio web page content in telephone
CN1666199A (en) * 2002-07-02 2005-09-07 艾利森电话股份有限公司 An arrangement and a method relating to access to internet content
CN101055575A (en) * 2006-04-13 2007-10-17 北京闻言科技有限公司 Method for listening web page
CN101398839A (en) * 2008-10-23 2009-04-01 浙江大学 Personalized push method for vocal web page news

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102629936B (en) * 2012-03-12 2016-03-30 华为终端有限公司 A kind of method of mobile terminal process text, relevant device and system
WO2013135167A1 (en) * 2012-03-12 2013-09-19 华为终端有限公司 Method, relevant device and system for processing text by mobile terminal
CN102629936A (en) * 2012-03-12 2012-08-08 华为终端有限公司 Method for mobile terminal to process text, related device and system
US9203877B2 (en) 2012-03-12 2015-12-01 Huawei Device Co., Ltd. Method for mobile terminal to process text, related device, and system
CN103377238A (en) * 2012-04-26 2013-10-30 腾讯科技(深圳)有限公司 Method and browser for processing webpage information
CN103377238B (en) * 2012-04-26 2016-04-06 腾讯科技(深圳)有限公司 The method of process info web and browser
CN102968461A (en) * 2012-11-05 2013-03-13 王逸竹 Gathering, editing and reading web page browser and realization method thereof
CN103871399A (en) * 2012-12-10 2014-06-18 腾讯科技(深圳)有限公司 Method and device for text message playing
CN103871399B (en) * 2012-12-10 2017-07-18 腾讯科技(深圳)有限公司 Text message player method and device
WO2014101378A1 (en) * 2012-12-28 2014-07-03 深圳创维数字技术股份有限公司 Page control method and browser
CN104078038A (en) * 2013-03-28 2014-10-01 腾讯科技(深圳)有限公司 Page content aloud-reading method and device
CN104078038B (en) * 2013-03-28 2019-03-01 腾讯科技(深圳)有限公司 A kind of content of pages reads aloud method and apparatus
WO2014154097A1 (en) * 2013-03-28 2014-10-02 Tencent Technology (Shenzhen) Company Limited Automatic page content reading-aloud method and device thereof
CN106547511A (en) * 2015-09-16 2017-03-29 广州市动景计算机科技有限公司 A kind of voice broadcasts method, browser client and the server of reading web page information
US11308935B2 (en) 2015-09-16 2022-04-19 Guangzhou Ucweb Computer Technology Co., Ltd. Method for reading webpage information by speech, browser client, and server
US10714074B2 (en) 2015-09-16 2020-07-14 Guangzhou Ucweb Computer Technology Co., Ltd. Method for reading webpage information by speech, browser client, and server
CN106547511B (en) * 2015-09-16 2019-12-10 广州市动景计算机科技有限公司 Method for playing and reading webpage information in voice, browser client and server
CN106547806A (en) * 2015-09-23 2017-03-29 阿里巴巴集团控股有限公司 Page loading method and device
CN105427855A (en) * 2015-11-09 2016-03-23 上海语知义信息技术有限公司 Voice broadcast system and voice broadcast method of intelligent software
CN105975469A (en) * 2015-12-01 2016-09-28 乐视致新电子科技(天津)有限公司 Method and device for browsing web page of browser
WO2017092312A1 (en) * 2015-12-01 2017-06-08 乐视控股(北京)有限公司 Method of browsing webpage on browser and device
CN106936908A (en) * 2017-03-10 2017-07-07 广州华多网络科技有限公司 A kind of phonic warning method and relevant apparatus based on web
CN108763500A (en) * 2018-05-30 2018-11-06 深圳壹账通智能科技有限公司 Voice-based Web browser method, device, equipment and storage medium
CN109788127A (en) * 2018-12-20 2019-05-21 努比亚技术有限公司 A kind of acquisition methods of text information, mobile terminal and storage medium
CN111367490A (en) * 2020-02-28 2020-07-03 广州华多网络科技有限公司 Voice playing method and device and electronic equipment
CN111367490B (en) * 2020-02-28 2024-04-09 广州华多网络科技有限公司 Voice playing method and device and electronic equipment
CN114461171A (en) * 2022-01-27 2022-05-10 山东省城市商业银行合作联盟有限公司 Method and system for reading web bank pages
CN114461171B (en) * 2022-01-27 2023-11-28 山东省城市商业银行合作联盟有限公司 Method and system for reading online banking page

Also Published As

Publication number Publication date
CN102254550B (en) 2015-06-17

Similar Documents

Publication Publication Date Title
CN102254550B (en) Method and system for reading characters on webpage
US6823311B2 (en) Data processing system for vocalizing web content
JP4225703B2 (en) Information access method, information access system and program
CN102708174B (en) Method and device for displaying rich media information in browser
CN100550007C (en) Analytic system and method based on a plurality of files of key element
CN105657129A (en) Call information obtaining method and device
CN101197849A (en) Method and device for commuting internet page into wireless application protocol page
CN102270206A (en) Method and device for capturing valid web page contents
CN110234080B (en) Information display method, device and system
CN104468959A (en) Method, device and mobile terminal displaying image in communication process of mobile terminal
CN104078038B (en) A kind of content of pages reads aloud method and apparatus
CN102141868B (en) Method for quickly operating information interaction page, input method system and browser plug-in
CN1936893A (en) Method and system for generating input-method word frequency base based on internet information
CN102402432A (en) Method for creating a multi-lingual web page
KR100792325B1 (en) Interactive dialog database construction method for foreign language learning, system and method of interactive service for foreign language learning using its
CN108806688A (en) Sound control method, smart television, system and the storage medium of smart television
CN101651938A (en) Telephone number recognition system for mobile terminal and application method thereof
CN104090923A (en) Method and device for displaying rich media information in browser
CN106126713A (en) Wearable device and synchronous applications message display method thereof
EP1168799A2 (en) Data processing system with vocalisation mechanism
CN105955967A (en) Data processing method and data processing device
CN103136235B (en) Data processing platform (DPP), data handling system and data processing method
CN107729573A (en) Information-pushing method and device
CN104281560B (en) Display method, device and terminal of memory text information
CN110970011A (en) Picture processing method, device and equipment and computer readable storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant