WO2017072865A1 - Testing device, testing method, recording medium, and program - Google Patents

Testing device, testing method, recording medium, and program

Publication number
WO2017072865A1
Authority
WO
WIPO (PCT)
Prior art keywords
document
browser
text
unit
inspection
Prior art date
Application number
PCT/JP2015/080288
Other languages
French (fr)
Japanese (ja)
Inventor
桂太 平中
彩美 木下
Original Assignee
楽天株式会社
Application filed by 楽天株式会社 filed Critical 楽天株式会社
Priority to JP2017547240A priority Critical patent/JP6356924B2/en
Priority to PCT/JP2015/080288 priority patent/WO2017072865A1/en
Publication of WO2017072865A1 publication Critical patent/WO2017072865A1/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/28 Error detection; Error correction; Monitoring by checking the correct order of processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing

Definitions

  • the present invention relates to an inspection apparatus, an inspection method, a recording medium, and a program for appropriately inspecting whether a document including text is correctly drawn by a browser.
  • When a document described in a predetermined markup language such as HTML (Hypertext Markup Language) (hereinafter simply referred to as a document) is displayed by a web browser (hereinafter simply referred to as a browser), the browser interprets the tags in the document.
  • Even though text and images are displayed neatly in some browsers, the layout may be broken in other browsers. Similar problems may occur between different versions of the same browser or between different terminals. Therefore, various methods for inspecting whether the layout of text and images is broken have been proposed.
  • Patent Document 1 discloses a method in which a web page described in HTML is given to two web browsers with different specifications, the rendered images are compared visually, the position, size, style, etc. of the elements in each image are compared, and any difference is detected as an error.
  • Although the inspection method disclosed in Patent Document 1 and the like can detect layout collapse such as part of a tab being hidden or an image protruding from the display area of the screen, it cannot detect defects in which characters are displayed overlapping each other or a character is missing. For this reason, it has not been possible to properly inspect whether a document including text is correctly rendered by the browser.
  • The present invention solves the above problems, and its object is to provide an inspection apparatus, an inspection method, a recording medium, and a program for appropriately inspecting whether a document including text is correctly rendered by a browser.
  • An inspection apparatus according to the present invention acquires an image in which the document is drawn by giving the document to the browser and causing it to be displayed on the screen or drawn in a virtual view of the browser; acquires recognized text by performing character recognition on the acquired image; extracts, from the document, the body text to be drawn; and determines whether the document is correctly rendered by the browser by comparing the recognized text with the body text.
  • FIG. 1 shows a hardware configuration of an inspection apparatus 1 according to an embodiment of the present invention.
  • the inspection apparatus 1 includes a storage device 11, a reception device 12, a display device 13, and a control unit 14.
  • The storage device 11 includes a recording medium such as a hard disk and stores various software, such as various browsers and inspection software for inspecting whether a document is drawn correctly.
  • the storage device 11 stores a plurality of documents described in HTML.
  • the document is not limited to the one described in HTML, and may be described in other markup languages such as XHTML (Extensible Hypertext Markup Language) and XML (Extensible Markup Language).
  • The storage device 11 may be configured by a non-volatile, non-transitory recording medium other than a hard disk, for example, a flash memory, an optical disk, or a magneto-optical disk.
  • the reception device 12 includes a keyboard, a mouse, a track pad, and the like.
  • The reception device 12 accepts a user instruction through a user operation and generates an input signal indicating the accepted instruction. The generated input signal is then supplied to the control unit 14.
  • the display device 13 includes a screen such as a liquid crystal display and displays various data such as text data, images, and moving images supplied from the control unit 14.
  • The control unit 14 includes a CPU (Central Processing Unit), a ROM (Read Only Memory), a RAM (Random Access Memory), and the like. A program stored in a non-transitory recording medium such as the ROM is read into the RAM, which is a temporary recording medium, and the commands included in the read program are executed by the CPU.
  • The control unit 14 receives the input signal supplied from the reception device 12 and carries out the user's instruction.
  • FIG. 2 shows a functional configuration of the inspection apparatus 1.
  • The control unit 14 reads the program stored in the ROM or the like into the RAM and controls the execution of the program, whereby the inspection apparatus 1 functions as a software storage unit 21, a document storage unit 22, a reception unit 23, an acquisition unit 24, a display unit 25, a recognition unit 26, an extraction unit 27, and a determination unit 28.
  • the software storage unit 21 and the document storage unit 22 are constructed in the storage device 11.
  • The reception unit 23 is realized by the cooperation of the reception device 12 and the control unit 14.
  • the acquisition unit 24, the recognition unit 26, the extraction unit 27, and the determination unit 28 are realized by the control unit 14.
  • the display unit 25 is realized by the cooperation of the display device 13 and the control unit 14.
  • The software storage unit 21 stores various software, such as various browsers and inspection software for inspecting whether a document is drawn correctly.
  • The various browsers include, for example, Internet Explorer (registered trademark), Mozilla Firefox (registered trademark), Google Chrome (registered trademark), and Opera (registered trademark).
  • The various browsers may also include browsers other than the well-known browsers described above.
  • the document storage unit 22 stores a plurality of documents described in HTML. Note that, as described above, the plurality of documents may be described in a markup language other than HTML.
  • The reception unit 23 accepts various requests and instructions from the user through the user's operation of the reception device 12. For example, it receives a request for inspecting whether a document is correctly drawn (hereinafter referred to as an inspection request), an instruction indicating the document selected by the user as the inspection target, and an instruction indicating the browser selected by the user as the inspection browser. The received request and instructions are then supplied to the acquisition unit 24.
  • the document is selected from a plurality of documents stored in the document storage unit 22.
  • the browser is selected from a plurality of browsers stored in the software storage unit 21.
  • When the acquisition unit 24 receives the inspection request from the reception unit 23, it acquires the inspection software from the software storage unit 21. Then, the inspection software is activated and initialization processing is performed.
  • the initialization process includes, for example, a process of canceling the designation when a specific browser is designated as an inspection browser, and a process of canceling the designation when a specific document is designated as an inspection target.
  • the acquiring unit 24 acquires the selected document from the document storage unit 22 as the inspection target based on the instruction.
  • The acquisition unit 24 acquires the selected browser from the software storage unit 21 as the inspection browser and starts the inspection browser.
  • the acquisition unit 24 acquires an image in which the document to be inspected is drawn by giving the document acquired as the inspection target to the inspection browser and displaying the document on the screen of the display device 13.
  • FIG. 3 shows an example of a document stored in the document storage unit 22.
  • the document stored in the document storage unit 22 is described in HTML.
  • the acquisition unit 24 gives the document shown in the figure to the inspection browser and displays it on the screen of the display device 13, thereby acquiring an image on which the document shown in the figure is drawn.
  • FIG. 4 shows an image 110 in which the document shown in FIG. 3 is drawn by the inspection browser.
  • the text included in the document shown in FIG. 3 is drawn at the position specified by the tag.
  • an image on which a document is drawn is not an image on which a source code written in HTML is drawn.
  • Two drawing defects appear in the image 110. One is that the character string “health food / supplement”, which should be displayed within the frame 110a on the left side of the image 110, partly protrudes from the frame 110a and is hidden behind the banner advertisement.
  • The other is that part of the character string “best for bulk buying ... very active!” in the frame 110b on the lower right side of the image 110 overlaps part of the character string “it can also be used at work, school, shops, leisure, etc.”
  • the display unit 25 displays the document given to the inspection browser on the screen of the display device 13 based on the control of the acquisition unit 24. Further, based on the control of the determination unit 28, a determination result indicating whether or not the document has been correctly rendered by the inspection browser is displayed on the screen of the display device 13.
  • the recognition unit 26 acquires the text drawn on the image as recognized text by recognizing the image acquired by the acquisition unit 24.
  • The recognition unit 26 divides the image acquired by the acquisition unit 24 into images each representing one character using, for example, morphological analysis, and extracts a feature amount (a feature vector indicating the features of the character) from each divided image.
  • character recognition is performed by comparing the extracted feature amount with the character feature amount stored in advance in the storage device 11.
  • a feature amount used for character recognition includes a directional line element feature using the outline of a character.
  • The recognition unit 26 also acquires position information indicating the position in the image from which each character obtained by character recognition was recognized. When a defect is found in the drawing of characters, this position information is used to display a warning at or near the position of the defect on the screen of the display device 13.
  • the extraction unit 27 extracts the text included in the document as the body text by removing the tag from the document acquired by the acquisition unit 24.
  • the body text is text to be drawn from the document and matches the recognized text acquired by character recognition when the document is correctly drawn by the inspection browser.
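The tag-removal step performed by the extraction unit 27 can be illustrated with a minimal Python sketch. This is not the patent's implementation: the class name `BodyTextExtractor`, the function `extract_body_text`, the choice to skip `script`, `style`, and `title` content, and the whitespace normalization are all assumptions made for illustration.

```python
from html.parser import HTMLParser


class BodyTextExtractor(HTMLParser):
    """Collects text nodes while dropping every tag (hypothetical helper)."""

    # Assumption: content of these elements is not drawn in the page body.
    SKIPPED = {"script", "style", "title"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only text outside skipped elements, with whitespace collapsed.
        if self._skip_depth == 0 and data.strip():
            self.parts.append(" ".join(data.split()))


def extract_body_text(document: str) -> str:
    """Return the text that remains after removing tags from the document."""
    parser = BodyTextExtractor()
    parser.feed(document)
    parser.close()
    return " ".join(parser.parts)
```

For example, `extract_body_text("<p>Health <b>food</b></p>")` yields the two text nodes joined by a space. A real inspection would also need to decide how hidden elements and alternative text are treated, which the source does not specify.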
  • FIG. 5A shows recognized text acquired by character recognition of the image 110 shown in FIG. 4, and FIG. 5B shows body text extracted from the document 100 shown in FIG.
  • In the dashed box 120a in the recognized text, part of the character string “health food / supplement”, namely the characters “n” and “g”, is missing.
  • These characters are missing because part of the character string “health food / supplement” is hidden behind the banner advertisement in the image 110 shown in FIG. 4, so the recognition unit 26 could not recognize the characters “n” and “g”. That is, if a character is missing in the inspection browser's rendering, the defect is reflected in the recognized text.
  • The symbol string “ ⁇ ⁇ ⁇ ⁇ # ⁇ ⁇ ⁇ ⁇ ⁇ *” in the dashed box 120b shows that the overlapping character strings displayed in the image 110 were erroneously recognized as a different character string by character recognition. Because no stored character matches a character displayed in an overlapping manner, each such character is recognized as another character or symbol with a similar shape. The overlapping character strings are therefore converted into a different character string in the recognized text. In this way, the defect of characters being displayed overlapping each other is also reflected in the recognized text.
  • the body text shown in FIG. 5B is a text extracted by removing the tag from the document, and thus does not include defects such as missing characters. Therefore, the comparison between the recognized text and the body text indicates that the document is drawn correctly if there is no difference between the two texts, and the document is not drawn correctly if there is a difference between the two texts.
  • the determining unit 28 compares the recognized text acquired by the recognizing unit 26 with the body text extracted by the extracting unit 27, and determines whether or not the document is correctly rendered by the inspection browser. If there is a character that does not appear in the body text among the characters that appear in the recognized text, it is determined that the document is not drawn correctly by the inspection browser. Also, if there are characters that do not appear in the recognized text among the characters that appear in the body text, it is determined that the document is not drawn correctly by the inspection browser. On the other hand, if the recognized text matches the body text, it is determined that the document has been correctly rendered by the inspection browser.
  • The determination unit 28 may compare the recognized text and the body text in units of characters or in units of words (character strings delimited by blank characters or symbols). The determination unit 28 may also switch between character-based and word-based comparison depending on the language in which the document is written. For example, when the document is written in English, the determination unit 28 may compare the recognized text and the body text in units of words, and when the document is written in Japanese, it may compare them in units of characters. The determination unit 28 can identify the language of the document by, for example, referring to the value of the lang attribute that designates the language, or identifying the language of the characters that appear most frequently within the body tag.
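The character-unit and word-unit comparison described above can be sketched as follows. The function names, the use of multisets (`collections.Counter`), and the rule that only the language code `ja` selects character units are assumptions for illustration; the source only states that the unit may be switched by language.

```python
from collections import Counter

# Assumption: languages compared in character units; everything else uses words.
CHAR_UNIT_LANGUAGES = {"ja"}


def choose_unit(lang: str) -> str:
    """Pick 'char' or 'word' from a language code such as 'ja' or 'en-US'."""
    primary = lang.split("-")[0].lower()
    return "char" if primary in CHAR_UNIT_LANGUAGES else "word"


def tokenize(text: str, unit: str) -> list:
    """Split text into words (on whitespace) or into non-blank characters."""
    if unit == "word":
        return text.split()
    return [ch for ch in text if not ch.isspace()]


def compare_texts(recognized: str, body: str, unit: str):
    """Return (missing, unexpected) tokens as multisets.

    missing: tokens in the body text that do not appear in the recognized text.
    unexpected: tokens in the recognized text that do not appear in the body text.
    """
    rec = Counter(tokenize(recognized, unit))
    bod = Counter(tokenize(body, unit))
    return bod - rec, rec - bod


def is_correctly_drawn(recognized: str, body: str, lang: str) -> bool:
    missing, unexpected = compare_texts(recognized, body, choose_unit(lang))
    return not missing and not unexpected
```

Note that a multiset comparison ignores token order, so it detects missing or altered tokens but not reordering; an order-sensitive comparison would be a different design choice.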
  • the determination unit 28 controls the display unit 25 to display the determination result on the screen of the display device 13.
  • FIG. 6 shows a screen indicating that the document was not correctly rendered by the inspection browser. If the determination unit 28 determines that the document is not drawn correctly, the character string “The document was not correctly displayed by the inspection browser” is displayed on the screen as the determination result, as shown in the figure.
  • The determination unit 28 controls the display unit 25 to display a warning at the portion that is not drawn correctly on the screen on which the document is drawn by the inspection browser. Further, so that the user can see at a glance which portion is not drawn correctly, the determination unit 28 controls the display unit 25 to fill that portion, or the portions before and after it, with black.
  • FIG. 7 shows a screen of the display device 13 that displays a warning that the document shown in FIG. 3 is not drawn correctly.
  • an exclamation mark 130a is displayed around the character string “health food / supplement” partially hidden in the banner advertisement.
  • An exclamation mark 130b is displayed in the frame 110b. The user can know the approximate position of the portion that is not drawn correctly by using the exclamation marks 130a and 130b.
  • The character strings “Lime” and “Hell” are displayed in inverted colors, with their backgrounds filled with black. Filling the backgrounds of the two characters before and the two characters after the missing characters “n” and “g” with black indicates that characters are missing between the black-filled portions.
  • the user can easily find out which position of the character is missing by looking at the fill. Also, in the figure, the color of the overlapping character strings in the frame 110b is inverted and displayed, and the portion where the characters overlap is filled with black. Thereby, the user can easily specify the position of the portion where the characters overlap.
  • a translucent exclamation mark may be displayed on the screen so that the user can distinguish characters hidden behind the exclamation mark.
  • the fill color for making it easy to find a defective portion of the drawing may be a color other than black as long as it is a conspicuous color, for example, red, blue, and yellow.
  • By using the position information indicating the position in the image 110 from which each character obtained by character recognition was recognized, the determination unit 28 can estimate the position where a character is missing or where characters overlap. Based on this estimate, the determination unit 28 can display a warning around a character that is not correctly drawn on the screen of the display device 13, or color a portion that is not correctly displayed.
  • In the example described above, missing characters are indicated by filling the two characters before and after the missing characters with black.
  • However, the fill is not limited to two characters; a predetermined number of characters before and after the missing characters may be filled with black.
  • For example, it may be one character before and after, or three characters before and after.
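One way to find which characters are missing or altered, together with a configurable number of neighboring characters to highlight, is sketched below. It uses Python's `difflib.SequenceMatcher` as a stand-in alignment technique; the source does not say how the texts are aligned, and the function name `find_defects` and the `context` parameter are hypothetical. Mapping these text offsets to screen coordinates would additionally require the position information acquired by the recognition unit 26, which is not modeled here.

```python
from difflib import SequenceMatcher


def find_defects(body: str, recognized: str, context: int = 2) -> list:
    """List spans where the recognized text deviates from the body text.

    Each entry records the expected and found substrings plus `context`
    characters of body text before and after the span (the characters that
    would be filled to draw the user's attention).
    """
    matcher = SequenceMatcher(None, body, recognized, autojunk=False)
    defects = []
    for tag, b0, b1, r0, r1 in matcher.get_opcodes():
        if tag == "equal":
            continue
        defects.append(
            {
                "kind": tag,  # 'delete', 'insert', or 'replace'
                "expected": body[b0:b1],
                "found": recognized[r0:r1],
                "before": body[max(0, b0 - context):b0],
                "after": body[b1:b1 + context],
            }
        )
    return defects
```

Calling `find_defects(body, recognized, context=1)` or `context=3` corresponds to highlighting one or three characters on each side instead of two.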
  • FIG. 8 is a flowchart showing the flow of processing executed by the inspection apparatus 1 according to the embodiment of the present invention.
  • This process is started by executing a program on the hardware of the inspection apparatus 1, and the inspection apparatus 1 is realized by this process.
  • The acquisition unit 24 determines whether there is a request for inspecting whether the document is correctly drawn (an inspection request) (step S1). When the acquisition unit 24 receives the inspection request from the reception unit 23, it determines that there is an inspection request (step S1; YES) and acquires the inspection software from the software storage unit 21 (step S2). When the inspection request is not supplied from the reception unit 23, the acquisition unit 24 determines that there is no inspection request (step S1; NO) and repeats the determination until the inspection request is received from the reception unit 23.
  • the acquisition unit 24 activates the inspection software and performs an initialization process (step S3).
  • As described above, the initialization process includes a process of canceling the designation when a specific browser is designated as the inspection browser, and a process of canceling the designation when a specific document is designated as the inspection target.
  • the acquisition unit 24 determines whether there is an instruction indicating the document selected by the user as the inspection target (step S4).
  • the acquisition unit 24 determines that there is an instruction when the instruction indicating the document selected by the user as the inspection target is received from the reception unit 23 (step S4; YES). Then, based on the received instruction, the document selected by the user is acquired from the document storage unit 22 as an inspection target (step S5).
  • the acquisition unit 24 determines that there is no instruction (step S4; NO), and repeats the determination of the presence or absence of the instruction until the instruction is received from the reception unit 23.
  • the acquisition unit 24 determines whether there is an instruction indicating the browser selected by the user as the inspection browser (step S6).
  • The acquisition unit 24 determines that there is an instruction when the instruction indicating the browser selected by the user as the inspection browser is received from the reception unit 23 (step S6; YES), and acquires the selected browser from the software storage unit 21 as the inspection browser (step S7).
  • the acquisition unit 24 determines that there is no instruction when the instruction is not supplied from the reception unit 23 (step S6; NO), and repeats the determination of the presence or absence of the instruction until the instruction is received from the reception unit 23.
  • The acquisition unit 24 starts the inspection browser when it acquires the inspection browser. Then, the document acquired as the inspection target is given to the inspection browser (step S8) and displayed on the screen of the display device 13 (step S9), whereby the image on which the inspection target document is drawn is acquired (step S10).
  • the recognition unit 26 recognizes the image acquired by the acquisition unit 24 (step S11), and acquires the recognized text (step S12).
  • the extraction unit 27 extracts the body text by removing the tag from the document acquired by the acquisition unit 24 (step S13).
  • The determination unit 28 compares the recognized text acquired by the recognition unit 26 with the body text extracted by the extraction unit 27 (step S14), and determines whether the document is correctly rendered by the inspection browser (step S15). When there are characters that appear in the recognized text but not in the body text, the determination unit 28 determines that the document is not drawn correctly by the inspection browser (step S15; NO). Likewise, when there are characters that appear in the body text but not in the recognized text, it determines that the document is not drawn correctly by the inspection browser (step S15; NO).
  • When the determination unit 28 determines that the document is not drawn correctly by the inspection browser, it controls the display unit 25 to display that fact on the screen of the display device 13 (step S16). See FIG. 6 for a specific example. In addition, the determination unit 28 displays a warning at the portion of the screen where the document is not correctly drawn by the inspection browser (step S17). See FIG. 7 for a specific example.
  • In step S15, when the recognized text matches the body text, the determination unit 28 determines that the document has been correctly drawn by the inspection browser (step S15; YES), and controls the display unit 25 to display that fact on the screen of the display device 13 (step S18).
  • After step S17 or S18, the determination unit 28 returns to step S3 and executes the initialization process again.
  • As a result, the designations of the inspection browser and the document to be inspected are canceled, and preparations for starting a new inspection are made.
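The core of the flow in steps S8 to S15 can be summarized as a small Python sketch. The rendering, character recognition, and body-text extraction are passed in as callables because the source does not tie them to any particular browser or recognition engine; `inspect_document`, `InspectionResult`, and all parameter names are hypothetical.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class InspectionResult:
    correctly_drawn: bool
    recognized_text: str
    body_text: str


def inspect_document(
    document: str,
    render: Callable[[str], Any],             # steps S8-S10: browser draws the document
    recognize: Callable[[Any], str],          # steps S11-S12: character recognition
    extract_body_text: Callable[[str], str],  # step S13: e.g. tag removal
) -> InspectionResult:
    """Run one inspection pass and report whether the texts match (S14-S15)."""
    image = render(document)
    recognized_text = recognize(image)
    body_text = extract_body_text(document)
    return InspectionResult(recognized_text == body_text, recognized_text, body_text)
```

Under the same assumptions, a variant that takes its body text from a second browser would pass a callable that renders the document in that browser and applies the same recognition step, instead of one that removes tags.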
  • As described above, the inspection apparatus 1 acquires an image on which a document is drawn by giving the document to an inspection browser and displaying the document on a screen. Then, by comparing the recognized text acquired by character recognition from the acquired image with the body text extracted by removing the tags from the document, it determines whether the document is correctly rendered by the inspection browser. Therefore, according to the inspection apparatus 1 according to the embodiment of the present invention, it is possible to appropriately inspect whether a document including text is correctly rendered regardless of the browser or terminal.
  • In addition, because the inspection apparatus 1 checks whether the document is drawn correctly in place of the user, inspection omissions are suppressed.
  • In the above embodiment, removing the tags from the document to be inspected has been described as an example of the method of extracting the body text.
  • However, the present invention is not limited to this, and the body text may be extracted by other methods.
  • For example, the extraction unit 27 may cause the acquisition unit 24 to give the document to be inspected to a browser other than the inspection browser (hereinafter referred to as a second browser) among the plurality of browsers stored in the software storage unit 21 and to display the document on the screen, thereby acquiring an image on which the document is drawn (hereinafter referred to as a second image).
  • the extraction unit 27 may extract the body text by causing the recognition unit 26 to perform character recognition on the acquired second image.
  • the determination unit 28 may determine whether or not the document is correctly drawn by the inspection browser by comparing the body text extracted by character recognition with the recognized text.
  • the second browser is preferably a browser that guarantees that the document is correctly rendered.
  • FIG. 9 is a diagram showing a second image in which the document shown in FIG. 3 is drawn by the second browser.
  • The acquisition unit 24 can acquire the second image on which the document shown in FIG. 3 is drawn by giving that document to the second browser and displaying it on the screen. Since the second browser is guaranteed to render the document correctly, characters in the second image are neither hidden behind the banner advertisement nor overlapping each other, as shown in FIG. 9.
  • By performing character recognition on the second image, the extraction unit 27 can extract body text that has no defects, like the body text shown in FIG. 5B. The determination unit 28 can then determine whether the document was drawn correctly using this body text.
  • FIG. 10 is a flowchart showing the flow of processing executed by the inspection apparatus 1 when the body text extraction method using the second browser is adopted.
  • the flow of this processing in the case of extracting the body text using the second browser will be described with reference to FIG.
  • the description of the processing already described with reference to FIG. 8 is omitted.
  • In step S12, when the recognition unit 26 acquires the recognized text, the extraction unit 27 causes the acquisition unit 24 to acquire the second browser from the software storage unit 21 (step S19) and to activate the second browser.
  • The extraction unit 27 then causes the acquisition unit 24 to give the document to be inspected to the second browser (step S20) and to display it on the screen of the display device 13 (step S21), thereby acquiring the second image on which the document to be inspected is drawn (step S22).
  • the extraction unit 27 causes the recognition unit 26 to perform character recognition on the second image acquired by the acquisition unit 24 (step S23), thereby extracting the body text (step S13).
  • the processing after step S14 has already been described with reference to FIG.
  • In the above example, the second browser is a browser that is guaranteed to draw the document correctly. However, even without such a guarantee, it is useful to inspect whether the document is drawn correctly using a second browser. If the characters recognized from the inspection browser and from the second browser differ, it may not be possible to conclude that the defect lies in the inspection browser's drawing, but it can be found that the drawing of one or both browsers is defective, so the document creator can recognize that the document needs some modification.
  • In the above embodiment, the determination unit 28 determines that the document is not correctly drawn by the inspection browser if the recognized text and the body text differ by even one character. However, the document may be determined to have been correctly drawn even when there is some difference. For example, the determination unit 28 may set a tolerance indicating the degree of allowable difference, and may determine that the document has been correctly drawn if the difference between the recognized text and the body text is less than the tolerance. For example, the tolerance may be set to allow a difference of up to 5 characters, or a difference of up to 10 characters. The user may set the tolerance in advance, or the determination unit 28 may ask the user to set the tolerance before comparing the recognized text and the body text.
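A tolerance check of this kind might be sketched as follows. The source does not define how the size of the difference is measured; this sketch assumes the Levenshtein edit distance (the number of single-character insertions, deletions, and substitutions) and treats a difference of up to `tolerance` characters as acceptable. The function names are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed row by row with dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(
                min(
                    prev[j] + 1,         # delete ca
                    curr[j - 1] + 1,     # insert cb
                    prev[j - 1] + cost,  # substitute or keep
                )
            )
        prev = curr
    return prev[-1]


def is_within_tolerance(recognized: str, body: str, tolerance: int = 5) -> bool:
    """Assumption: a difference of up to `tolerance` characters is allowed."""
    return edit_distance(recognized, body) <= tolerance
```

With `tolerance=0` this reduces to the strict behavior of the embodiment, in which a single differing character is reported as a drawing defect.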
  • the acquisition unit 24 gives the inspection target document to all browsers stored in the software storage unit 21, and acquires an image in which the inspection target document is drawn for each browser.
  • the recognition unit 26 recognizes characters for each acquired image and acquires text.
  • the determination unit 28 compares all the acquired texts with each other. If the difference between any two texts exceeds the tolerance, the determination unit 28 determines that the document is not drawn correctly, and the difference between the two texts is determined.
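The pairwise comparison across all browsers can be sketched as follows. The dictionary of recognized texts keyed by browser name, the function names, and the use of `difflib.SequenceMatcher` to measure the size of a difference are assumptions for illustration, not details given in the source.

```python
from difflib import SequenceMatcher
from itertools import combinations


def difference_size(a: str, b: str) -> int:
    """Count characters covered by non-matching spans between two texts."""
    ops = SequenceMatcher(None, a, b, autojunk=False).get_opcodes()
    return sum(max(a1 - a0, b1 - b0) for tag, a0, a1, b0, b1 in ops if tag != "equal")


def inconsistent_pairs(texts_by_browser: dict, tolerance: int = 0) -> list:
    """Return the browser pairs whose recognized texts differ beyond the tolerance."""
    return [
        (x, y)
        for x, y in combinations(sorted(texts_by_browser), 2)
        if difference_size(texts_by_browser[x], texts_by_browser[y]) > tolerance
    ]
```

An empty result would mean that every browser produced text within the tolerance of every other browser; any returned pair points to a rendering difference worth examining.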
  • the acquisition unit 24 acquires the image on which the document is drawn by displaying the document on the screen.
  • When the document is instead drawn in the virtual view of the browser, the acquisition unit 24 can acquire an image in which the entire document is drawn from the virtual view. Therefore, the determination unit 28 can determine whether the entire document is drawn correctly even when the entire document does not fit on the screen.
  • a function for displaying the entire document on the screen by automatically scrolling the scroll bar displayed on the screen may be added to the inspection software.
  • the acquisition unit 24 can acquire an image in which the entire document is drawn without a virtual view.
  • The inspection apparatus 1 may acquire various software and documents from an external device that functions as the software storage unit 21 and the document storage unit 22. Further, an external device functioning as the reception unit 23 may receive various requests and instructions from the user, and the inspection apparatus 1 may cause that external device to supply them. Further, the inspection apparatus 1 may obtain an image on which a document is drawn by providing the document to an external device functioning as the display unit 25 and displaying the document on a screen of the external device. Further, the inspection apparatus 1 may display the inspection result on the screen of the external device.
  • not only can the inspection apparatus 1 be provided with the configuration for realizing the functions according to the present invention in advance, but an existing personal computer, information terminal device, or the like can also be made to function as the inspection apparatus 1 according to the present invention by applying a program. That is, by applying a program for realizing each functional configuration of the inspection apparatus 1 exemplified in the above embodiment so that it can be executed by a CPU or the like that controls an existing personal computer or information terminal device, that device can function as the inspection apparatus 1 according to the present invention.
  • the inspection method according to the present invention can be implemented using the inspection apparatus 1.
  • the application method of such a program is arbitrary.
  • the program can be stored on, and applied from, a computer-readable recording medium [CD-ROM (Compact Disc Read-Only Memory), DVD (Digital Versatile Disc), MO (Magneto Optical Disc), etc.]. It is also possible to apply the program by storing it in storage on a network such as the Internet and downloading it.
  • An inspection apparatus includes: an acquisition unit that acquires an image in which a document is rendered by giving the document to a browser and causing the browser to display it on a screen or render it in a virtual view of the browser; a recognition unit that acquires recognized text by performing character recognition on the acquired image; an extraction unit that extracts, from the document, body text that should be rendered; and a determination unit that determines whether the document has been correctly rendered by the browser by comparing the recognized text with the body text.
  • the extraction unit may extract the body text by removing a tag from the document.
  • the determination unit may determine that the document is not correctly rendered in the browser when there is a character that does not appear in the body text among characters that appear in the recognized text.
  • the determination unit may determine that the document is not correctly rendered in the browser when there is a character that does not appear in the recognized text among characters that appear in the body text.
  • the extraction unit may extract the body text by causing the acquisition unit to acquire a second image in which the document is rendered, by giving the document to a second browser and causing the second browser to display it on the screen or render it in a virtual view of the second browser, and by causing the recognition unit to perform character recognition on the acquired second image.
  • An inspection method includes: an acquisition step in which an inspection apparatus acquires an image in which a document is rendered by giving the document to a browser and causing the browser to display it on a screen or render it in a virtual view of the browser; a recognition step in which the inspection apparatus acquires recognized text by performing character recognition on the acquired image; an extraction step in which the inspection apparatus extracts, from the document, body text that should be rendered; and a determination step in which the inspection apparatus determines whether the document has been correctly rendered by the browser by comparing the recognized text with the body text.
  • a computer-readable recording medium records a program that causes a computer to function as: an acquisition unit that acquires an image in which a document is rendered by giving the document to a browser and causing the browser to display it on a screen or render it in a virtual view of the browser; a recognition unit that acquires recognized text by performing character recognition on the acquired image; an extraction unit that extracts, from the document, body text that should be rendered; and a determination unit that determines whether the document has been correctly rendered by the browser by comparing the recognized text with the body text.
  • a program causes a computer to function as: an acquisition unit that acquires an image in which a document is rendered by giving the document to a browser and causing the browser to display it on a screen or render it in a virtual view of the browser; a recognition unit that acquires recognized text by performing character recognition on the acquired image; an extraction unit that extracts, from the document, body text that should be rendered; and a determination unit that determines whether the document has been correctly rendered by the browser by comparing the recognized text with the body text.
  • an inspection apparatus, an inspection method, a recording medium, and a program for appropriately inspecting whether a document including text is correctly rendered by a browser can be provided.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Machine Translation (AREA)
  • Debugging And Monitoring (AREA)

Abstract

An acquisition unit (24) acquires an image in which a document written in HTML is rendered by providing the document to a testing browser and causing the document to be displayed on a screen. A recognition unit (26) acquires recognized text by performing character recognition on the acquired image. An extraction unit (27) extracts the actual text that should be rendered by removing tags from the document. A determination unit (28) determines whether the document is correctly displayed in the testing browser by comparing the recognized text with the actual text.

Description

Inspection apparatus, inspection method, recording medium, and program
The present invention relates to an inspection apparatus, an inspection method, a recording medium, and a program for appropriately inspecting whether a document including text is correctly rendered by a browser.
When a document written in a predetermined markup language such as HTML (Hypertext Markup Language) (hereinafter simply referred to as a document) is given to a web browser (hereinafter simply referred to as a browser), the browser interprets the tags in the document and displays text and images on the screen. However, because different browsers interpret tags differently, a layout in which text and images are displayed neatly by one browser may collapse in another browser. The same problem can also arise between different versions of the same browser or between different terminals. Various methods have therefore been proposed for inspecting whether the layout of text and images has collapsed.
For example, Patent Document 1 discloses a method in which a web page written in HTML is given to two web browsers with different specifications, the resulting pair of rendered images is compared visually, and any difference in the position, size, style, or the like of the elements in each image is detected as an error.
Patent Document 1: JP 2013-77301 A
However, although the inspection methods disclosed in Patent Document 1 and elsewhere can detect layout collapse caused by some tabs being hidden or by an image protruding from the display area of the screen, they cannot detect defects in which characters are displayed overlapping one another or characters are missing. Consequently, for a document including text, it has not been possible to appropriately inspect whether the document has been correctly rendered by the browser.
The present invention has been made to solve the above problem, and its object is to provide an inspection apparatus, an inspection method, a recording medium, and a program for appropriately inspecting whether a document including text is correctly rendered by a browser.
In order to achieve the above object, an inspection apparatus according to the present invention:
acquires an image in which a document is rendered by giving the document to a browser and causing the browser to display it on a screen or render it in a virtual view of the browser;
acquires recognized text by performing character recognition on the acquired image;
extracts, from the document, body text that should be rendered; and
determines whether the document has been correctly rendered by the browser by comparing the recognized text with the body text.
According to the present invention, it is possible to appropriately inspect whether a document including text is correctly rendered by a browser.
A diagram showing the hardware configuration of the inspection apparatus according to an embodiment of the present invention.
A diagram showing the functional configuration of the inspection apparatus according to an embodiment of the present invention.
A diagram showing an example of a document stored in the document storage unit according to an embodiment of the present invention.
A diagram showing an image in which the document is rendered by the inspection browser.
A diagram showing (a) the recognized text and (b) the body text according to an embodiment of the present invention.
A diagram showing a screen indicating that the document has not been correctly rendered by the inspection browser.
A diagram showing a screen displaying a warning that the document has not been correctly rendered.
A flowchart showing the flow of processing executed by the inspection apparatus according to an embodiment of the present invention.
A diagram showing a second image in which the document is rendered by a second browser.
A flowchart showing the flow of processing executed by an inspection apparatus according to a modification of the present invention.
Embodiments of the present invention are described below. The embodiments are given for explanation and do not limit the scope of the present invention. Accordingly, those skilled in the art can adopt embodiments in which any or all of these elements are replaced with equivalents, and such embodiments are also included in the scope of the present invention. In describing the embodiments with reference to the drawings, identical or corresponding parts are denoted by the same reference numerals.
FIG. 1 shows the hardware configuration of the inspection apparatus 1 according to an embodiment of the present invention. As shown in the figure, the inspection apparatus 1 includes a storage device 11, a reception device 12, a display device 13, and a control unit 14.
The storage device 11 includes a recording medium such as a hard disk and stores various software, such as various browsers and inspection software for inspecting whether a document is rendered correctly. The storage device 11 also stores a plurality of documents written in HTML. The documents are not limited to HTML and may be written in another markup language such as XHTML (Extensible Hypertext Markup Language) or XML (Extensible Markup Language). The storage device 11 may also be composed of a non-volatile, non-transitory recording medium other than a hard disk, for example a flash memory, an optical disc, or a magneto-optical disc.
The reception device 12 includes a keyboard, a mouse, a trackpad, and the like. The reception device 12 receives a user instruction through a user operation, generates an input signal indicating the received instruction, and supplies the generated input signal to the control unit 14.
The display device 13 includes a screen such as a liquid crystal display and displays various data supplied from the control unit 14, such as text data, images, and video.
The control unit 14 includes a CPU (Central Processing Unit), a ROM (Read Only Memory), a RAM (Random Access Memory), and the like. It reads a program stored in a non-transitory recording medium such as the ROM into the RAM, which is a transitory recording medium, and the CPU executes the instructions contained in the read program. The control unit 14 also receives the input signal supplied from the reception device 12, and the CPU executes the user instruction indicated by the received input signal.
FIG. 2 shows the functional configuration of the inspection apparatus 1. By reading a program stored in the ROM or the like into the RAM and controlling its execution, the control unit 14 causes the inspection apparatus 1 to function as a software storage unit 21, a document storage unit 22, a reception unit 23, an acquisition unit 24, a display unit 25, a recognition unit 26, an extraction unit 27, and a determination unit 28.
The software storage unit 21 and the document storage unit 22 are built in the storage device 11. The reception unit 23 is realized by the reception device 12 and the control unit 14 working together. The acquisition unit 24, the recognition unit 26, the extraction unit 27, and the determination unit 28 are realized by the control unit 14. The display unit 25 is realized by the display device 13 and the control unit 14 working together.
The software storage unit 21 stores various software, such as various browsers and inspection software for inspecting whether a document is rendered correctly. The browsers include well-known browsers such as Internet Explorer (registered trademark), Mozilla Firefox (registered trademark), Google Chrome (registered trademark), and Opera (registered trademark). The browsers may, however, also include various browsers other than these well-known ones.
The document storage unit 22 stores a plurality of documents written in HTML. As noted above, the documents may be written in a markup language other than HTML.
The reception unit 23 receives various requests and instructions from the user through the user's operation of the reception device 12. For example, it receives a request to inspect whether a document is correctly rendered (hereinafter referred to as an inspection request). It also receives an instruction indicating the document that the user has selected as the inspection target, and an instruction indicating the browser that the user has selected as the inspection browser. It then supplies the received requests and instructions to the acquisition unit 24. The document is selected from among the plurality of documents stored in the document storage unit 22, and the browser is selected from among the plurality of browsers stored in the software storage unit 21.
On receiving an inspection request from the reception unit 23, the acquisition unit 24 acquires the inspection software from the software storage unit 21, starts it, and performs initialization. Initialization includes, for example, clearing the designation of any specific browser as the inspection browser and clearing the designation of any specific document as the inspection target.
On receiving from the reception unit 23 an instruction indicating the document that the user has selected as the inspection target, the acquisition unit 24 acquires the selected document from the document storage unit 22 as the inspection target based on the instruction.
On receiving from the reception unit 23 an instruction indicating the browser that the user has selected as the inspection browser, the acquisition unit 24 acquires the selected browser from the software storage unit 21 as the inspection browser and starts it.
The acquisition unit 24 then gives the document acquired as the inspection target to the inspection browser and causes it to be displayed on the screen of the display device 13, thereby acquiring an image in which the inspection target document is rendered.
FIG. 3 shows an example of a document stored in the document storage unit 22. As shown in the figure, the document is written in HTML. The acquisition unit 24 gives the document shown in the figure to the inspection browser and causes it to be displayed on the screen of the display device 13, thereby acquiring an image in which the document is rendered.
FIG. 4 shows an image 110 in which the document shown in FIG. 3 is rendered by the inspection browser. In the image 110, the text contained in the document shown in FIG. 3 is drawn at the positions specified by the tags. Note that, in this specification, an image in which a document is rendered does not mean an image in which the HTML source code itself is drawn.
Two things are noteworthy about the image 110 shown in FIG. 4. First, in the frame 110a on the left side of the image 110, the character string "健康食品・サプリメント" ("health food / supplements") should be displayed, but part of the string protrudes from the frame 110a, so that the characters "ン" and "ト" are hidden behind a banner advertisement. Second, in the frame 110b at the lower right of the image 110, part of the character string "まとめ買いに最適・・・大活躍!" overlaps part of the character string "また、職場・学校・お店・レジャー等・・・使えますよ。".
Under the control of the acquisition unit 24, the display unit 25 displays the document given to the inspection browser on the screen of the display device 13. Under the control of the determination unit 28, it also displays on the screen of the display device 13 a determination result indicating whether the document has been correctly rendered by the inspection browser.
The recognition unit 26 performs character recognition on the image acquired by the acquisition unit 24 and thereby acquires the text drawn in the image as recognized text. For example, the recognition unit 26 uses morphological analysis to divide the acquired image into images each representing one character, and extracts from each divided image the features used for character recognition (a vector representing the features). It then performs character recognition by comparing the extracted features with character features stored in advance in the storage device 11. Features used for character recognition include, for example, directional line element features based on character outlines. The recognition unit 26 also acquires position information indicating the position in the image from which each recognized character was recognized. When a defect is found in the rendering of characters, this position information is used to display a warning of the defect at or near the position of the defect on the screen of the display device 13.
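The comparison of extracted features with stored character features can be pictured as a nearest-neighbour lookup. The following is a minimal sketch and not the recognizer described here: the function name `recognize_character`, the three-element vectors, and the choice of Euclidean distance are all assumptions made for illustration, and real directional line element features have many more dimensions.

```python
import math


def recognize_character(feature, templates):
    """Return the stored character whose feature vector is closest to `feature`.

    `templates` maps each known character to its stored feature vector.
    Euclidean distance is an assumption; the source does not name a metric.
    """
    best_char, best_distance = None, math.inf
    for char, reference in templates.items():
        distance = math.dist(feature, reference)
        if distance < best_distance:
            best_char, best_distance = char, distance
    return best_char, best_distance


# Hypothetical stored features for two characters.
TEMPLATES = {
    "ン": (1.0, 0.0, 0.2),
    "ト": (0.0, 1.0, 0.8),
}
```

Overlapping glyphs yield a feature vector that is not close to any intended character, so a lookup of this kind returns whichever stored shape happens to be nearest, which is consistent with overlapped text being recognized as unrelated symbols.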
The extraction unit 27 extracts the text contained in the document as body text by removing the tags from the document acquired by the acquisition unit 24. The body text is the text that should be rendered from the document; when the document is correctly rendered by the inspection browser, it matches the recognized text obtained by character recognition.
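Tag removal of this kind can be sketched with the HTML parser in the Python standard library. This is an illustration under stated assumptions rather than the extraction unit's actual implementation: the names `BodyTextExtractor` and `extract_body_text` are hypothetical, and skipping the contents of `head`, `script`, and `style` is an added assumption, since the source only says that tags are removed.

```python
from html.parser import HTMLParser


class BodyTextExtractor(HTMLParser):
    """Collects character data while dropping all tags."""

    # Assumption: the contents of these elements are never drawn as body text.
    SKIPPED = {"head", "script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text found outside skipped elements.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_body_text(html: str) -> str:
    """Return the text of `html` with tags removed, one text run per line."""
    parser = BodyTextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)
```

Character references such as `&amp;` are decoded by the parser, so the extracted text contains the characters a browser would draw rather than the escaped source form.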
FIG. 5(a) shows the recognized text obtained by character recognition of the image 110 shown in FIG. 4, and FIG. 5(b) shows the body text extracted from the document 100 shown in FIG. 3. In the dashed box 120a in the recognized text, part of the character string "健康食品・サプリメント", namely the characters "ン" and "ト", is missing. The characters are missing because part of the string was hidden behind the banner advertisement in the image 110 shown in FIG. 4, so the recognition unit 26 could not recognize "ン" and "ト". In other words, when characters are missing in the inspection browser, that defect is reflected in the recognized text.
The symbol string "○×△☆♯♭●□▲★※" in the dashed box 120b indicates that the overlapping character strings in the image 110 were misrecognized as a different string. Because no stored character matches an overlapped character, each overlapped character is recognized as a different character or symbol with a similar shape. Overlapping strings therefore appear in the recognized text converted into other strings. In this way, the defect of characters being displayed on top of one another is also reflected in the recognized text.
By contrast, the body text shown in FIG. 5(b) is extracted by removing the tags from the document and therefore contains no defects such as missing characters. Comparing the recognized text with the body text thus shows that the document has been rendered correctly if the two texts do not differ, and that it has not been rendered correctly if they do.
The determination unit 28 compares the recognized text acquired by the recognition unit 26 with the body text extracted by the extraction unit 27 and determines whether the document has been correctly rendered by the inspection browser. If the recognized text contains a character that does not appear in the body text, the determination unit 28 determines that the document has not been correctly rendered by the inspection browser. It makes the same determination if the body text contains a character that does not appear in the recognized text. If the recognized text and the body text match, it determines that the document has been correctly rendered by the inspection browser.
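The two-way check described above can be sketched as follows. This is a minimal illustration, not the determination unit's actual logic: the function name `judge_rendering` is hypothetical, and counting occurrences with `Counter` and ignoring whitespace are assumptions that go slightly beyond the presence check in the text.

```python
from collections import Counter


def judge_rendering(recognized: str, body: str) -> dict:
    """Compare recognized text with body text, ignoring whitespace (assumption)."""
    rec = Counter(ch for ch in recognized if not ch.isspace())
    ref = Counter(ch for ch in body if not ch.isspace())
    unexpected = rec - ref  # recognized characters not accounted for by the body text
    missing = ref - rec     # body-text characters that were not recognized
    return {
        "correct": not unexpected and not missing,
        "unexpected": dict(unexpected),
        "missing": dict(missing),
    }
```

With the recognized text "健康食品・サプリメ" and the body text "健康食品・サプリメント", the result reports "ン" and "ト" as missing and marks the rendering as not correct.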
The determination unit 28 may compare the recognized text and the body text character by character, or word by word (a word being a character string delimited by whitespace or symbols). The determination unit 28 may also switch between character-level and word-level comparison depending on the language in which the document is written. For example, it may compare the two texts word by word when the document is written in English and character by character when the document is written in Japanese. The determination unit 28 can identify the language of the document by, for example, referring to the value of the language-designating lang tag or identifying the language of the characters that appear most frequently within the body tag.
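Switching the comparison unit by language can be sketched as below. The sketch rests on several assumptions: the "lang tag" is read as the `lang` attribute of the `html` element, the function names are hypothetical, the default of `"ja"` when no language is declared is arbitrary, and the alternative of inferring the language from the characters in the body is not shown.

```python
import re


def detect_language(html: str, default: str = "ja") -> str:
    """Return the declared language of the document, lowercased, or `default`."""
    match = re.search(r'<html\b[^>]*\blang\s*=\s*["\']?([A-Za-z-]+)', html, re.IGNORECASE)
    return match.group(1).lower() if match else default


def tokenize(text: str, language: str) -> list:
    """Split text into words for English and into single characters otherwise."""
    if language.startswith("en"):
        # Words are runs of word characters delimited by whitespace or symbols.
        return re.findall(r"\w+", text)
    return [ch for ch in text if not ch.isspace()]


def comparison_units(html: str, text: str) -> list:
    """Tokenize `text` at the granularity suited to the language declared in `html`."""
    return tokenize(text, detect_language(html))
```

Both the recognized text and the body text would be passed through the same tokenizer before comparison, so that a single misrecognized letter in an English document invalidates one word rather than one character.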
The determination unit 28 also controls the display unit 25 to display the determination result on the screen of the display device 13.
FIG. 6 shows a screen indicating that the document has not been correctly rendered by the inspection browser. When the determination unit 28 determines that the document has not been correctly rendered, the character string "文書は検査用ブラウザにより正しく表示されませんでした。" ("The document was not displayed correctly by the inspection browser.") is displayed on the screen as the determination result, as shown in the figure.
The determination unit 28 also controls the display unit 25 to display a warning that the document has not been correctly rendered at the incorrectly rendered portion of the screen on which the document is rendered by the inspection browser. Furthermore, so that the user can see at a glance where the incorrectly rendered portion is, the determination unit 28 controls the display unit 25 to fill the incorrectly rendered portion, or the portions before and after it, with black. A specific example is described below with reference to FIG. 7.
FIG. 7 shows the screen of the display device 13 displaying a warning that the document shown in FIG. 3 has not been correctly rendered. On this screen, an exclamation mark 130a is displayed near the character string "健康食品・サプリメント", which is partially hidden by the banner advertisement, and an exclamation mark 130b is displayed inside the frame 110b. The exclamation marks 130a and 130b tell the user the approximate positions of the incorrectly rendered portions. In FIG. 7, the character strings "リメ" and "ヘル" are also displayed in inverted colors with their backgrounds filled with black. Filling the backgrounds of the two characters before and the two characters after "ン" and "ト" with black indicates that the characters between the filled portions are missing, so the user can easily find the position of the missing characters by looking at the fill. Likewise, the overlapping character strings in the frame 110b are displayed in inverted colors and the portion where the characters overlap is filled with black, so the user can easily identify where the characters overlap.
A translucent exclamation mark may be displayed on the screen so that the user can still read the characters hidden behind it. The fill color used to make drawing defects easy to find may be any conspicuous color other than black, for example red, blue, or yellow. The determination unit 28 can estimate the positions where characters are missing or overlap by using position information indicating where in the image 110 each character obtained by character recognition was recognized. Based on this estimate, the determination unit 28 can display a warning near the incorrectly drawn characters on the screen of the display device 13, or color the incorrectly displayed portions. Also, as shown in FIG. 7, when characters protrude from a frame, they may overlap the frame border. In this case, the characters overlapping the frame border are not recognized as the correct characters. This property can be used to warn that a layout problem has occurred. Accordingly, the present invention can issue a warning not only when characters are missing or overlap one another, but also when part of a character protrudes over any kind of border line. In the above description, a missing character is indicated by filling the two characters before and after it with black; however, the number of filled characters is not limited to two and may be any predetermined number, for example one or three characters on each side. A missing character may also be indicated by filling only the character immediately before it, or only the character immediately after it, with black.
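The idea of marking a predetermined number of characters on each side of a missing character can be sketched at the text level as follows. This is only an illustration under stated assumptions: the description relies on position information from character recognition, whereas this sketch aligns the two strings with Python's `difflib.SequenceMatcher` and returns character index ranges. The function name `highlight_ranges` and the `context` parameter are hypothetical.

```python
from difflib import SequenceMatcher


def highlight_ranges(body_text, recognized_text, context=2):
    """Return (missing, before_range, after_range) for each gap.

    before_range and after_range are (start, end) index pairs into
    body_text covering up to `context` characters on each side of the
    characters that were expected but not recognized.
    """
    matcher = SequenceMatcher(None, body_text, recognized_text, autojunk=False)
    results = []
    for op, i1, i2, _j1, _j2 in matcher.get_opcodes():
        # "delete" and "replace" both mean body characters were not recognized.
        if op in ("delete", "replace"):
            before = (max(0, i1 - context), i1)
            after = (i2, min(len(body_text), i2 + context))
            results.append((body_text[i1:i2], before, after))
    return results
```

With `context=1` or `context=3` the same function covers the one-character and three-character variants mentioned above; mapping the index ranges back to screen coordinates would still require the recognition position information.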
FIG. 8 is a flowchart showing the flow of processing executed by the inspection apparatus 1 according to the embodiment of the present invention. The following description refers to this figure. This processing is started by executing a program on the hardware of the inspection apparatus 1, and the inspection apparatus 1 is realized by this processing.
When this processing starts, the acquisition unit 24 determines whether there is a request to inspect whether a document is drawn correctly (an inspection request) (step S1). When the acquisition unit 24 receives an inspection request from the reception unit 23, it determines that there is an inspection request (step S1; YES) and acquires the inspection software from the software storage unit 21 (step S2). When no inspection request is supplied from the reception unit 23, the acquisition unit 24 determines that there is no inspection request (step S1; NO) and repeats the determination until it receives an inspection request from the reception unit 23.
Having acquired the inspection software, the acquisition unit 24 starts it and performs initialization processing (step S3). As described above, the initialization processing includes canceling the designation of a specific browser as the inspection browser, if there is one, and canceling the designation of a specific document as the inspection target, if there is one.
Next, the acquisition unit 24 determines whether there is an instruction indicating the document that the user has selected as the inspection target (step S4). When the acquisition unit 24 receives such an instruction from the reception unit 23, it determines that there is an instruction (step S4; YES) and, based on the received instruction, acquires the document selected by the user from the document storage unit 22 as the inspection target (step S5). When no instruction is supplied from the reception unit 23, the acquisition unit 24 determines that there is no instruction (step S4; NO) and repeats the determination until it receives an instruction from the reception unit 23.
Next, the acquisition unit 24 determines whether there is an instruction indicating the browser that the user has selected as the inspection browser (step S6). When the acquisition unit 24 receives such an instruction from the reception unit 23, it determines that there is an instruction (step S6; YES) and acquires the selected browser from the software storage unit 21 as the inspection browser (step S7). When no instruction is supplied from the reception unit 23, the acquisition unit 24 determines that there is no instruction (step S6; NO) and repeats the determination until it receives an instruction from the reception unit 23.
Having acquired the inspection browser, the acquisition unit 24 starts it. The acquisition unit 24 then gives the document acquired as the inspection target to the inspection browser (step S8) and causes it to be displayed on the screen of the display device 13 (step S9), thereby acquiring an image in which the inspection target document is drawn (step S10).
The recognition unit 26 performs character recognition on the image acquired by the acquisition unit 24 (step S11) and thereby acquires the recognized text (step S12). Meanwhile, the extraction unit 27 extracts the body text by removing tags from the document acquired by the acquisition unit 24 (step S13).
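As a rough illustration of step S13, tag removal can be sketched with Python's standard `html.parser` module. The description only says that tags are removed; skipping the contents of `script`, `style`, `head`, and `title` elements and discarding all whitespace are assumptions made here so that the result is comparable with text obtained by character recognition. The names `BodyTextExtractor` and `extract_body_text` are hypothetical.

```python
from html.parser import HTMLParser


class BodyTextExtractor(HTMLParser):
    """Collects character data, ignoring elements that are never drawn as text."""

    # Assumption: contents of these elements are not part of the body text.
    SKIPPED = {"script", "style", "head", "title"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self._chunks.append(data)

    def text(self):
        # Assumption: whitespace is dropped before the comparison.
        return "".join("".join(self._chunks).split())


def extract_body_text(document):
    """Return the text that remains after the markup is removed."""
    parser = BodyTextExtractor()
    parser.feed(document)
    parser.close()
    return parser.text()
```

Dropping whitespace suits text without word spacing, such as Japanese; for space-delimited languages a different normalization would probably be preferable.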
The determination unit 28 then compares the recognized text acquired by the recognition unit 26 with the body text extracted by the extraction unit 27 (step S14) and determines whether the document has been drawn correctly by the inspection browser (step S15). If the characters appearing in the recognized text include a character that does not appear in the body text, the determination unit 28 determines that the document has not been drawn correctly by the inspection browser (step S15; NO). Likewise, if the characters appearing in the body text include a character that does not appear in the recognized text, it determines that the document has not been drawn correctly by the inspection browser (step S15; NO).
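The two-way check in step S15 can be sketched as below. As an interpretation rather than something stated in the description, the sketch counts occurrences with `collections.Counter`, so that a character appearing fewer times than expected is also reported. The function name `compare_texts` is hypothetical.

```python
from collections import Counter


def compare_texts(recognized_text, body_text):
    """Return (is_correct, missing, unexpected).

    missing:    characters in the body text that were not recognized
    unexpected: characters that were recognized but are not in the body text
    """
    recognized = Counter(recognized_text)
    expected = Counter(body_text)
    missing = expected - recognized      # keeps only positive counts
    unexpected = recognized - expected
    is_correct = not missing and not unexpected
    return is_correct, dict(missing), dict(unexpected)
```

Because the comparison ignores character order, it only detects missing or surplus characters; locating them on the screen needs an alignment or the position information from character recognition.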
When the determination unit 28 determines that the document has not been drawn correctly by the inspection browser, it controls the display unit 25 to display a message to that effect on the screen of the display device 13 (step S16); see FIG. 6 for a specific example. The determination unit 28 also displays a warning that the document is not drawn correctly at the portion of the screen where the inspection browser has not drawn the document correctly (step S17); see FIG. 7 for a specific example.
In step S15, when the recognized text matches the body text, the determination unit 28 determines that the document has been drawn correctly by the inspection browser (step S15; YES) and controls the display unit 25 to display a message to that effect on the screen of the display device 13 (step S18).
After executing step S17 or S18, the determination unit 28 returns to step S3 and executes the initialization processing again. This cancels the designations of the inspection browser and the inspection target document, so that a new inspection can be started.
As described above, the inspection apparatus 1 according to the embodiment of the present invention acquires an image in which a document is drawn by giving the document to the inspection browser and causing it to be displayed on the screen. It then determines whether the document has been drawn correctly by the inspection browser by comparing the recognized text, obtained from the acquired image by character recognition, with the body text, extracted by removing tags from the document. The inspection apparatus 1 can therefore appropriately inspect whether a document including text is drawn correctly regardless of the browser or terminal.
When a user checks manually whether a document is drawn correctly, defects are easily overlooked; missing characters in particular tend to go unnoticed. Having the inspection apparatus 1 perform the check in place of the user reduces such oversights.
(Modification)
An embodiment of the present invention has been described above, but it is only an example, and the scope of application of the present invention is not limited to it. That is, the embodiment can be applied in various ways, and all such embodiments are included in the scope of the present invention.
In the above embodiment, extracting the body text from the inspection target document itself was described as one example of how to obtain the body text. The present invention is not limited to this, and the body text may be extracted by other methods.
For example, the extraction unit 27 may cause the acquisition unit 24 to give the inspection target document to a browser other than the inspection browser among the plurality of browsers stored in the software storage unit 21 (hereinafter, the second browser) and display it on the screen, thereby acquiring an image in which the document is drawn (hereinafter, the second image). The extraction unit 27 may then extract the body text by causing the recognition unit 26 to perform character recognition on the acquired second image. The determination unit 28 may determine whether the document has been drawn correctly by the inspection browser by comparing the body text extracted by character recognition with the recognized text. The second browser is preferably a browser that is guaranteed to draw the document correctly, because a mismatch between the recognized text and the body text then establishes that the drawing by the inspection browser is defective. The flow of processing for determining whether the document is drawn correctly when such a guaranteed second browser is used is described below with reference to FIGS. 9 and 10.
FIG. 9 shows the second image in which the document shown in FIG. 3 is drawn by the second browser. The acquisition unit 24 can acquire the second image by giving the document shown in FIG. 3 to the second browser and causing it to be displayed on the screen. Because the second browser is guaranteed to draw the document correctly, characters are never hidden by the banner advertisement or displayed overlapping one another in the second image, as FIG. 9 shows. Since the second image has no drawing defects, the extraction unit 27 can, by performing character recognition on the second image, extract body text that is free of defects, like the body text shown in FIG. 5(b). The determination unit 28 can then use this body text to determine whether the document has been drawn correctly.
FIG. 10 is a flowchart showing the flow of processing executed by the inspection apparatus 1 when the body text extraction method using the second browser is adopted. The flow of this processing when the body text is extracted using the second browser is described below with reference to this figure. Processing already described with reference to FIG. 8 is not described again.
When the recognition unit 26 acquires the recognized text in step S12, the extraction unit 27 causes the acquisition unit 24 to acquire the second browser from the software storage unit 21 (step S19) and start it. Next, the extraction unit 27 causes the acquisition unit 24 to give the inspection target document to the second browser (step S20) and display it on the screen of the display device 13 (step S21), thereby acquiring the second image in which the inspection target document is drawn (step S22).
Next, the extraction unit 27 causes the recognition unit 26 to perform character recognition on the second image acquired by the acquisition unit 24 (step S23), thereby extracting the body text (step S13). The processing from step S14 onward has already been described with reference to FIG. 8 and is not repeated here.
As described above, the second browser is preferably a browser that is guaranteed to draw the document correctly, but inspecting the drawing with a second browser is useful even without such a guarantee. If the characters recognized for the inspection browser and for the second browser differ, it cannot be concluded that the inspection browser's drawing is defective, but it is clear that the drawing by one or both browsers is defective, so the document author can recognize that the document needs some correction.
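The second-browser variant, including the difference between a guaranteed and a non-guaranteed second browser, might be sketched as follows. Rendering and character recognition are passed in as callables because the description does not name a concrete browser-automation or OCR interface; `render`, `recognize`, and the function name `inspect_with_second_browser` are all hypothetical.

```python
def inspect_with_second_browser(document, render, recognize,
                                inspection_browser, second_browser,
                                second_is_guaranteed):
    """Compare the text recognized from two browsers' drawings of a document.

    render(document, browser) -> image    (hypothetical callable)
    recognize(image) -> text              (hypothetical callable)
    """
    # Recognized text from the inspection browser (steps S8 to S12).
    recognized_text = recognize(render(document, inspection_browser))
    # Body text obtained through the second browser (steps S19 to S23).
    body_text = recognize(render(document, second_browser))

    if recognized_text == body_text:
        return "drawn correctly"
    if second_is_guaranteed:
        # A trusted reference pins the defect on the inspection browser.
        return "inspection browser drawing is defective"
    # Without the guarantee, only the existence of a defect is known.
    return "one or both browsers draw the document defectively"
```

Passing the two steps as parameters keeps the sketch independent of any particular tool and lets it be exercised with simple stand-in functions.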
In the above embodiment, the determination unit 28 determines that the document has not been drawn correctly by the inspection browser if the recognized text and the body text differ by even one character; however, it may determine that the document has been drawn correctly if the difference is minor. For example, the determination unit 28 may set a tolerance indicating the acceptable degree of difference and determine that the document has been drawn correctly if the difference between the recognized text and the body text is at or below the tolerance. The tolerance may be set, for example, to allow a difference of up to 5 characters, or of up to 10 characters. The user may set the tolerance in advance, or the determination unit 28 may ask the user to set it before comparing the recognized text with the body text. The determination unit 28 may also display, together with the determination result, the degree of conformity of the recognized text to the body text on the screen of the display device 13. For example, when the body text has 100 characters and the body text and the recognized text differ by 5 characters, "conformity = 95%" is displayed. The determination unit 28 may further display a detailed report on the screen of the display device 13 so that the user can grasp the details of the difference. The detailed report may include messages such as "Of all the characters appearing in the document, 3 characters are not displayed correctly on the screen." or "The 50th character 'あ' and the 60th character 'い' of the document are not displayed on the screen."
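The tolerance and the degree of conformity can be illustrated with a short sketch. How the number of differing characters is counted is not specified in the description; here it is assumed to be the number of characters left unmatched by a `difflib.SequenceMatcher` alignment, and the function names `count_differences` and `judge` are hypothetical.

```python
from difflib import SequenceMatcher


def count_differences(body_text, recognized_text):
    """Number of characters that the alignment could not match (assumed metric)."""
    matcher = SequenceMatcher(None, body_text, recognized_text, autojunk=False)
    matched = sum(block.size for block in matcher.get_matching_blocks())
    return max(len(body_text), len(recognized_text)) - matched


def judge(body_text, recognized_text, tolerance=0):
    """Return (is_correct, conformity_percent)."""
    diff = count_differences(body_text, recognized_text)
    if body_text:
        conformity = max(0.0, 100.0 * (len(body_text) - diff) / len(body_text))
    else:
        conformity = 100.0 if not recognized_text else 0.0
    return diff <= tolerance, conformity
```

With a 100-character body text and 5 unrecognized characters, `judge` reports a conformity of 95%, matching the example above; the document is accepted with `tolerance=5` and rejected with the default `tolerance=0`.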
The description so far has used one of the browsers stored in the software storage unit 21 as the inspection browser, but all browsers stored in the software storage unit 21 may be used as inspection browsers. In this case, the acquisition unit 24 gives the inspection target document to every browser stored in the software storage unit 21 and acquires, for each browser, an image in which the document is drawn. The recognition unit 26 then performs character recognition on each acquired image to acquire text. The determination unit 28 compares all the acquired texts with one another: if the difference between any two texts exceeds the tolerance, it determines that the document has not been drawn correctly, and if the difference between every pair of texts is at or below the tolerance, it determines that the document has been drawn correctly. The document author can thus confirm that the document is drawn with substantially the same content regardless of the type of browser or terminal, without checking each browser or terminal one by one. This reduces the workload of adjusting the document.
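The pairwise comparison across all browsers can be sketched as follows. The input is assumed to be a mapping from a browser name to the text recognized from that browser's drawing; the difference metric (characters left unmatched by a `difflib.SequenceMatcher` alignment) and the names `difference` and `check_all_browsers` are assumptions made for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations


def difference(text_a, text_b):
    """Characters left unmatched by the alignment (assumed metric)."""
    matcher = SequenceMatcher(None, text_a, text_b, autojunk=False)
    matched = sum(block.size for block in matcher.get_matching_blocks())
    return max(len(text_a), len(text_b)) - matched


def check_all_browsers(texts_by_browser, tolerance=0):
    """Return (is_correct, offending_pairs) for a {browser: text} mapping."""
    offending_pairs = [
        (first, second)
        for first, second in combinations(sorted(texts_by_browser), 2)
        if difference(texts_by_browser[first], texts_by_browser[second]) > tolerance
    ]
    # Correct only if every pair of texts is within the tolerance.
    return not offending_pairs, offending_pairs
```

Returning the offending pairs, rather than only a yes/no result, gives the document author a hint as to which browsers disagree.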
In the above embodiment, the acquisition unit 24 acquires the image in which the document is drawn by causing the document to be displayed on the screen. In this case, only the text displayed on the screen is subject to the determination, so the correctness of the drawing may not be determined for the entire document. To address this, a function may be added to the inspection software that, when the document is provided to the inspection browser, starts a virtual view of the inspection browser and causes the entire document to be drawn in the virtual view. With this function, the acquisition unit 24 can acquire an image in which the entire document is drawn from the virtual view. The determination unit 28 can therefore determine whether the entire document is drawn correctly even when the entire document does not fit on the screen.
Alternatively, a function may be added to the inspection software that, when the entire document does not fit on the screen, automatically scrolls the scroll bar displayed on the screen so that the entire document is displayed. With this function, the acquisition unit 24 can acquire an image in which the entire document is drawn even without a virtual view.
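One small part of such automatic scrolling, deciding where to scroll so that successive screen captures cover the whole document, can be sketched as below. The function name `scroll_offsets` and the pixel-based parameters are hypothetical, and how the browser is actually scrolled and captured is left open.

```python
def scroll_offsets(document_height, viewport_height):
    """Vertical scroll positions whose viewports together cover the document."""
    if document_height <= 0 or viewport_height <= 0:
        raise ValueError("heights must be positive")
    if document_height <= viewport_height:
        return [0]  # the whole document already fits on the screen
    last = document_height - viewport_height
    offsets = list(range(0, last, viewport_height))
    offsets.append(last)  # final capture is aligned with the document's end
    return offsets
```

The final capture may overlap the previous one, so text in the overlapping band would have to be de-duplicated before the comparison.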
Part of the functional configuration shown in FIG. 2 may be implemented in an external device that can communicate with the inspection apparatus 1 via a network. For example, the inspection apparatus 1 may acquire software and documents from an external device that functions as the software storage unit 21 and the document storage unit 22. An external device functioning as the reception unit 23 may receive requests and instructions from the user, and the inspection apparatus 1 may have the external device supply those requests and instructions. The inspection apparatus 1 may also acquire an image in which a document is drawn by providing the document to an external device functioning as the display unit 25 and causing it to be displayed on the screen of that device. Furthermore, the inspection apparatus 1 may display the inspection result on the screen of the external device.
The inspection apparatus 1 can be provided with the configuration for realizing the functions according to the present invention built in beforehand. In addition, an existing personal computer, information terminal device, or the like can be made to function as the inspection apparatus 1 according to the present invention by applying a program. That is, by applying a program for realizing each functional configuration of the inspection apparatus 1 exemplified in the above embodiment so that it can be executed by a CPU or the like that controls an existing personal computer, information terminal device, or the like, that device can be made to function as the inspection apparatus 1 according to the present invention. The inspection method according to the present invention can be carried out using the inspection apparatus 1.
Such a program may be applied in any manner. For example, the program can be stored on and applied from a computer-readable recording medium such as a CD-ROM (Compact Disc Read-Only Memory), a DVD (Digital Versatile Disc), or an MO (Magneto Optical disc), or it can be stored in storage on a network such as the Internet and applied by being downloaded.
Various embodiments and modifications of the present invention are possible without departing from its broad spirit and scope. The embodiment described above is intended to explain the present invention and does not limit its scope. That is, the scope of the present invention is indicated by the claims, not by the embodiment. Various modifications made within the scope of the claims, and within the meaning of inventions equivalent to them, are regarded as falling within the scope of the present invention.
(Summary)
A summary of the present invention is given below.
An inspection apparatus according to one aspect of the present invention comprises:
an acquisition unit that acquires an image in which a document is drawn by giving the document to a browser and causing the document to be displayed on a screen or drawn in a virtual view of the browser;
a recognition unit that acquires recognized text by performing character recognition on the acquired image;
an extraction unit that extracts, from the document, body text to be drawn; and
a determination unit that determines whether the document has been correctly drawn by the browser by comparing the recognized text with the body text.
The extraction unit may extract the body text by removing tags from the document.
The determination unit may determine that the document has not been correctly drawn by the browser when the characters appearing in the recognized text include a character that does not appear in the body text.
The determination unit may determine that the document has not been correctly drawn by the browser when the characters appearing in the body text include a character that does not appear in the recognized text.
The extraction unit may extract the body text by causing the acquisition unit to give the document to a second browser and cause the document to be displayed on a screen or drawn in a virtual view of the second browser, thereby acquiring a second image in which the document is drawn, and by causing the recognition unit to perform character recognition on the acquired second image.
An inspection method according to one aspect of the present invention comprises:
an acquisition step in which an inspection apparatus acquires an image in which a document is drawn by giving the document to a browser and causing the document to be displayed on a screen or drawn in a virtual view of the browser;
a recognition step in which the inspection apparatus acquires recognized text by performing character recognition on the acquired image;
an extraction step in which the inspection apparatus extracts, from the document, body text to be drawn; and
a determination step in which the inspection apparatus determines whether the document has been correctly drawn by the browser by comparing the recognized text with the body text.
A computer-readable recording medium according to one aspect of the present invention records a program that causes a computer to function as:
an acquisition unit that acquires an image in which a document is drawn by giving the document to a browser and causing the document to be displayed on a screen or drawn in a virtual view of the browser;
a recognition unit that acquires recognized text by performing character recognition on the acquired image;
an extraction unit that extracts, from the document, body text to be drawn; and
a determination unit that determines whether the document has been correctly drawn by the browser by comparing the recognized text with the body text.
A program according to one aspect of the present invention causes a computer to function as:
an acquisition unit that acquires an image in which a document is drawn by giving the document to a browser and causing the document to be displayed on a screen or drawn in a virtual view of the browser;
a recognition unit that acquires recognized text by performing character recognition on the acquired image;
an extraction unit that extracts, from the document, body text to be drawn; and
a determination unit that determines whether the document has been correctly drawn by the browser by comparing the recognized text with the body text.
According to the present invention, it is possible to provide an inspection apparatus, an inspection method, a recording medium, and a program that appropriately inspect whether a document including text is correctly rendered by a browser.
DESCRIPTION OF SYMBOLS
1   Inspection apparatus
11  Storage device
12  Reception device
13  Display device
14  Control unit
21  Software storage unit
22  Document storage unit
23  Reception unit
24  Acquisition unit
25  Display unit
26  Recognition unit
27  Extraction unit
28  Determination unit
100 Document
110 Image

Claims (8)

  1.  An inspection apparatus comprising:
     an acquisition unit that acquires an image in which a document is rendered, by giving the document to a browser and causing the browser to display it on a screen or render it in a virtual view of the browser;
     a recognition unit that acquires recognized text by performing character recognition on the acquired image;
     an extraction unit that extracts, from the document, the body text to be rendered; and
     a determination unit that determines whether the document has been correctly rendered by the browser, by comparing the recognized text with the body text.
  2.  The inspection apparatus according to claim 1, wherein the extraction unit extracts the body text by removing tags from the document.
  3.  The inspection apparatus according to claim 1, wherein the determination unit determines that the document has not been correctly rendered by the browser when the characters appearing in the recognized text include a character that does not appear in the body text.
  4.  The inspection apparatus according to claim 1, wherein the determination unit determines that the document has not been correctly rendered by the browser when the characters appearing in the body text include a character that does not appear in the recognized text.
  5.  The inspection apparatus according to claim 1, wherein the extraction unit extracts the body text by causing the acquisition unit to acquire a second image in which the document is rendered, by giving the document to a second browser and causing the second browser to display it on a screen or render it in a virtual view of the second browser, and by causing the recognition unit to perform character recognition on the acquired second image.
  6.  An inspection method comprising:
     an acquisition step in which an inspection apparatus acquires an image in which a document is rendered, by giving the document to a browser and causing the browser to display it on a screen or render it in a virtual view of the browser;
     a recognition step in which the inspection apparatus acquires recognized text by performing character recognition on the acquired image;
     an extraction step in which the inspection apparatus extracts, from the document, the body text to be rendered; and
     a determination step in which the inspection apparatus determines whether the document has been correctly rendered by the browser, by comparing the recognized text with the body text.
  7.  A computer-readable recording medium recording a program that causes a computer to function as:
     an acquisition unit that acquires an image in which a document is rendered, by giving the document to a browser and causing the browser to display it on a screen or render it in a virtual view of the browser;
     a recognition unit that acquires recognized text by performing character recognition on the acquired image;
     an extraction unit that extracts, from the document, the body text to be rendered; and
     a determination unit that determines whether the document has been correctly rendered by the browser, by comparing the recognized text with the body text.
  8.  A program that causes a computer to function as:
     an acquisition unit that acquires an image in which a document is rendered, by giving the document to a browser and causing the browser to display it on a screen or render it in a virtual view of the browser;
     a recognition unit that acquires recognized text by performing character recognition on the acquired image;
     an extraction unit that extracts, from the document, the body text to be rendered; and
     a determination unit that determines whether the document has been correctly rendered by the browser, by comparing the recognized text with the body text.
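
Claims 2 to 4 describe an extraction by tag removal and a determination based on which characters appear in each text. The following Python sketch illustrates one possible reading of those claims and is not the applicant's implementation. The names `extract_body_text`, `visible_chars`, and `is_rendered_correctly` are hypothetical. Two choices here are assumptions that go beyond the claim wording: the contents of `script` and `style` elements are skipped, and whitespace is ignored when comparing. The checks of claims 3 and 4 are combined into one verdict for brevity, although the claims present them separately.

```python
from html.parser import HTMLParser


class _BodyTextExtractor(HTMLParser):
    """Collects text nodes while dropping tags (assumption: script/style contents are skipped)."""

    _SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self._SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self._SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self._parts.append(data)

    def text(self):
        return "".join(self._parts)


def extract_body_text(document: str) -> str:
    """Claim 2 (sketch): obtain the body text by removing tags from the document."""
    parser = _BodyTextExtractor()
    parser.feed(document)
    parser.close()  # flush any text still buffered by the parser
    return parser.text()


def visible_chars(text: str) -> set:
    """Set of distinct non-whitespace characters (ignoring whitespace is an assumption)."""
    return {ch for ch in text if not ch.isspace()}


def is_rendered_correctly(recognized_text: str, body_text: str) -> bool:
    recognized = visible_chars(recognized_text)
    body = visible_chars(body_text)
    unexpected = recognized - body  # claim 3: recognized characters absent from the body text
    missing = body - recognized     # claim 4: body-text characters absent from the recognized text
    return not unexpected and not missing
```

Comparing sets of characters rather than whole strings makes the check insensitive to line breaks and reading order in the recognized text, at the cost of not detecting reordered or duplicated text.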
PCT/JP2015/080288 2015-10-27 2015-10-27 Testing device, testing method, recording medium, and program WO2017072865A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2017547240A JP6356924B2 (en) 2015-10-27 2015-10-27 Inspection device, inspection method, recording medium, and program
PCT/JP2015/080288 WO2017072865A1 (en) 2015-10-27 2015-10-27 Testing device, testing method, recording medium, and program

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2015/080288 WO2017072865A1 (en) 2015-10-27 2015-10-27 Testing device, testing method, recording medium, and program

Publications (1)

Publication Number Publication Date
WO2017072865A1 true WO2017072865A1 (en) 2017-05-04

Family

ID=58629960

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2015/080288 WO2017072865A1 (en) 2015-10-27 2015-10-27 Testing device, testing method, recording medium, and program

Country Status (2)

Country Link
JP (1) JP6356924B2 (en)
WO (1) WO2017072865A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11282840A (en) * 1998-03-26 1999-10-15 Hitachi Software Eng Co Ltd Document preparation edit support system and recording medium
JP2006079571A (en) * 2004-09-07 2006-03-23 Soar System Inc Printer print verification inspection device
JP2009134407A (en) * 2007-11-29 2009-06-18 Nomura Research Institute Ltd Test device and method for verifying execution result of computer program
JP2011008622A (en) * 2009-06-26 2011-01-13 Fujitsu Ltd Display test device, display test program, and display test method

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019117535A (en) * 2017-12-27 2019-07-18 Necソリューションイノベータ株式会社 Inspection apparatus, inspection method and inspection program
JP7044546B2 (en) 2017-12-27 2022-03-30 Necソリューションイノベータ株式会社 Inspection equipment, inspection method and inspection program
JP2019185623A (en) * 2018-04-16 2019-10-24 株式会社日立製作所 Device and method for generating information on operation difference between web browsers
JP6993284B2 (en) 2018-04-16 2022-01-13 株式会社日立製作所 Operation difference information generation device between Web browsers and operation difference information generation method

Also Published As

Publication number Publication date
JPWO2017072865A1 (en) 2018-07-19
JP6356924B2 (en) 2018-07-11

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15907229

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2017547240

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15907229

Country of ref document: EP

Kind code of ref document: A1