WO2005122002A2 - Structurized document creation method, and device thereof - Google Patents

Structurized document creation method, and device thereof Download PDF

Info

Publication number
WO2005122002A2
WO2005122002A2 PCT/JP2005/000102 JP2005000102W WO2005122002A2 WO 2005122002 A2 WO2005122002 A2 WO 2005122002A2 JP 2005000102 W JP2005000102 W JP 2005000102W WO 2005122002 A2 WO2005122002 A2 WO 2005122002A2
Authority
WO
WIPO (PCT)
Prior art keywords
structured document
input
document
structured
word
Prior art date
Application number
PCT/JP2005/000102
Other languages
French (fr)
Japanese (ja)
Other versions
WO2005122002A1 (en
Inventor
Masakazu Fujio
Hideyuki Ban
Original Assignee
Hitachi Medical Corp
Masakazu Fujio
Hideyuki Ban
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hitachi Medical Corp, Masakazu Fujio, Hideyuki Ban filed Critical Hitachi Medical Corp
Priority to JP2006514408A priority Critical patent/JP4649405B2/en
Publication of WO2005122002A1 publication Critical patent/WO2005122002A1/en
Publication of WO2005122002A2 publication Critical patent/WO2005122002A2/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/268Morphological analysis

Definitions

  • the present invention relates to a structured document creation method and apparatus suitable for use in a system that supports creation of an image interpretation report in the medical field, such as mammography (mammography).
  • mammography mammography
  • An image reading report refers to a photograph taken by a CT (Computer Tomography) or MRI (Magnetic Resonance Imaging), etc., which is observed by a specialist, and if there are any abnormal findings, characteristics such as the shape, size, type, etc. It describes the diagnosis results, instructions, etc. In recent years, securing the quality of these interpretation reports has become an issue.
  • DICOM In the DICOM standard, the standard for describing interpretation reports is DICOM— An SR is being formulated. As of January 2004, there are 16 DICOM basic standards and 93 standard supplements (for example, see Non-Patent Document 1).
  • SR Structured Reporting
  • mammography CAD (Suppl.50), chest CAD (Supple.65), catheter lab (Supple.66), circulatory organ (Supple.71), and cardiac echo (Supple.72).
  • Patient history (Supple.75), Numerical representation of vascular 'ventricular imaging (Supple.76), Vascular ultrasound (Supple.77), Fetal' pediatric (Supple.78), Breast cancer diagnosis (Supple.79), MR angiography ⁇ CT angiography (Supple.X).
  • DICOM To describe an image interpretation report in a standard structure format such as SR, determine which template (structured fixed format) the input sentence applies to, and fill in the blank part of the template. You have to determine the value that goes into. For this reason, a method has been adopted in which the content of the findings is templated and the template is filled in at the time of input (for example, see Patent Documents 1, 2, and 3).
  • Non-Patent Document 1 “DICOM Standard”, [online], search on May 5, 2004, Internet URL: http: / 1 meaical.nema.org/Dicom/
  • Patent Document 1 Japanese Patent Application Laid-Open No. 2001-125994 (paragraphs [0018] to [0034], FIG. 1)
  • Patent Document 2 Japanese Patent Application Laid-Open No. 2001-126007 (paragraph [0010], FIG. 1)
  • Patent Document 3 Japanese Patent Application Laid-Open No. 2003-288332 (paragraphs [0031]-[0038], FIGS. 1 and 2) Disclosure of the Invention
  • the present invention has been made based on the above-mentioned circumstances, and enables a doctor's opinion sentence to be input efficiently into a structured fixed format for each medical field in a form that reduces the burden on the doctor.
  • An object of the present invention is to provide a method and an apparatus for creating a structured document that can be converted and recorded.
  • the degree of conformity between an input text, a template, and a structured template stored in the past is provided by character string information or language analysis.
  • a template that is calculated using linguistic information and efficiently converted to The feature is to narrow down the list.
  • the input text is converted into a structured document according to the template by performing a correlation function operation using a word dictionary having category information necessary for the structure.
  • FIG. 1 is a drawing cited for explaining a system environment in which a structure creation apparatus of the present invention is used.
  • FIG. 2 is a diagram showing a template narrowing-down sequence according to the embodiment of the present invention.
  • FIG. 3 is a block diagram showing the internal configuration of the structure creation apparatus according to the embodiment of the present invention.
  • FIG. 4 is a diagram showing an example of a database and a program module according to the embodiment of the present invention.
  • FIG. 5 (a) is a diagram showing an example of a field-specific context group term dictionary, and (b) is an example of a field-specific SR template (template DB).
  • FIG. 6 is an example of an SR template according to the embodiment of the present invention.
  • FIG. 7 is a diagram showing an example of a structure display report visualization display according to the embodiment of the present invention.
  • FIG. 8 is a flowchart according to the embodiment of the present invention.
  • FIG. 9 is a conceptual diagram illustrating an operation of a structure processing according to an embodiment of the present invention.
  • FIG. 10 is a diagram showing an example of a context group defined in DICOM-SR.
  • FIG. 11 is a diagram showing an expression example of an in-process result of an input sentence according to the embodiment of the present invention.
  • FIG. 12 is a diagram showing an example of expressing an in-process result of an input sentence according to the embodiment of the present invention.
  • FIG. 13 is a diagram showing an example of an SR tree structure according to the embodiment of the present invention.
  • FIG. 14 is a diagram showing the concept of an SR tree structure.
  • FIG. 15 is a diagram showing an example of a Content Item defined in DICOM-SR.
  • FIG. 16 is a diagram showing an example of a constraint on Relationship in DICOM-SR.
  • FIG. 17 is a diagram showing one entry of an SR subtree bank according to the embodiment of the present invention.
  • FIG. 18 is a diagram showing an example of a time-series arrangement of interpretation reports according to the embodiment of the present invention.
  • DICOM Structure defined by SRs may contain not only information on disease names, diagnoses, plans, shapes, numbers, sizes, dates, etc. but also target images and reference information to past reports. it can.
  • Figure 14 shows the concept of a structured document. As shown in FIG. 14, each concept is expressed by an information unit called “Content Item” denoted by reference numeral 1001 and a “Relationship” denoted by reference numeral 1002, which defines a relationship between Content Items.
  • DICOM Structured documents in SR have Content ItemOOl in nodes
  • the Content Item 1001 is represented by a triple of a "value type” denoted by reference numeral 1101, a "concept name” denoted by reference numeral 1102, and a corresponding "value” denoted by reference numeral 1103. Is done.
  • value type 1101 six types of “CONTAINER”, “TEXT”, “CODE”, “NUM”, “PNAME”, and “SCOORD” are illustrated here.
  • rCONTAINERj is used not only as the title of the SR document (corresponding to the root node when expressed in the SR tree structure), but also as a container for grouping and hierarchizing Content Items. When giving meaning to a group, it is possible to write an appropriate name in the concept name 1102 and use it as a simple container without describing the name. “CONTAINER” is the only content item 1101 value type that does not have an actual “value (1103)”.
  • TXT has a character string as a value 1103, and always needs a “concept name (1102)” indicating what value the character string is.
  • concept name 1102 is not an arbitrary term but a standard term to which a force code is assigned in DICOM-SR.
  • CODE both the concept name 1102 and the value 1103 are expressed in standard terms.
  • NUM has a numeric concept name 1102 and a value 1103 with units.
  • PNAME is used to represent the concept name 1102 and its value 1103 relating to a personal name such as “doctor” and “nurse”.
  • SCOORD represents the coordinates in the image referenced by the report.
  • a Content Item is also defined to express the reference to the target image of the report, and to express the reference to the past report.
  • Types of Relationship (1002), which is another important component of the SR tree structure, are: “Contain”, “Has PropertiesJ, inferred From”, “By—Reference”. , “Selected FromJ,””Has Observation ContextJ,””Has Acquisition ContextJ,” and “Has Concept Modifie” (for example, see Figs. 14 and 16).
  • CONTAINER the parent node type is " CONTAINER ", which is used when there is no point in grouping child nodes.
  • “Has Properties” is used to further refine the contents of the parent node, and “inferred from” is used when the symptom, etc., on which the diagnosis was based is included as a child node.
  • “By-ReferenceJ,“ Selected From ”” is used to reference images.
  • “Has Observation ContextJ” “Has Acquisition Context” is used to describe attributes such as observation date and time and observer.
  • “Has Concept Modifie is used to add expressions that modify symptoms.
  • the SR report according to the present embodiment includes a Content Item 1001 as shown in FIG.
  • the content item type of the root node is “CONTAINER”, and the concept name 1102 describes the title of the SR report.
  • the template describes the subtree structure of the SR, and describes a fixed pattern for the partial structure of Content ItemOOlOl and Relationshipl002 for a part of the report.
  • the number “TID4006” represents the ID of the template
  • Reference numeral 402 shown in FIG. 6 indicates that the value 1103 of the content item 1001 of the root node must also select a neutral force of the CID 6014.
  • the CID is called a context group in the DIC OM-SR, and an ID is assigned to each predetermined value set.
  • the CID 6014 has values 1103 such as “Calcincation Cluster”, “Individuai CalcificationJ”, “Breast Composition J”, “Breast Geometry”.
  • there is a portion 1103 such as “reliability of finding” indicated by reference numeral 403 and a portion reading another template such as reference numerals 404 and 406. It is also possible to specify the same template ID as the caller, such as reference numeral 408.
  • the number of appearances and whether or not the item is a required item are described as indicated by reference numerals 405 and 407.
  • Reference numeral 405, “1, MC” means “only once” if it appears, “M” means “required”, and “C” means “if the condition is satisfied”. The conditions are described in words in the actual definition of the template, but are omitted in FIG.
  • DTID4006 template called from “Single Image Finding”, has “tumor description”, “calcification description”, “numerical expression”, “single image finding”, etc.
  • the templates for calling “Image Finding” include "Impression / RecommendationJ” and "Amplify FeatureJ”.
  • a context group is a group of standard terms that appear in a specific context.
  • Figure 10 shows an example of the context group “CID6132” for terms related to calcification.
  • the coding system for the first column force standard term described as “SRT” is This is an abbreviation, and various glossaries such as SNOMED, ICD10, and HI-BIRD are used.
  • the second column shows the version of the coding scheme used.
  • the third column shows the code value of the term.
  • the fourth column is intended to enhance readability, and describes the contents of the code for convenience.
  • the context group is used by specifying the context group ID in the template (SR template) described in the previous section. In the present embodiment, the template and the SR template indicate the same one.
  • FIG. 13 shows an example of a template description according to DICOM-SR.
  • DICOM-SR DICOM-SR
  • reference numeral 900 denotes a root node of the SR document tree, which represents a document title “Mammography Report”.
  • the child node has an ontent Item of IMAGE type 902 and a root node Item (905) with partial processing and Findings Summary information.
  • Reference numeral 902 has a reference to the image information for which the report is to be written, and describes the child node (reference numeral 904) with the relation name “Has Acq Context "(symbol 903).
  • FIG. 1 is a diagram cited for explaining a system environment in which the structure creation apparatus of the present invention is used.
  • the interpretation center 10 and the radiology department 11 It shows the configuration of the system that converts the input to the client to the DICOM-SR format and the environment for connecting to the network.
  • the system shown in Fig. 1 consists of an image interpretation center 10 equipped with an image interpretation client, a radiology department 11 equipped with an image interpretation client, a hospital 12 equipped with a DICOM image viewer, a structured Web server 13 (structured document creation device), an image interpretation client It is composed of 14 ant, 14 specialist certification body, and 16 health examination body.
  • the requesting doctor first transmits the DICOM image to be read to the reading client 14 of the reading doctor.
  • the transmission protocol various communication forms and protocols can be used depending on the system configuration in which DICOM communication is common in the diagnostic imaging system.
  • the image interpretation client 14 starts image interpretation work while viewing the DICOM image received via the image receiving means (not shown).
  • the doctor looks at the microphone while looking at the captured image displayed on the computer screen (DICOM image viewer ZDICOM—SR report viewer) or the film arranged on the shaft. Dictate.
  • DICOM image viewer ZDICOM—SR report viewer the captured image displayed on the computer screen
  • SR report viewer the captured image displayed on the computer screen
  • the input work of the doctor is not particularly burdened as much as the conventional interpretation work, but internally, the structure based on the input data that is not stored as it is as speech recognition text Conversion to DICOM-SR format to save power. There are a plurality of possible timings for starting the structural processing.
  • a presentation method a method of displaying when the structure is completely completed, and a method of dynamically displaying the state of partial structuring during the dictation can be considered.
  • a display method of the structured result for example, a display format shown in FIG. 7 can be considered.
  • the means for structuring a free sentence can be provided as an outsourcing technology to various medical institutions as a Web (World Wide Web) service.
  • a structured Web service is provided at a site somewhere on the network, such as a structured Web server 13.
  • a structured program can be used from the interpretation report system in the form of a function call.
  • FIG. 2 shows a DICOM-SR report sentence structured sequence in this case. That is, the examination part information and the image are respectively input from the image interpretation client 14 by voice input (S201) and from the PACS (Picture Archiving ana Communication system) or IS (Radiology Information System) of the hospital 12 to the examination site information and the image.
  • the structured Web server 13 narrows down template candidates (S203), and also narrows down existing structured report candidates accumulated in the past. Further, the structure is reduced by narrowing down the SR subtree candidates according to the finding input (S202) (S204, S205), and a DICOM-SR report, which is the final target, is created.
  • the narrowing down of the SR subtree candidates (S204) and the structuring process (S205) will be described later.
  • FIG. 3 shows the internal configuration of the structured document creation device mounted on the structured Web server 13. Note that the structured document creation device may be implemented in a computer of the interpretation center 10 or the radiology department 11.
  • the structured document creation device includes an arithmetic unit 1, an input device 2, a storage device 3, and an output device 4.
  • the arithmetic unit 1 is mainly composed of a CPU (Central Processing Unit), and includes a finding input capturing unit 101, an extended morphological analysis unit 102, a structured document conversion unit 103, a structured document output unit 104, and a dictionary. It includes an information update unit 105, an edit / input fetch unit 106, a search support unit 107, an item value prediction unit 108, and a structured document update unit 109 (see FIG. 3).
  • a CPU Central Processing Unit
  • the finding input capturing unit 101 converts speech (eg, a finding sentence relating to a disease or disease) input by a doctor into text by a built-in speech recognition engine, and supplies the text to the extended morphological analysis unit 102.
  • the extended morphological analysis unit 102 refers the converted text document to a word dictionary 33 that stores technical terms and categories to which the technical terms belong. Then, the morphological analysis is performed, and the morphological analysis is supplied to the structure conversion unit 103.
  • the structured document conversion unit 103 extracts the correlation between the word obtained by the morphological analysis and the category to which the word belongs, and converts the text document according to a standardized format standardized based on the correlation.
  • the document is converted into a document (interpretation report) and supplied to the structure output unit 104.
  • the structured document conversion unit 103 narrows down the templates stored and prepared in the template DB 34 by performing a correlation function operation.
  • the structured document output unit 104 accumulates the structured document obtained by the conversion in the structured document conversion unit 103 in the structured document DB (structured report DB) 312 and sets the output device 4 as the output device 4. Output to a display screen prepared in advance.
  • the edit input capturing unit 106 captures the input and supplies the input to the dictionary information updating unit 105.
  • the dictionary information updating unit 105 has a function of referring to the structured document stored in the structured document DB 312, and updating the word dictionary 33 (301 (see FIG. 4)) using the editing history.
  • the diagnostic knowledge DB 320 stores a diagnostic knowledge system created based on the structured documents stored in the structured document DB 312, and the item value prediction unit 108 reads out the diagnostic knowledge system from the diagnostic knowledge DB 320. It has the function of predicting the progress of symptoms from information and the elapsed time information that is input and captured from the outside, and applies it to the item fields of a standardized standard format (template).
  • the search support unit 107 has a function of searching for a structured document stored in the structured document DB 312 from an input character string accompanying a search request to which external force is also supplied, and searching and outputting the corresponding structured document.
  • the structured document updating unit 109 visually displays the structured document obtained by the conversion on a display screen, prompts an editing input by a doctor or the like from the input device 2 such as a keyboard, and when there is an editing input, Import the information and update the contents of the structured document DB312 where the structured documents are stored.
  • the word dictionary 33 (word dictionary 301), the structured document DB 312, the knowledge system DB (diagnostic knowledge DB 320), and the force represented as a part of the template DB 34 are actually:
  • the various files shown in Figs. 4 and 5 are constructed and used as a DB in the storage device 3. Also, as in the sentence boundary determination module 302, etc.
  • Various program modules that are called and function by U are also stored in the storage device 3 and provided.
  • an SR subtree bank 314 or as shown in FIG.
  • a category group term dictionary (remembering the category to which the technical terms belong), such as Caterlab and cardiac echo, is prepared, and the template DB (34) contains, as shown in Fig. 5 (b), SR templates for each field, such as mammography, catheter lab, and echocardiography, are available. Details will be described later.
  • the structured document creation device (arithmetic device 1) of the present invention (see FIG. 4) is configured to input text according to the character string pattern matching rules registered in the sentence boundary determination module 302 and the paragraph boundary determination module 308. Is divided into combinations of sentence sets.
  • the sentence segmentation process is performed step by step, and roughly divided into sentence sets such as “diagnosis part”, “finding part”, and “instruction part”, and then prepared by the noragraph boundary determination module 308 for each sentence set type.
  • the sentence segmentation process is recursively repeated according to the given pattern matching rules.
  • the sentence is assumed as the minimum unit, but the connecting particle ⁇ Ichiga '', such as ⁇ No obvious malignant findings are found, but please refer to other examination findings '' etc. This is not the case when it is better to divide a sentence into separate sentence sets in the middle of a sentence.
  • Each sentence set can be expressed by surrounding it with tags such as "ku paragraph> '" // paragraph> ".
  • tags such as "ku paragraph> '" // paragraph> ".
  • the type of paragraph may be more specific to the criticism.
  • the data is decomposed into a set of specific content description areas with a possible granularity.
  • the clue patterns required for the decomposition are recorded in the paragraph boundary determination module 308 and the sentence boundary determination module 302 allocated to the storage device.
  • the rules used in the sentence segmentation process are divided into those common to each user or examination type and those individually customized.
  • New data and rules that can be used to create structural reports according to the standard format can be synchronized by sharing the knowledge obtained at each site as necessary, and sharing them. It is possible to improve each other's reporting environment.
  • one sentence is assumed as input. However, since the content corresponding to one SR template may be described in multiple sentences, it may be applied to multiple sentences.
  • a word dictionary with semantic information such as “disease name” and “part” and an SR subtree bank 314 that is not limited to the word dictionary 33 for a context group are used as dictionaries.
  • the SR subtree bank 314 can be regarded as a word dictionary with semantic information having an SR subtree as semantic information.
  • Word power in word dictionary 301 "Many SMALL It corresponds to a character string that exceeds the word range, such as "Caldfication,” and has a power equivalent to its meaning. It is a corresponding SR subtree.
  • phrase analysis module 305 performs a phrase construction process on the word sequence among the word (character string) sequences output from the extended morphological analysis unit 102.
  • Phrases are independent words, which are words that typically have meanings such as verbs and nouns. The sequence is followed by a sequence of adjuncts such as case particles and auxiliary verbs that have no meaning by themselves. Also in English, units corresponding to Japanese clauses called noun clauses and verb clauses can be constructed.
  • the word sequences 601, 602, and 603 shown in FIG. 9 become a phrase 604 ("to the right breast")
  • the sign 605 becomes a phrase 606 ("accompanying") alone
  • the word sequences 607, 608, and 609 represent , Sentence 610 ("Tumor shadow")
  • symbols 611, 612, and 613 become sentence 614 ("Accept.”).
  • each clause can have semantic information such as “disease name”, “part”, “instruction”, etc., and “context group identification information” to which the word belongs, according to the semantic information of the word that is a constituent element.
  • Fig. 10 shows a context group related to calcification type terms.
  • the term “popcorn-like calcification” is assigned the code “F-01761” in version 1.1 of the code-riding system “SRT”.
  • the context group in Fig. 10 is assigned the ID "CID6132".
  • the context group of clause 604 is CID6022, which generally means an examination part.
  • Clause 610 is classified as “Mass”, which is a type of tumor, and clause 614, a verb phrase, is classified as indicating certainty.
  • the SR template ID that includes the phrase as an element is output as the phrase type.
  • the phrase type an element corresponding to “confidence” appears in TID4002, TID4005, TID4006, and the like, and thus has the above-mentioned three SR template HDs as the phrase type of reference numeral 614.
  • the SR subtree 615 has an ID of TID4009
  • SR templates having TID4009 as elements include TID4006 and TID4005.
  • the character string portion corresponding to the SR subtree has SR substructure tree information and needs to perform a phrase type determination process. There is no.
  • reference numeral 800 represents an input sentence, and the sentence boundary is in an ambiguous state.
  • Reference numeral 801 denotes a state in which the sentence range is determined by the above-described sentence division processing, and each sentence is represented by an area surrounded by a tag “ku sentence> ⁇ .
  • the range of each word in a state where the extended morphological analysis processing has been completed is represented as an area surrounded by a tag ⁇ morphological element ⁇ .
  • Reference numeral 803 indicates a state where the phrase construction processing has been completed. Is represented as an area surrounding the tag phrase.
  • Reference numeral 804 denotes a state in which the phrase type determination processing has been completed, and each phrase type is represented by “ ⁇ calcification”, “ ⁇ confidence level”. , “ ⁇ Number”, “ ⁇ pathology>”, etc.
  • An SR template has an ID and a subtree structure.
  • the subtree has the above-described context group ID and SR template ID (at least one of context group identification information and template identification information is added), and includes the above-described clause type information and SR subtree information.
  • phrase type determination process when a series of phrase types or SR subtrees at close distances have the same template ID as the phrase type, for example, in the example shown in FIG. If it has a template ID of "4005", a subtree is constructed according to the common template ID. Those clauses and SR subtrees do not have to be continuous.
  • the SR template itself to be compared with the input analysis result in the template matching type structure processing shown in S65 of FIG. 9 can also calculate the ID of the template having itself as an element at a glance .
  • the SR subtree newly constructed by the template matching type structuring process is similar to the phrase or SR subtree whose template ID candidates are the same, The elements are put together to form a subtree corresponding to the template.
  • those phrase groups are put together as a subtree.
  • the grouped clauses include the following contents: ⁇ description on size '', ⁇ description on tumor morphology and symptoms '', ⁇ description of number '', ⁇ description of interval until next test '', etc.
  • the SR subtree to be represented.
  • the output is changed by using the D ICOM image header, the examination type obtained through the hospital system such as RIS, HIS, and imaging part information. can do. For example, if the symmetric finding is found to be mammography, the SR template used for the content part will be narrowed down to template ID 4000-4100.
  • the document is output from the output device 4.
  • the output device 4 corresponds to a display screen, a printer, speed, a network interface card, and the like.
  • the structured document output unit 104 executes the structured report storage module 318. Thus, it is stored in the structure document DB (318).
  • the stored structure report is decomposed into a reusable subtree by the substructure extraction module 313 and registered in the SR subtree bank 314 as a pair with a corresponding substring.
  • these subtrees are synchronized and shared for highly available SR subtrees stored at each interpretation site, creating a structured report construction environment. It is possible to improve each other.
  • diagnostic knowledge DB320 By combining the diagnostic knowledge DB320 with the structured report, it is possible to provide an alert function for doctor input and omission of discrimination, suggestion of a test, etc., and educational human power support for trainees. For example, if the diagnostic knowledge DB320 has the knowledge that "ultrasonics are easy to distinguish cells from fibrous line types", if findings of "fibrous line types” or "cysts" are input, other What is the possibility!
  • reference numeral 1400 indicates the date and time (time axis) at which the image interpretation report was created.
  • Reference numeral 1401 indicates a site to be reported.
  • Reference numeral 1402 indicates calcification, tumor, and other findings (symptoms) at each site.
  • Each finding is further classified according to its severity. For example, in the case of calcification, calcification cases of high severity are listed from those related to obvious benign calcification such as calcification of skin and blood vessels, fiber type, circular calcification, and central calcification. .
  • Reference numerals 1404 to 1408 denote reports. By placing them at the corresponding time, site, and symptom location, it is possible to have a bird's-eye view of reports that have been performed in the past. It is convenient to create a new report or refer to an appropriate past report!
  • a registration module (SR block registration module 316) in the SR subtree bank 314 (see FIG. 4) is used.
  • SR block registration module 316 in the SR subtree bank 314
  • synonyms to the word dictionary 33 for the context group see Fig. 3.
  • the side force of the interpreting physician is a means for returning feedback on the presented structure result, and is the means for editing and inputting of input means 106.
  • the value of the “Individual Calcification” field in FIG. 7 is “Circular calcification” (reference numeral 501), but in fact, a content group on calcification cases (DICOM—SR, as shown in FIG. 10) If it is another element (defined as CID No. 61313), select another item (eg, “lime calcification”), or enter the value directly, Feedback to output results It comes out.
  • the phrase type determining module 304 is customized based on the structural modification report history record DB (311) recorded by the structural modification report editing module 317. For example, when the character string “S7” appears, the pattern match rules handled by the phrase type determination module 304 include the “cancer progress” pattern and the date (“Showa 7”) pattern. Suitable for both. If the radiologist is inputting a finding with the intention of “cancer progress”, no customization is performed if the phrase type determination module 304 originally determines the “cancer progress”.
  • the pattern matching rule is usually corrected by the input of the editing / inputting unit 106 of the structure editing report based on the feedback of the radiologist, and the corrected pattern is stored in the report correction history DB (311 ).
  • the correction log analysis module can partially customize the writing style of each radiologist by changing the application order of rules or changing the application weight of patterns.
  • DICOM —SR compatible interpretation report used in the input stage of the radiology interpretation report system will be required to be written as a standard specification in a pamphlet as an essential specification.
  • the invention can be expected to be used as an effective tool.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)
  • Document Processing Apparatus (AREA)

Description

明 細 書  Specification
構造化文書作成方法ならびに装置  Structured document creation method and apparatus
技術分野  Technical field
[0001] 本発明は、例えば、マンモグラフィ (乳房 X線撮影)等、医療分野における画像読影 レポート作成を支援するシステムに用いて好適な、構造化文書作成方法ならびに装 置に関する。  The present invention relates to a structured document creation method and apparatus suitable for use in a system that supports creation of an image interpretation report in the medical field, such as mammography (mammography).
背景技術  Background art
[0002] 画像読景レポートとは、 CT (Computer Tomography)や MRI (Magnetic Resonance Imaging)等の撮影写真を専門医が観察し、異常所見があれば、その形状、大きさ、 種類等の特徴や、診断結果、指示等を記述したものである。近年、これら読影レポ一 トの質の確保が課題となって 、る。  [0002] An image reading report refers to a photograph taken by a CT (Computer Tomography) or MRI (Magnetic Resonance Imaging), etc., which is observed by a specialist, and if there are any abnormal findings, characteristics such as the shape, size, type, etc. It describes the diagnosis results, instructions, etc. In recent years, securing the quality of these interpretation reports has become an issue.
従来の読影レポートシステムの問題点として、読影医師ごとに記述の仕方や詳細度 に差があるため、読影の質および信憑性が分かりにくい等の点があった。例えば、本 来診断すべき箇所についての記述がない場合、見落としなのか、見たうえでの異常 無しなのかが区別できない。また、現在遠隔読影サービスを実施している会社が既 に存在する力 どの読影医に読影の依頼がまわるかは、依頼者側からは見えないこ ともある。その際、お互い使用する用語や略語が統一されていることは、内容を正確 に伝える上でとても重要である。  One of the problems with the conventional interpretation report system was that the quality and authenticity of the interpretation were difficult to understand due to differences in the description method and the degree of detail among the interpretation doctors. For example, if there is no description of the part that should be diagnosed, it cannot be distinguished whether it is an oversight or no apparent abnormality. In addition, there are companies that currently offer remote image interpretation services. The requester may not be able to see which doctor's request for interpretation is available. In doing so, it is very important that the terms and abbreviations used are unified, in order to accurately convey the content.
[0003] 前記した背景から、医療情報システム間の情報交換のための標準化規格の策定が 進められてきており、現状では、 DICOM (Digital Imaging and Communication in Medicine)、および HL7 (Health Level 7)の普及が進んでいる。  [0003] From the background described above, the formulation of standardization standards for information exchange between medical information systems has been promoted. Currently, DICOM (Digital Imaging and Communication in Medicine) and HL7 (Health Level 7) Spread is spreading.
特に放射線部門において、 CT、 MRI等の装置で撮影される画像データの通信 · 保存に関する DICOM規格の普及が進んでおり、 PACS (Picture Archiving and Communication System)や DICOM対応モダリティ等の形で各社から対応製品が発 売され、毎年 IHEの主催するコネクタソンと呼ばれるイベントにおいて、各ベンダーマ シン間の接続チェックが行われて 、る。  Particularly in the radiation department, the DICOM standard for communication and storage of image data captured by devices such as CT and MRI is spreading, and each company supports it in the form of PACS (Picture Archiving and Communication System) and DICOM compatible modalities. When products are released, a connection check between vendor machines is performed at an event called Connectathon organized by IHE every year.
[0004] DICOM規格の中での読影レポートの記述に関する標準規格としては、 DICOM— SRの策定が進められている。 2004年 1月現在、 16の DICOM基本規格と 93の規格 補遺力もなる (例えば、非特許文献 1参照)。 [0004] In the DICOM standard, the standard for describing interpretation reports is DICOM— An SR is being formulated. As of January 2004, there are 16 DICOM basic standards and 93 standard supplements (for example, see Non-Patent Document 1).
SR (Structured Reporting)に関係するものは、マンモグラフィ CAD (Suppl.50)、胸 部 CAD (Supple.65)、カテーテルラボ(Supple.66)、循環器(Supple.71)、心臓エコー (Supple.72)、患者病歴 (Supple.75)、血管'脳室造影の数値表現 (Supple.76)、血管 超音波(Supple.77)、胎児'小児(Supple.78)、乳癌診断 (Supple.79)、 MRアンギオグ ラフィ · CTアンギオグラフィ(Supple.X)である。  Those related to SR (Structured Reporting) include mammography CAD (Suppl.50), chest CAD (Supple.65), catheter lab (Supple.66), circulatory organ (Supple.71), and cardiac echo (Supple.72). ), Patient history (Supple.75), Numerical representation of vascular 'ventricular imaging (Supple.76), Vascular ultrasound (Supple.77), Fetal' pediatric (Supple.78), Breast cancer diagnosis (Supple.79), MR angiography · CT angiography (Supple.X).
[0005] DICOM— SR等の標準構造形式で読影レポートを記述するには、入力文がどのテ ンプレート (構造ィ匕された定型フォーマット)に当てはまるかを決定し、該当テンプレ ートの空欄部分に入る値を決定しなくてはならない。そのため、これまでにも所見記 述内容をテンプレートイ匕し、入力の際にテンプレートの穴埋めを行う方法等が取られ てきた (例えば、特許文献 1、 2、 3参照)。 [0005] DICOM—To describe an image interpretation report in a standard structure format such as SR, determine which template (structured fixed format) the input sentence applies to, and fill in the blank part of the template. You have to determine the value that goes into. For this reason, a method has been adopted in which the content of the findings is templated and the template is filled in at the time of input (for example, see Patent Documents 1, 2, and 3).
非特許文献 1:「DICOM規格」、 [online]、 2004年 5月 5日検索、インターネットく U RL: http:/ 1 meaical.nema.org/Dicom/  Non-Patent Document 1: “DICOM Standard”, [online], search on May 5, 2004, Internet URL: http: / 1 meaical.nema.org/Dicom/
特許文献 1 :特開 2001— 125994号公報(段落 [0018]— [0034]、図 1) 特許文献 2:特開 2001— 126007号公報 (段落 [0010]、図 1)  Patent Document 1: Japanese Patent Application Laid-Open No. 2001-125994 (paragraphs [0018] to [0034], FIG. 1) Patent Document 2: Japanese Patent Application Laid-Open No. 2001-126007 (paragraph [0010], FIG. 1)
特許文献 3 :特開 2003— 288332号公報(段落 [0031]— [0038]、図 1、図 2) 発明の開示  Patent Document 3: Japanese Patent Application Laid-Open No. 2003-288332 (paragraphs [0031]-[0038], FIGS. 1 and 2) Disclosure of the Invention
[0006] し力しながら、前記した従来のテンプレート入力による読影入力システムでは、入力 者側で、あるいは診断部位毎に使用するテンプレートを選択し、入力項目を選択ある いは入力する必要があり、医師にとっては非常に使いにくいものであった。  [0006] However, in the conventional interpretation input system based on template input described above, it is necessary to select a template to be used on the input side or for each diagnostic site and select or input an input item. It was very difficult for doctors to use.
[0007] 本発明は、前記した事情に基づいてなされたものであり、医師の所見文入力を、診 療分野毎の構造化された定型フォーマットに、医師の負担が少ない形で、効率的に 変換して記録することのできる構造ィヒ文書作成方法ならびに装置を提供することを 課題とする。  [0007] The present invention has been made based on the above-mentioned circumstances, and enables a doctor's opinion sentence to be input efficiently into a structured fixed format for each medical field in a form that reduces the burden on the doctor. An object of the present invention is to provide a method and an apparatus for creating a structured document that can be converted and recorded.
[0008] 本発明の構造化文書作成方法ならびに装置は、入力テキストと、テンプレートおよ び過去に蓄積された構造化済みテンプレートとの適合度を、文字列情報、あるいは 言語解析して付与された言語情報を用いて計算し、効率的に変換先となるテンプレ ートを絞り込むことを特徴とする。このために、構造ィ匕に必要なカテゴリー情報を持つ 単語辞書を用いて相関関数演算を行うことで入力テキストをテンプレートに従う構造 化文書に変換する。 [0008] In the structured document creation method and apparatus of the present invention, the degree of conformity between an input text, a template, and a structured template stored in the past is provided by character string information or language analysis. A template that is calculated using linguistic information and efficiently converted to The feature is to narrow down the list. For this purpose, the input text is converted into a structured document according to the template by performing a correlation function operation using a word dictionary having category information necessary for the structure.
図面の簡単な説明  Brief Description of Drawings
[0009] [図 1]本発明の構造ィ匕文書作成装置が利用されるシステム環境を説明するために引 用した図である。  FIG. 1 is a drawing cited for explaining a system environment in which a structure creation apparatus of the present invention is used.
[図 2]本発明実施形態に係わるテンプレート絞込みのシークェンスを示す図である。  FIG. 2 is a diagram showing a template narrowing-down sequence according to the embodiment of the present invention.
[図 3]本発明実施形態に係わる構造ィ匕文書作成装置の内部構成ブロック図である。  FIG. 3 is a block diagram showing the internal configuration of the structure creation apparatus according to the embodiment of the present invention.
[図 4]本発明実施形態に係わるデータベースとプログラムモジュールの一例を示す図 である。  FIG. 4 is a diagram showing an example of a database and a program module according to the embodiment of the present invention.
[図 5] (a)は分野別コンテクストグループ用語辞書、(b)は分野別 SRテンプレート (テ ンプレート DB)の一例を示す図である。  [Fig. 5] (a) is a diagram showing an example of a field-specific context group term dictionary, and (b) is an example of a field-specific SR template (template DB).
[図 6]本発明実施形態に係わる SRテンプレートの例である。  FIG. 6 is an example of an SR template according to the embodiment of the present invention.
[図 7]本発明実施形態に係わる構造ィ匕レポート視覚化表示の一例を示す図である。  FIG. 7 is a diagram showing an example of a structure display report visualization display according to the embodiment of the present invention.
[図 8]本発明実施形態に係わるフローチャートである。  FIG. 8 is a flowchart according to the embodiment of the present invention.
[図 9]本発明実施形態に係わる構造ィ匕処理の動作概念図である。  FIG. 9 is a conceptual diagram illustrating an operation of a structure processing according to an embodiment of the present invention.
[図 10]DICOM—SRで定義されているコンテクストグループの例を示す図である。  FIG. 10 is a diagram showing an example of a context group defined in DICOM-SR.
[図 11]本発明実施形態に係わる入力文の処理途中結果の表現例を示す図である。  FIG. 11 is a diagram showing an expression example of an in-process result of an input sentence according to the embodiment of the present invention.
[図 12]本発明実施形態に係わる入力文の処理途中結果の表現例を示す図である。  FIG. 12 is a diagram showing an example of expressing an in-process result of an input sentence according to the embodiment of the present invention.
[図 13]本発明実施形態に係わる SR木構造の一例を示す図である。  FIG. 13 is a diagram showing an example of an SR tree structure according to the embodiment of the present invention.
[図 14]SR木構造の概念を示す図である。  FIG. 14 is a diagram showing the concept of an SR tree structure.
[図 15]DICOM— SRで定義されて!、る Content Itemの例を示す図である。  FIG. 15 is a diagram showing an example of a Content Item defined in DICOM-SR.
[図 16]DICOM—SRにおける Relationshipに関する制約の例を示す図である。  FIG. 16 is a diagram showing an example of a constraint on Relationship in DICOM-SR.
[図 17]本発明実施形態に係わる SR部分木バンクの 1エントリーを示す図である。  FIG. 17 is a diagram showing one entry of an SR subtree bank according to the embodiment of the present invention.
[図 18]本発明実施形態に係わる読影レポートの時系列配置の例を示す図である。 発明を実施するための最良の形態  FIG. 18 is a diagram showing an example of a time-series arrangement of interpretation reports according to the embodiment of the present invention. BEST MODE FOR CARRYING OUT THE INVENTION
[0010] 以下、本発明実施形態について図面を参照しながら詳細に説明する。まず、 DIC OM— SRにより定義される構造ィ匕文書について簡単に説明する。 DICOM— SRにより定義される構造ィ匕文書は、病名、診断、予定、形状、数値、大 きさ、日付等の内容に関するものだけではなぐ対象画像、過去レポートへのリファレ ンス情報を持つことができる。図 14に構造化文書の概念を示す。図 14に示されるよう に、符号 1001で示す「Content Item」と呼ばれる情報単位と、符号 1002で示す、 Content Item間の関係が定義された「Relationship」でそれぞれの概念が表現される。 DICOM— SRにおける構造化文書は、 Content ItemlOOlをノードに持ち、 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. First, the structure defined by DIC OM-SR will be briefly described. DICOM—Structures defined by SRs may contain not only information on disease names, diagnoses, plans, shapes, numbers, sizes, dates, etc. but also target images and reference information to past reports. it can. Figure 14 shows the concept of a structured document. As shown in FIG. 14, each concept is expressed by an information unit called “Content Item” denoted by reference numeral 1001 and a “Relationship” denoted by reference numeral 1002, which defines a relationship between Content Items. DICOM—Structured documents in SR have Content ItemOOl in nodes,
Relationshipl002をアークとする木(ツリー)構造で表現される。  It is represented by a tree structure with Relationshipl002 as an arc.
[0011] Content ItemlOOlは、図 15に示されるように、符号 1101で示す"値タイプ"、符号 1102で示す"概念名 "、符号 1103で示す対応する"値"の 3個の組みで表 現される。値タイプ 1101は、ここでは、「CONTAINER」、 「TEXT」、 「CODE」、 「NUM」 、 「PNAME」、 「SCOORD」の 6種類が例示されている。  [0011] As shown in Fig. 15, the Content Item 1001 is represented by a triple of a "value type" denoted by reference numeral 1101, a "concept name" denoted by reference numeral 1102, and a corresponding "value" denoted by reference numeral 1103. Is done. As the value type 1101, six types of “CONTAINER”, “TEXT”, “CODE”, “NUM”, “PNAME”, and “SCOORD” are illustrated here.
rCONTAINERjは、 SRドキュメントのタイトル(SRの木構造で表現した場合の、ルー トノードに相当する)として用いられる他、 Content ItemlOOlをグループ化して、階層 化するための入れ物として使われる。グループに意味づけをする場合、適切な名前 を概念名 1102に書けばよぐ名前を記述しないで単なる入れ物として用いることも可 能である。「CONTAINER」は Content ItemlOOlの値タイプ 1101の中では唯一、実 際の「値(1103)」を持たな 、ものである。  rCONTAINERj is used not only as the title of the SR document (corresponding to the root node when expressed in the SR tree structure), but also as a container for grouping and hierarchizing Content Items. When giving meaning to a group, it is possible to write an appropriate name in the concept name 1102 and use it as a simple container without describing the name. “CONTAINER” is the only content item 1101 value type that does not have an actual “value (1103)”.
[0012] 「TEXT」は、値 1103として文字列を持ち、その文字列が何の値であるかを示す「概 念名(1102)」を必ず必要とする。ここで、概念名 1102として用いられるのは、任意 の用語ではなぐ DICOM-SRの中であら力じめコードが割り当てられた標準用語を 用いる。「CODE」の場合は、概念名 1102と値 1103の両方について標準用語で表 現する。「NUM」は、数値の概念名 1102および単位を伴った値 1103を持つ。  “TEXT” has a character string as a value 1103, and always needs a “concept name (1102)” indicating what value the character string is. Here, the term used as the concept name 1102 is not an arbitrary term but a standard term to which a force code is assigned in DICOM-SR. In the case of “CODE”, both the concept name 1102 and the value 1103 are expressed in standard terms. "NUM" has a numeric concept name 1102 and a value 1103 with units.
「PNAME」では、 "医師"、 "看護師"等の人名に関する概念名 1102とその値 1103 を表現するのに用いられる。「SCOORD」は、レポートが参照する画像中の座標を表 現する。この他にも、レポートが対象としている画像へのリファレンスであると力、過去 のレポートへのリファレンスを表現するための、 Content ItemlOOlも定義されている。  “PNAME” is used to represent the concept name 1102 and its value 1103 relating to a personal name such as “doctor” and “nurse”. “SCOORD” represents the coordinates in the image referenced by the report. In addition, a Content Item is also defined to express the reference to the target image of the report, and to express the reference to the past report.
[0013] SR木構造のもう一つの重要な構成要素である、 Relationship (1002)」の種類とし ては、 の中で 、「Contain」、「Has PropertiesJ、 inferred From」、「By— Reference 」、「Selected FromJ、「Has Observation ContextJ、「Has Acquisition ContextJ、「Has Concept Modifie の 8種類の関係が定義されている(例えば図 14、図 16参照)。 rContainJという関係は、親ノードタイプが「CONTAINER」であり、子ノードをグルー ビングしている以上の意味がない場合等に用いられる。「Has Properties は、親ノー ドの内容をさらに詳細化するような場合に用いられ、 inferred from」は、診断の根拠 となった症状等を子ノードとしてもつ場合に用いられる。「By- ReferenceJ、「Selected From」は画像へのリファレンスに用いられる。「Has Observation ContextJ「Has Acquisition Context」は観察日時、観察者等の属性を記述する場合に用いられる。「 Has Concept Modifie は、症状を修飾する表現等を追加する場合に用いられる。 [0013] Types of Relationship (1002), which is another important component of the SR tree structure, are: "Contain", "Has PropertiesJ, inferred From", "By—Reference". , "Selected FromJ,""Has Observation ContextJ,""Has Acquisition ContextJ," and "Has Concept Modifie" (for example, see Figs. 14 and 16). For the relationship rContainJ, the parent node type is " CONTAINER ", which is used when there is no point in grouping child nodes. “Has Properties” is used to further refine the contents of the parent node, and “inferred from” is used when the symptom, etc., on which the diagnosis was based is included as a child node. “By-ReferenceJ,“ Selected From ”” is used to reference images. “Has Observation ContextJ” “Has Acquisition Context” is used to describe attributes such as observation date and time and observer. "Has Concept Modifie is used to add expressions that modify symptoms.
[0014] 本実施形態での SRレポートは、図 14に示したような、 Content ItemlOOlを [0014] The SR report according to the present embodiment includes a Content Item 1001 as shown in FIG.
Relationshipl002で結びつけた木構造であれば全て可というわけではなぐ以下に 列挙するいくつかの制約をもつ。  Not all tree structures connected by Relationshipl002 are possible, and have some restrictions listed below.
(1)ルートノードの Content ItemlOOlのタイプは「CONTAINER」で、概念名 110 2は、 SRレポートのタイトルを記述する。  (1) The content item type of the root node is “CONTAINER”, and the concept name 1102 describes the title of the SR report.
(2)使用可能な値タイプ 1101は、以下のもののみとする。  (2) Only the following value types 1101 can be used.
TEXT,CODE,NUM,DATETIME,DATE,TIME,UIDREF,PNAME,SCOORD,TCOORD ,COMPOSITE,IMAGE,WAVEFORM,CONTAINER  TEXT, CODE, NUM, DATETIME, DATE, TIME, UIDREF, PNAME, SCOORD, TCOORD, COMPOSITE, IMAGE, WAVEFORM, CONTAINER
(3)値タイプ 1101ごとの使用可能な Relationshipl002は、図 16に表形式で示したも のに限る。  (3) The relationship l002 that can be used for each value type 1101 is limited to the one shown in table form in FIG.
(4) SR木構造において、先祖にあたる Content ItemlOOlへの参照関係は禁止とす る。  (4) In the SR tree structure, the reference relationship to the ancestor Content Item1001 is prohibited.
[0015] 前記した制約だけでは、診断内容の記述パターンを与えるほどの十分な制約は与 えられていない。マンモグラフィ、カテーテルラボ等、診療分野ごとに SRによる記述を 詳細に規定したものとして、「テンプレート」が存在する。  [0015] The above constraint alone does not provide a constraint that is sufficient to give a description pattern of the contents of diagnosis. “Template” exists as a detailed definition of the SR description for each medical field, such as mammography and catheter labs.
テンプレートとは、 SRの部分木構造を記述したものであり、レポートの一部分に対し 、 Content ItemlOOlと Relationshipl002の部分的な構造についての定型パターンを 記述したものである。テンプレートを模式的に表現すると、図 6のように表現することが できる。ここで、「TID4006」、という番号は、テンプレートの IDをあらわしており、マンモ グラフィ分野のテンプレートの「Single Image Finding] (符号 401)をあらわしている。 TID4006が規定する部分木のルートノードは、「CODE」タイプの Content Item 1001 が用いられている。 The template describes the subtree structure of the SR, and describes a fixed pattern for the partial structure of Content ItemOOlOl and Relationshipl002 for a part of the report. When the template is schematically represented, it can be represented as shown in FIG. Here, the number “TID4006” represents the ID of the template, Represents the template “Single Image Finding” (reference 401) in the field of graphics, and uses the “CODE” type Content Item 1001 as the root node of the subtree specified by TID4006.
[0016] 図 6に示す符号 402では、ルートノードの Content ItemlOOlの値 1103としては、 CID6014の中力も選ばなくてはいけないということが指示されている。 CIDとは、 DIC OM—SRの中ではコンテクストグループと呼ばれ、予め決められた値集合ごとに、 ID が付与されている。  [0016] Reference numeral 402 shown in FIG. 6 indicates that the value 1103 of the content item 1001 of the root node must also select a neutral force of the CID 6014. The CID is called a context group in the DIC OM-SR, and an ID is assigned to each predetermined value set.
CID6014には、「Calcincation Cluster」、「Individuai CalcificationJ「Breast CompositionJ「Breast Geometry」等の値 1103が存在する。その他、このテンプレート を構成する要素として、符号 403で示した、「所見の確信度」のような値 1103や、符 号 404、 406等のように他のテンプレートを読み込んでいる部分も存在する。符号 40 8のように、呼び出し元と同じテンプレート IDを指定することも可能である。また、各ノ ード(Content ItemlOOl)が存在するための条件として、符号 405、 407のように、出 現回数と必須項目か否かとを記述する。符号 405の「1, MC」は、出現するなら 1回 だけで、「M」は「必須」、「C」は「条件が満たせば」を意味する。条件については、テ ンプレートの実際の定義には、言葉で細力べ記述されているが、図 6では省略してい る。  The CID 6014 has values 1103 such as “Calcincation Cluster”, “Individuai CalcificationJ”, “Breast Composition J”, “Breast Geometry”. In addition, as a component of this template, there is a portion 1103 such as “reliability of finding” indicated by reference numeral 403 and a portion reading another template such as reference numerals 404 and 406. It is also possible to specify the same template ID as the caller, such as reference numeral 408. Also, as conditions for the existence of each node (Content Item 1001), the number of appearances and whether or not the item is a required item are described as indicated by reference numerals 405 and 407. Reference numeral 405, “1, MC” means “only once” if it appears, “M” means “required”, and “C” means “if the condition is satisfied”. The conditions are described in words in the actual definition of the template, but are omitted in FIG.
[0017] この例では、符号 402の値が「Calcification Cluster]の場合、 DTID4009番のテンプ レート 406を、 Has Properties関係の子ノードとして必ず持ち、また、 Inferred From関 係で、 DTID4006番のテンプレート 408を使うことが推奨されて!、る。  [0017] In this example, when the value of the code 402 is "Calcification Cluster", the template 406 of the DTID4009 is always possessed as a child node of the Has Properties relation, and the template 408 of the DTID4006 is used in the Inferred From relation. It is recommended to use!
DTID4006番のテンプレート、「Single Image Finding」から呼び出されるテンプレート には、「腫瘍に関する記述」、「石灰化に関する記述」、「数値表現」、「単一画像所見 」等が存在し、逆に「Single Image Finding」を呼び出すテンプレートとしては、「 Impression/RecommendationJ、 「し omposite FeatureJ等のアンプレ ~~トカ 7子土す 。  DTID4006 template, called from “Single Image Finding”, has “tumor description”, “calcification description”, “numerical expression”, “single image finding”, etc. The templates for calling "Image Finding" include "Impression / RecommendationJ" and "Amplify FeatureJ".
[0018] コンテクストグループは、特定の文脈で出現する標準用語群を一つの表にまとめた ものである。図 10に、石灰化に関する用語のコンテクストグループ「CID6132」の例を 示す。  [0018] A context group is a group of standard terms that appear in a specific context. Figure 10 shows an example of the context group “CID6132” for terms related to calcification.
ここでは、「SRT」と記載されている第 1カラム力 標準用語をコードするコード体系の 略称を表しており、 SNOMEDや ICD10、 HI-BIRD等の様々な用語集が用いられてい る。第 2カラムは、用いているコード体系のバージョンを表す。第 3カラムは、当該用語 のコード値を表す。第 4カラムは、可読性を高めるためのもので、当該コードが表す内 容について、便宜的に記述している。コンテクストグループは、前節で述べたテンプ レート(SRテンプレート)の中で、コンテクストグループ IDを指定することで用いられる 。なお、本実施形態では、テンプレートと SRテンプレートとは、同じものを指す。 Here, the coding system for the first column force standard term described as “SRT” is This is an abbreviation, and various glossaries such as SNOMED, ICD10, and HI-BIRD are used. The second column shows the version of the coding scheme used. The third column shows the code value of the term. The fourth column is intended to enhance readability, and describes the contents of the code for convenience. The context group is used by specifying the context group ID in the template (SR template) described in the previous section. In the present embodiment, the template and the SR template indicate the same one.
[0019] 図 13に DICOM—SRに従うテンプレート記述の一例が示されている。ここでは(図 1 1参照)、「Dens Breast been. Round Calcincations ケ) Noted In RT.A C Area, Adenoma Susp.Microcalcifications Seen In RT.A C Area. Malignancy 元全【こ否疋 できません。 Echoおよび MMGにて 6ヶ月後に Follow upお願いします。」という文を、 SR木構造状に表現した例が示されて ヽる。  FIG. 13 shows an example of a template description according to DICOM-SR. Here (see Fig. 11), “Dens Breast been. Round Calcincations ケ) Noted In RT.AC Area, Adenoma Susp. Microcalcifications Seen In RT.AC Area. Malignancy Please follow up after 6 months. ”Is shown in an SR tree structure.
[0020] 図 13において、符号 900は、 SR文書ツリーのルートノードであり、文書タイトル" Mammography Report"を表現している。子ノードとしては、符号 902の IMAGEタイプ のし ontent ItemlOOlと、 Processing and Findings Summary 情報を持つ部分不の ルートノード Item (符号 905)を持つ。符号 902は、レポートが書かれる対象である画 像情報へのリファレンスをもち、また、どのような状況で撮られた画像かを説明するた め、子ノード(符号 904)を、関係名 "Has Acq Context" (符号 903)として持つ。  In FIG. 13, reference numeral 900 denotes a root node of the SR document tree, which represents a document title “Mammography Report”. The child node has an ontent Item of IMAGE type 902 and a root node Item (905) with partial processing and Findings Summary information. Reference numeral 902 has a reference to the image information for which the report is to be written, and describes the child node (reference numeral 904) with the relation name “Has Acq Context "(symbol 903).
[0021] さらに、 "Processing and Findings Summary"は、 TID4002番に対応する、 "  [0021] Furthermore, "Processing and Findings Summary" corresponds to TID4002,
Assessment Category" (符号 907)、 "Recommendation Follow-Up" (符号 908)、 " Recommendation Follow-up Interval',(符号 909)と、 TID4004番に対応する、 "  Assessment Category "(symbol 907)," Recommendation Follow-Up "(symbol 908)," Recommendation Follow-up Interval ', (symbol 909), corresponding to TID4004,
Individual Impression Recommendation" (符号 911)とから構成される。符号 911は、 TID4005のテンプレートから構成される、 "Composite Feature" (符号 913、 920)と、 TID4006番のテンプレートから構成される、 "Single Image Finding" (符号 926)とから 構成され、それぞれが石灰化や腫瘍等の具体的な記述に対応して!/ヽる。  "Single Image Impression Recommendation" (symbol 911). The suffix 911 is composed of "Composite Feature" (symbols 913 and 920) composed of TID4005 template and "Single Image" composed of TID4006 template. Finding "(symbol 926), each corresponding to a specific description of calcification or tumor!
[0022] 前記した DICOM— SRで定義された構造化文書の定義を前提に、本発明実施形 態について以下に詳細に説明する。  An embodiment of the present invention will be described in detail below on the basis of the definition of the structured document defined in DICOM-SR.
図 1は、本発明の構造ィ匕文書作成装置が利用されるシステム環境を説明するため に引用した図である。具体的には、読影センター 10や、放射線科 11における読影ク ライアントへの入力を、 DICOM—SRフォーマットに変換するシステムの構成と、ネット ワークへの接続環境とを示したものである。なお、図 1のシステムは、読影クライアント を備える読影センター 10、読影クライアントを備える放射線科 11、 DICOM画像ビュ 一ァ等を備える病院 12、構造化 Webサーバ 13 (構造化文書作成装置)、読影クライ アント 14、専門医認定機関 15、保健審査機関 16等カゝら構成されている。 FIG. 1 is a diagram cited for explaining a system environment in which the structure creation apparatus of the present invention is used. To be more specific, the interpretation center 10 and the radiology department 11 It shows the configuration of the system that converts the input to the client to the DICOM-SR format and the environment for connecting to the network. The system shown in Fig. 1 consists of an image interpretation center 10 equipped with an image interpretation client, a radiology department 11 equipped with an image interpretation client, a hospital 12 equipped with a DICOM image viewer, a structured Web server 13 (structured document creation device), an image interpretation client It is composed of 14 ant, 14 specialist certification body, and 16 health examination body.
[0023] 通常、 X線等の画像撮影依頼を行う医者と、撮影画像を見て診断を行う読影医とは 別であることが多ぐ最近では図 1に示されるように、読影医のいる読影センター 10や 放射線科 11と、読影を依頼する医者のいる病院 12とがネットワークにより接続され、 遠隔画像診断を行うケースが想定できる。  [0023] Usually, there are many doctors who request imaging of X-rays or the like and interpretation doctors who make diagnosis by looking at the captured images. Recently, as shown in FIG. A case can be assumed in which an image interpretation center 10 and a radiology department 11 are connected to a hospital 12 where doctors requesting image interpretation are connected via a network, and remote image diagnosis is performed.
現実に、放射線科認定を一箇所に集めた読影センターにおいて、遠隔読影サービ スを行っている会社も存在する。その場合、まず依頼医が、読影対象の DICOM画 像を読影医の読影クライアント 14に送信する。送信プロトコルとしては、画像診断シス テムでは DICOM通信が一般的である力 システム構成に応じて様々な通信形態お よびプロトコルを利用できる。読影クライアント 14は、図示せぬ画像受信手段を介して 受信した DICOM画像を見ながら、読影作業を開始する。  In fact, there are companies that provide remote reading services at a reading center that collects radiology certifications in one place. In that case, the requesting doctor first transmits the DICOM image to be read to the reading client 14 of the reading doctor. As the transmission protocol, various communication forms and protocols can be used depending on the system configuration in which DICOM communication is common in the diagnostic imaging system. The image interpretation client 14 starts image interpretation work while viewing the DICOM image received via the image receiving means (not shown).
[0024] 読影作業は ITィ匕が進んでおり、各ベンダーからキーボード入力や音声入力インタ 一フェイスをもった読影レポートシステムが製品化されている。例えば、日立メディコ 社で作成している読影レポートシステム「Natural Report」では、所見入力手段として、 Ami Voiceを用いた音声入力システムが用いられて!/、る。  [0024] IT interpretation is progressing in interpretation work, and interpretation reports systems having keyboard input and voice input interfaces have been commercialized by various vendors. For example, in the “Natural Report” interpretation report system created by Hitachi Medical, a voice input system using Ami Voice is used as a finding input means!
ここでは、読影センター 10で読影レポートを作成する場合の音声入力による読影レ ポートの流れについて説明する。まず、医師は、コンピュータ画面(DICOM画像ビュ ーァ ZDICOM— SRレポートビューァ)上に表示された撮影画像あるいは、シャ一力 ステン上に並べられた撮影フィルムを見ながら、マイクに向力つて所見を口述する。 従来の読影レポートシステムであれば、口述したものが音声認識エンジンによってテ キストデータに変換され、テキストのまま保存される。構造化処理の流れとしては、医 師の入力作業はこれまでの読影作業と変わらず特に負担は増えないが、内部では、 音声認識されたテキストをそのまま保存するのではなぐ入力データに基づいて構造 化処理を行い、 DICOM— SRフォーマットに変換して力も保存することになる。 [0025] 構造ィ匕処理を開始するタイミングとして、複数の場合が考えられる。通常、文章を言 語解析して、書かれている情報を判定し、定型フォーマットに当てはめて構造ィ匕を行 う場合、なるべく文章全体を解析処理エンジンに渡せる方が解析精度の面から望ま しいが、構造ィ匕結果を医師に確認してもらうには、口述の途中から、構造化ステップ を開始することが望ましい。 Here, the flow of an interpretation report by voice input when an interpretation report is created by the interpretation center 10 will be described. First, the doctor looks at the microphone while looking at the captured image displayed on the computer screen (DICOM image viewer ZDICOM—SR report viewer) or the film arranged on the shaft. Dictate. With a conventional interpretation report system, what is dictated is converted into text data by a speech recognition engine and stored as text. As for the flow of the structured processing, the input work of the doctor is not particularly burdened as much as the conventional interpretation work, but internally, the structure based on the input data that is not stored as it is as speech recognition text Conversion to DICOM-SR format to save power. There are a plurality of possible timings for starting the structural processing. Normally, when a sentence is linguistically analyzed to determine the written information and applied to a fixed format for structural analysis, it is desirable from the viewpoint of analysis accuracy that the entire sentence can be passed to an analysis processing engine as much as possible. However, it is desirable to start the structuring step in the middle of the dictation so that the doctor can confirm the result of the structure.
そこで、精度と速度のトレードオフにより、(1)文章全体を言い終わるまで待つ、(2) 1文を言い終わるまで待つ、(3) 1文の入力を待たない、といった 3パターンを考える ことができる。入力文を、 DICOM-SR等であら力じめ定義された定型フォーマットに 変換する際、どのテンプレートが入力文のどの範囲をカバーするかを判定する必要 があるが、(3)の場合のように、口述が始まっている段階力 処理を開始するには、テ ンプレート候補を探索する処理を行うことにより、処理効率を上げることが可能と考え られる。  There are three trade-offs between accuracy and speed: (1) wait until the entire sentence is completed, (2) wait until one sentence is completed, and (3) do not wait for one sentence. When converting an input sentence into a fixed format that is pre-defined by DICOM-SR or the like, it is necessary to determine which template covers which range of the input sentence, as in (3). In addition, in order to start the step-by-step processing in which dictation has begun, it is thought that processing efficiency can be improved by performing processing to search for template candidates.
[0026] 構造ィ匕の結果について医師が意図したものかを確認するため、構造化の結果をわ 力りやすく提示する必要がある。提示方法としては、構造ィ匕が完全に終了した段階で 表示する方法と、口述途中に動的に部分的な構造化の様子を表示する方法とが考 えられる。構造化結果の表示方法としては、例えば、図 7に示す表示形式が考えられ る。  [0026] In order to confirm whether the result of the structure is intended by the doctor, it is necessary to present the result of the structure easily and clearly. As a presentation method, a method of displaying when the structure is completely completed, and a method of dynamically displaying the state of partial structuring during the dictation can be considered. As a display method of the structured result, for example, a display format shown in FIG. 7 can be considered.
図 7に示す例では、符号 504で示すフィールドに、文章力 抽出された、 X線照射 方向に関する情報が表示され、また、特にフォローに関する記述はなく(符号 505)、 症状の一つとして符号 506 (Composite Feature)が書かれており、その中身は、位置 が内上部、外上部 (符号 500)で、円形石灰化 (符号 501)を 3箇所 (符号 502)持ち、 診断名は腺腫 (符号 503)である等のことがわかる。  In the example shown in FIG. 7, information about the X-ray irradiation direction, which is extracted by writing power, is displayed in the field indicated by reference numeral 504, and there is no description about follow-up (reference numeral 505). (Composite Feature) is written, and its contents are the inner upper part and the outer upper part (reference number 500), three circular calcifications (reference number 501) (reference number 502), and the diagnosis is adenoma (reference number 503). ).
[0027] 構造化データが記憶装置に蓄積されれば、病変部位と症状のペアで検索する等、 高品質な読影レポート検索を行うことができる。また(図 1参照)、定型フォーマットに 従いレポートを記述することが一般的となれば、専門医認定機関 15において、専門 医の評価を行う際に、過去に作成された SRレポートを用いて評価を行うことや、保険 審査機関 16において医師の能力に応じた保険点数付けの差別ィ匕を行う際の評価へ の応用等の可能性も考えられる。 [0028] また、自由文の構造化手段を、 Web (World Wide Web)サービスとして、様々な医 療機関にアウトソーシング技術として提供することができる。図 1では、構造化 Webサ ーバ 13のように、ネットワーク上のどこかのサイトで構造ィ匕プログラムサービスが提供 される。この場合、読影レポートシステム側からは、関数呼び出しのような形で、構造 化プログラムが使用可能となる。 If the structured data is stored in the storage device, it is possible to perform a high-quality interpretation report search such as a search using a pair of a lesion site and a symptom. In addition, if it is common to write reports in a standard format (see Fig. 1), a specialist certification body 15 will use an SR report created in the past to evaluate specialists when evaluating specialists. It is also conceivable that the application may be applied to evaluation when discriminating insurance scores according to the physician's ability at the insurance reviewing agency 16. [0028] In addition, the means for structuring a free sentence can be provided as an outsourcing technology to various medical institutions as a Web (World Wide Web) service. In FIG. 1, a structured Web service is provided at a site somewhere on the network, such as a structured Web server 13. In this case, a structured program can be used from the interpretation report system in the form of a function call.
[0029] 図 2に、この場合における DICOM— SRレポート文構造化シーケンスが示されてい る。すなわち、読影クライアント 14から音声による所見入力(S201)が、また、病院 12 の PACS (Picture Archiving ana Communication system)、あるいは IS (Radiology Information System)から、それぞれ検査部位情報、画像が構造化 Webサーバ 13に 提供される。これを受けた、構造化 Webサーバ 13は、テンプレート候補の絞込みを 行うが(S203)、このとき併せて過去に蓄積された既存の構造化レポート候補の絞り 込みも行う。さらに、所見入力(S202)に従う SR部分木候補の絞込みによる構造ィ匕 処理を実行し(S204、 S205)、最終的な目標である DICOM— SRレポートの作成を 行う。前記した SR部分木候補の絞込み(S204)、および構造化処理 (S205)につい ては後記する。  FIG. 2 shows a DICOM-SR report sentence structured sequence in this case. That is, the examination part information and the image are respectively input from the image interpretation client 14 by voice input (S201) and from the PACS (Picture Archiving ana Communication system) or IS (Radiology Information System) of the hospital 12 to the examination site information and the image. Provided to In response to this, the structured Web server 13 narrows down template candidates (S203), and also narrows down existing structured report candidates accumulated in the past. Further, the structure is reduced by narrowing down the SR subtree candidates according to the finding input (S202) (S204, S205), and a DICOM-SR report, which is the final target, is created. The narrowing down of the SR subtree candidates (S204) and the structuring process (S205) will be described later.
[0030] 構造化 Webサーバ 13に実装される構造ィ匕文書作成装置の内部構成を図 3に示す 。なお、構造化文書作成装置は、読影センター 10もしくは放射線科 11のコンビユー タに実装されても良い。  FIG. 3 shows the internal configuration of the structured document creation device mounted on the structured Web server 13. Note that the structured document creation device may be implemented in a computer of the interpretation center 10 or the radiology department 11.
構造化文書作成装置は、演算装置 1、入力装置 2、記憶装置 3及び出力装置 4を 含んで構成される。演算装置 1は、 CPU (Central Processing Unit)から主として構成 され、所見入力取込み部 101と、拡張型形態素解析部 102と、構造化文書変換部 1 03と、構造ィ匕文書出力部 104と、辞書情報更新部 105と、編集入力取込み部 106と 、検索支援部 107と、項目値予測部 108と、構造化文書更新部 109とを含む(図 3参 照)。  The structured document creation device includes an arithmetic unit 1, an input device 2, a storage device 3, and an output device 4. The arithmetic unit 1 is mainly composed of a CPU (Central Processing Unit), and includes a finding input capturing unit 101, an extended morphological analysis unit 102, a structured document conversion unit 103, a structured document output unit 104, and a dictionary. It includes an information update unit 105, an edit / input fetch unit 106, a search support unit 107, an item value prediction unit 108, and a structured document update unit 109 (see FIG. 3).
[0031] 所見入力取込み部 101は、医師によって入力された音声 (例えば病気または疾患 に関する所見文)を、内蔵する音声認識エンジンによってテキストに変換して拡張型 形態素解析部 102へ供給する。拡張型形態素解析部 102は、変換されたテキスト文 書を、専門用語と当該専門用語が属するカテゴリーとを記憶した単語辞書 33を参照 して形態素解析を行い、構造ィ匕文書変換部 103へ供給する。 The finding input capturing unit 101 converts speech (eg, a finding sentence relating to a disease or disease) input by a doctor into text by a built-in speech recognition engine, and supplies the text to the extended morphological analysis unit 102. The extended morphological analysis unit 102 refers the converted text document to a word dictionary 33 that stores technical terms and categories to which the technical terms belong. Then, the morphological analysis is performed, and the morphological analysis is supplied to the structure conversion unit 103.
[0032] 構造化文書変換部 103は、前記形態素解析して得られる単語と当該単語が属する カテゴリーの相関関係を抽出し、テキスト文書を相関関係に基づいて規格化された 定型フォーマットに従う構造ィ匕文書 (読影レポート)に変換して構造ィ匕文書出力部 10 4へ供給する。構造化文書変換部 103は、構造化文書に変換する際、相関関数演 算を行うことでテンプレート DB34に格納され用意されるテンプレートの絞込みを行う 。また、構造化文書出力部 104は、構造ィ匕文書変換部 103で変換して得られる構造 化文書を構造ィ匕文書 DB (構造ィ匕レポート DB) 312に蓄積すると共に、出力装置 4と して用意される表示画面等に出力する。  [0032] The structured document conversion unit 103 extracts the correlation between the word obtained by the morphological analysis and the category to which the word belongs, and converts the text document according to a standardized format standardized based on the correlation. The document is converted into a document (interpretation report) and supplied to the structure output unit 104. When converting into a structured document, the structured document conversion unit 103 narrows down the templates stored and prepared in the template DB 34 by performing a correlation function operation. The structured document output unit 104 accumulates the structured document obtained by the conversion in the structured document conversion unit 103 in the structured document DB (structured report DB) 312 and sets the output device 4 as the output device 4. Output to a display screen prepared in advance.
[0033] 一方、編集入力取込み部 106は、表示画面に出力された構造ィ匕文書を視認した医 師が編集のための入力を試みた場合にその入力を取り込み、辞書情報更新部 105 へ供給する機能を持つ。辞書情報更新部 105は、構造化文書 DB312に蓄積された 構造化文書を参照し、その編集履歴を用いて単語辞書 33 (301〔図 4参照〕 )を更新 する機能を持つ。  On the other hand, when a doctor who visually recognizes the structured document output on the display screen attempts input for editing, the edit input capturing unit 106 captures the input and supplies the input to the dictionary information updating unit 105. With the ability to The dictionary information updating unit 105 has a function of referring to the structured document stored in the structured document DB 312, and updating the word dictionary 33 (301 (see FIG. 4)) using the editing history.
[0034] また、診断知識 DB320には、構造化文書 DB312に蓄積された構造化文書に基づ き作成される診断知識体系が格納され、項目値予測部 108には、診断知識 DB320 から読み出される情報と、外部から入力され取り込まれる経過時間情報から症状の進 行を予測し、規格化された定型フォーマット (テンプレート)の項目欄に当てはめる機 能を持つ。検索支援部 107は、外部力も供給される検索要求に付随する入力文字 列から、構造化文書 DB312に蓄積された構造化文書を検索して該当する構造化文 書を検索出力する機能を持つ。また、構造化文書更新部 109は、変換して得られる 構造化文書を表示画面に視覚表示してキーボード等の入力装置 2からの医師等の 編集入力を促し、編集入力があつたときにその情報を取り込み、構造化文書が蓄積 された構造化文書 DB312の内容を更新する。  The diagnostic knowledge DB 320 stores a diagnostic knowledge system created based on the structured documents stored in the structured document DB 312, and the item value prediction unit 108 reads out the diagnostic knowledge system from the diagnostic knowledge DB 320. It has the function of predicting the progress of symptoms from information and the elapsed time information that is input and captured from the outside, and applies it to the item fields of a standardized standard format (template). The search support unit 107 has a function of searching for a structured document stored in the structured document DB 312 from an input character string accompanying a search request to which external force is also supplied, and searching and outputting the corresponding structured document. Further, the structured document updating unit 109 visually displays the structured document obtained by the conversion on a display screen, prompts an editing input by a doctor or the like from the input device 2 such as a keyboard, and when there is an editing input, Import the information and update the contents of the structured document DB312 where the structured documents are stored.
[0035] なお、ここでは、各種 DBとして、単語辞書 33 (単語辞書 301)、構造化文書 DB31 2、知識体系 DB (診断知識 DB320)、テンプレート DB34の一部のみ代表して示し た力 実際は、図 4、図 5に示される各種ファイルが記憶装置 3に DBとして構築され、 利用される。また、文境界判定モジュール 302等のように、演算装置 1を構成する CP Uに呼び出されて機能する各種プログラムモジュールも記憶装置 3に記憶されて用 意される。 [0035] Here, as various DBs, the word dictionary 33 (word dictionary 301), the structured document DB 312, the knowledge system DB (diagnostic knowledge DB 320), and the force represented as a part of the template DB 34 are actually: The various files shown in Figs. 4 and 5 are constructed and used as a DB in the storage device 3. Also, as in the sentence boundary determination module 302, etc. Various program modules that are called and function by U are also stored in the storage device 3 and provided.
[0036] なお、図 3の単語辞書 33については、図 4に示される様に汎用の単語辞書 301の 他に SR部分木バンク 314、あるいは、図 5 (a)に示されるように、マンモグラフィ、カテ 一テルラボ、心臓エコー等、分野別コンテクストグループ用語辞書 (専門用語が属す るカテゴリーを記憶)が用意され、また、テンプレート DB (34)には、図 5 (b)に示され るように、マンモグラフィ、カテーテルラボ、心臓エコー等、分野別 SRテンプレートが 用意されている。詳細は後記する。  As for the word dictionary 33 in FIG. 3, in addition to a general-purpose word dictionary 301 as shown in FIG. 4, an SR subtree bank 314, or as shown in FIG. A category group term dictionary (remembering the category to which the technical terms belong), such as Caterlab and cardiac echo, is prepared, and the template DB (34) contains, as shown in Fig. 5 (b), SR templates for each field, such as mammography, catheter lab, and echocardiography, are available. Details will be described later.
[0037] 以下、自由文(自由口述文)として入力される読影レポートを、分野ごとに決められ たある一定の構造に変換する場合の処理について、図 8に示すフローチャート、なら びに関連する図面(図 3や図 4等)を参照しながら説明する。  Hereinafter, a process for converting an interpretation report input as a free sentence (free dictation) into a certain structure determined for each field will be described with reference to a flowchart shown in FIG. This will be described with reference to FIGS. 3 and 4).
[0038] (S61 :文書分割処理(S61— S64 (第 1のステップ))  [0038] (S61: Document division processing (S61—S64 (first step))
本発明の構造化文書作成装置 (演算装置 1)は、まず (図 4参照)、文境界判定モジ ユール 302やパラグラフ境界判定モジュール 308に登録されて!、る文字列パターン マッチルールに従い、入力文章を、文集合の組合せに分割する。  First, the structured document creation device (arithmetic device 1) of the present invention (see FIG. 4) is configured to input text according to the character string pattern matching rules registered in the sentence boundary determination module 302 and the paragraph boundary determination module 308. Is divided into combinations of sentence sets.
文章分割処理は段階的に行い、例えば「診断部分」、「所見部分」、「指示部分」等 の文集合に大雑把に分割したあと、ノラグラフ境界判定モジュール 308において、文 集合の種類ごとに用意されているパターンマッチルールに従い、文章分割処理を再 帰的に繰り返す。  The sentence segmentation process is performed step by step, and roughly divided into sentence sets such as “diagnosis part”, “finding part”, and “instruction part”, and then prepared by the noragraph boundary determination module 308 for each sentence set type. The sentence segmentation process is recursively repeated according to the given pattern matching rules.
[0039] 例えば、「所見部分」であれば、より細力 、文集合として、「画像クオリティ」、「撮影 方向」、「撮影部位」、「石灰化」、「胸部構成」等の種類が考えられる。分割ルールは 、例えば、入力文章中に、「診断:」、「IMPRESSION」、「CT SCAN OF ABDOMENJ 等の文字列パターンが見つかれば、その前後で文集合を分離することができる。 文集合「診断」の範囲としては、「診断:」という文字列が出現し、次の別の手がかり パターンが出現するか文章の終わりまでとなる。パラグラフ境界判定モジュール 308 による文章分割処理は、再起ステップの何れかの段階により、適用ルールが存在せ ず、分割をそれ以上進めることが不可能な場合が存在する。その場合は、文境界判 定モジュール 302により文単位の分割処理を行い、文境界判定モジュール 302によ る判定処理が失敗すれば、当該領域の分割処理は終了する。 [0039] For example, in the case of the "finding portion", the types of "image quality", "imaging direction", "imaging site", "calcification", "chest structure", etc. are considered as sentence sets. Can be For example, if a character string pattern such as "diagnosis:", "IMPRESSION", or "CT SCAN OF ABDOMENJ" is found in an input sentence, a sentence set can be separated before and after the input rule. The character string "Diagnosis:" appears, and the next other clue pattern appears or until the end of the sentence. In the sentence division processing by the paragraph boundary determination module 308, there is a case where the application rule does not exist and the division cannot be further advanced due to any stage of the restart step. In this case, the sentence boundary determination module 302 performs the sentence division processing, and the sentence boundary determination module 302 If the determination process fails, the process of dividing the area ends.
前記した文章分割処理では、最小単位として文を仮定しているが、「明らかな悪性 所見は認めませんが、他の検査所見を参考にしてください」等のように、接続助詞「 一が」等により、文の途中で、別々の文集合に分けた方がいい場合はその限りではな い。  In the above sentence segmentation process, the sentence is assumed as the minimum unit, but the connecting particle `` Ichiga '', such as `` No obvious malignant findings are found, but please refer to other examination findings '' etc. This is not the case when it is better to divide a sentence into separate sentence sets in the middle of a sentence.
[0040] 各文集合は、「く paragraph〉' "〈/paragraph〉」等のタグで囲んでおくことにより表現で きる。ノラグラフ範囲を決定するのに用いた手がかりによっては、パラグラフの種類を 決疋でさ こと力ある。その 合は、「、paragraph type= conclusion〉」、「く paragraph type=〃procedure〃〉」等、 XMLの属性情報形式を利用して、パラグラフの種類を表現 で  [0040] Each sentence set can be expressed by surrounding it with tags such as "ku paragraph> '" // paragraph> ". Depending on the clues used to determine the noragraph range, the type of paragraph may be more specific to the criticism. In this case, the type of paragraph is expressed using the attribute information format of XML, such as “, paragraph type = conclusion>” and “ku paragraph type = 〃procedure〃〉”.
きる。「く paragraphノく sentence〉' ".〈/sentenceXsentence〉' " / sentence〉 /paragraph 〉」のように、入れ子になったタグで該当領域を囲むことで、階層的な分割過程を表現 することができる。  Wear. It is possible to express the hierarchical division process by enclosing the relevant area with nested tags, such as "ku paragraph no sentence> '". </ SentenceXsentence>' "/ sentence> / paragraph>". it can.
以上のような分解ステップを可能な限り繰り返すことで、可能な粒度で、特定内容記 述領域の集合に分解する。分解に必要な手がかりパターンは、記憶装置に割付けら れたパラグラフ境界判定モジュール 308、文境界判定モジュール 302に記録される。  By repeating the above decomposition steps as much as possible, the data is decomposed into a set of specific content description areas with a possible granularity. The clue patterns required for the decomposition are recorded in the paragraph boundary determination module 308 and the sentence boundary determination module 302 allocated to the storage device.
[0041] 文章分割処理で用いるルールは、図 1に記憶装置として示されるように、ユーザや 検査種別ごとに共通なものと、個別にカスタマイズされるものとに分けられる。標準フ ォーマットに従った構造ィ匕レポートを作成する上で利用可能な新たなデータおよびル ールに関しては、各サイトで得られた知見を必要に応じて同期を取り、共有すること で、構造ィ匕レポート構築環境を向上し合うことが可能である。  As shown in FIG. 1, the rules used in the sentence segmentation process are divided into those common to each user or examination type and those individually customized. New data and rules that can be used to create structural reports according to the standard format can be synchronized by sharing the knowledge obtained at each site as necessary, and sharing them. It is possible to improve each other's reporting environment.
[0042] (S62 :拡張型形態素解析処理)  (S62: Extended Morphological Analysis Processing)
入力としては、通常一文を仮定するが、一つの SRテンプレートに対応する内容が、 複数文で記述されることもあるため、複数文を対象とする場合もある。  Normally, one sentence is assumed as input. However, since the content corresponding to one SR template may be described in multiple sentences, it may be applied to multiple sentences.
[0043] 拡張型形態素解析では、「病名」、「部位」等の意味情報付き単語辞書および、コン テクストグループ用の単語辞書 33だけではなぐ SR部分木バンク 314を辞書として 用いる。 SR部分木バンク 314は、意味情報として SR部分木を持つ、意味情報付き 単語辞書とみなすことができる。単語辞書 301における単語力 「多数の SMALL Caldfication」等の単語範囲を超えた文字列に相当し、意味に相当するの力 対応す る SR部分木となる。単語辞書 301と SR部分木バンク 314を用いて、入力文章に対 する辞書引きと動的計画法を用いた最適パス探索 (松本裕治,北内啓,山下達雄, 平野善隆,松田寛,高岡一馬,浅原正幸,"日本語形態素解析システム『茶筌』 version 2.2.1使用説明書", Dec, 2000)により、入力文章を意味 (構造)情報付きの 単語 (文字列)シークェンス(図 9、符号 600)に分解することができる。 In the extended morphological analysis, a word dictionary with semantic information such as “disease name” and “part” and an SR subtree bank 314 that is not limited to the word dictionary 33 for a context group are used as dictionaries. The SR subtree bank 314 can be regarded as a word dictionary with semantic information having an SR subtree as semantic information. Word power in word dictionary 301 "Many SMALL It corresponds to a character string that exceeds the word range, such as "Caldfication," and has a power equivalent to its meaning. It is a corresponding SR subtree. Using the word dictionary 301 and the SR subtree bank 314, search for the optimal path using dictionary lookup and dynamic programming for input sentences (Yuji Matsumoto, Kei Kitauchi, Tatsuo Yamashita, Yoshitaka Hirano, Hiroshi Matsuda, Kazutaka Takaoka Ma, Masayuki Asahara, "Japanese morphological analysis system" ChaSen "version 2.2.1 Instruction Manual", Dec, 2000), a word (character string) sequence with meaning (structure) information (Fig. 9, code 600).
[0044] さらに読影レポートでは、表記ゆれも多いことから、形態素解析処理における辞書 引き処理を拡張する。完全一致を前提とする辞書引き処理だけでなぐ文字の欠損 や挿入に対しても辞書エントリー文字列と入力文字列の DPマッチングを行うことで、 一定範囲の表記ゆれを吸収して辞書引き処理を行うことができる。  [0044] Further, in the interpretation report, since there are many spelling variations, the dictionary lookup process in the morphological analysis process is extended. DP matching between the dictionary entry character string and the input character string is performed even for missing or inserted characters that cannot be obtained only by dictionary lookup processing that assumes perfect matching. It can be carried out.
例えば、辞書に「微細線状石灰化」という単語が登録されている場合、文章中に、 一文字欠けた「微線状石灰化」 ヽぅ文字列が出現した場合でも、正しく意味情報付 き単語 (文字列)シークェンスを得ることができる。同様の処理が、入力が「左側 MRI オーダー」で、辞書が「左側の MRIオーダー」という場合でも可能である。通常、形態 素解析に用いる辞書は Suffix Treeや Suffix Arrayのように、辞書引き速度を高速化す るために工夫したデータ構造を持つことが多いため、入力文字を用いた Suffix Treeの 遷移過程に、挿入、欠損、置き換えに対応する処理を加える方法が考えられる。  For example, if the word “fine linear calcification” is registered in the dictionary, “fine linear calcification” with one character missing in the sentence (String) You can get the sequence. A similar process is possible when the input is "left MRI order" and the dictionary is "left MRI order". Normally, dictionaries used for morphological analysis often have a data structure devised to speed up dictionary lookup, such as a Suffix Tree or Suffix Array, so the transition process of the Suffix Tree using input characters requires A method of adding processes corresponding to insertion, deletion, and replacement is conceivable.
[0045] この処理には、例えば「先頭文字の欠損、置き換えはないものとする」等の制約を 加えることで、対処できる表記ゆれを限定し、速度を確保することができる。また、 Suffix Treeの遷移過程において、置換不可能な文字列を定義することも有効である。 例えば、「多数」、「多量」の一文字置換は同一と見做せる可能性があるが、「多数」、 「少数」の一文字置換を同一と見做すことは重大な誤りを引き起こすと考えられる。以 上の処理は日本語に限ったものではなぐ所見文に多く見られる、英語と日本語が混 在している文や英文に対しても同様に行うことができる。  [0045] In this process, for example, by adding a constraint such as "there is no loss or replacement of the first character", it is possible to limit the spelling variation that can be dealt with and secure the speed. It is also effective to define a non-replaceable character string in the transition process of the Suffix Tree. For example, one-letter substitutions of "many" and "many" may be considered the same, but treating one-character replacements of "many" and "minority" as the same may cause a serious error. . The above processing can be applied to sentence where English and Japanese are mixed and English sentence which is often found in remarks not limited to Japanese.
[0046] (S63 :文節解析処理)  (S63: Clause analysis processing)
続いて、前記した拡張型形態素解析部 102の出力結果である単語 (文字列)シーク エンスのうち、単語シークェンスについて、文節解析モジュール 305により、文節構 成処理を行う。文節とは、一般的には動詞、名詞等の意味をもつ単語である自立語 シークェンスの後に、格助詞、助動詞等の単独では意味を持たない付属語シークェ ンスが続いた形をしている。英語に関しても、名詞節、動詞節と呼ばれる日本語の文 節に相当する単位を構成することができる。 Subsequently, the phrase analysis module 305 performs a phrase construction process on the word sequence among the word (character string) sequences output from the extended morphological analysis unit 102. Phrases are independent words, which are words that typically have meanings such as verbs and nouns. The sequence is followed by a sequence of adjuncts such as case particles and auxiliary verbs that have no meaning by themselves. Also in English, units corresponding to Japanese clauses called noun clauses and verb clauses can be constructed.
例えば、図 9に示す単語シークェンス 601、 602、 603は、文節 604 (「右乳房に」) となり、符合 605は、単独で文節 606 (「伴った」)となり、単語シークェンス 607、 608 、 609は、文節 610 (「腫瘍影を」)になり、符号 611、 612、 613は、文節 614 (「認め ます。」)となる。  For example, the word sequences 601, 602, and 603 shown in FIG. 9 become a phrase 604 ("to the right breast"), the sign 605 becomes a phrase 606 ("accompanying") alone, and the word sequences 607, 608, and 609 represent , Sentence 610 ("Tumor shadow"), and symbols 611, 612, and 613 become sentence 614 ("Accept.").
[0047] (S64 :文節種判定処理) [0047] (S64: phrase type determination processing)
次に、文節種判定モジュール 304によって、文節種判定処理を行う。それぞれの文 節は、構成要素である単語の持つ意味情報に従い、「病名」「部位」「指示」等の意味 情報や、単語が属する「コンテクストグループ識別情報」を持つことができる。マンモ グラフィ CAD分野にお!、て、 SRで定義されて!、るコンテクストグループの例として、 図 10に石灰化の種類用語に関するコンテクストグループを挙げる。  Next, the phrase type determination module 304 performs a phrase type determination process. Each clause can have semantic information such as “disease name”, “part”, “instruction”, etc., and “context group identification information” to which the word belongs, according to the semantic information of the word that is a constituent element. As an example of a context group defined in SR in the field of mammography CAD !, Fig. 10 shows a context group related to calcification type terms.
ここで、「ポップコーン状石灰化」という用語は、符号ィ匕体系「SRT」のバージョン 1. 1 において、「F-01761」というコードが振られているということがわかる。図 10のコンテク ストグループには、「CID6132」という IDが付与されている。図 9に示す例では、文節 6 04のコンテクストグループが CID6022で一般的には、検査部位を意味する。文節 61 0は、腫瘍の一種である「Mass」に分類され、動詞句である文節 614は、確信度を示 すものとして分類される。  Here, it can be seen that the term “popcorn-like calcification” is assigned the code “F-01761” in version 1.1 of the code-riding system “SRT”. The context group in Fig. 10 is assigned the ID "CID6132". In the example shown in FIG. 9, the context group of clause 604 is CID6022, which generally means an examination part. Clause 610 is classified as “Mass”, which is a type of tumor, and clause 614, a verb phrase, is classified as indicating certainty.
[0048] 文節種判定処理の第 2の方法として、別に定義した単語シークェンスパターンに従 い、当該文節に、「部位」「指示」等の意味情報を付与する。文節種判定処理の結果 は唯一である必要はなぐ複数保持することが可能である。文節種判定処理では、そ の他に、当該文節を要素として含む SRテンプレート IDを文節種として出力する。例 えば SRテンプレートにおいて、「確信度」に相当する要素は、 TID4002、 TID4005、 TID4006等に出現するため、符号 614の文節種として、前記した 3つの SRテンプレー HDを持つ。また、 SR部分木 615は、 TID4009という IDを持つが、 TID4009を要素と してもつ SRテンプレートには、 TID4006、 TID4005が存在する。文節種判定処理段階 では、複数候補を保持しておき、 SRテンプレートを用いた構造ィ匕処理において、適 当なものを選択する。 As a second method of the phrase type determination processing, semantic information such as “part” and “instruction” is added to the phrase according to a separately defined word sequence pattern. It is possible to hold multiple results of the phrase type determination process, without having to be unique. In the phrase type determination process, the SR template ID that includes the phrase as an element is output as the phrase type. For example, in the SR template, an element corresponding to “confidence” appears in TID4002, TID4005, TID4006, and the like, and thus has the above-mentioned three SR template HDs as the phrase type of reference numeral 614. Although the SR subtree 615 has an ID of TID4009, SR templates having TID4009 as elements include TID4006 and TID4005. In the phrase type determination processing stage, a plurality of candidates are held, and in the structure Choose the right one.
前記した拡張型形態素解析部 102による出力結果である単語 (文字列)シークェン スのうち、 SR部分木に対応する文字列部分については、 SR部分構造木情報を持ち 、文節種判定処理を行う必要はない。  Of the word (character string) sequence output by the extended morphological analysis unit 102, the character string portion corresponding to the SR subtree has SR substructure tree information and needs to perform a phrase type determination process. There is no.
[0049] 解析結果や途中の状態は、図 11、図 12に示す XML (extended Mark-up The analysis results and the status in the middle are shown in XML (extended Mark-up
Language)フォーマット等で記述しておくことができる。  Language) format or the like.
図 11、図 12中、符号 800は、入力文を表し、文境界も曖昧な状態である。符号 80 1は、前記した文章分割処理により、文範囲が定まった状態を表しており、各文は、タ グ "く文〉〃で囲まれた領域で表現される。符号 802は、前記した拡張型形態素解析 処理が終了した状態である、各単語の範囲が、タグ〃く形態素〉〃の囲む領域として 表現されている。符号 803は、文節構成処理が終了した状態を表しており、文節が、 タグ〃く節〉〃の囲む領域として表現されている。符号 804は、文節種判定処理が終 了した状態を表しており、各文節種が、 "<石灰化 '、 "<確信度 '、 "<個数 ' 、 " <病理 > "等のタグで囲まれた領域として表現されて 、る。  In FIGS. 11 and 12, reference numeral 800 represents an input sentence, and the sentence boundary is in an ambiguous state. Reference numeral 801 denotes a state in which the sentence range is determined by the above-described sentence division processing, and each sentence is represented by an area surrounded by a tag “ku sentence> 〃. The range of each word in a state where the extended morphological analysis processing has been completed is represented as an area surrounded by a tag {morphological element}. Reference numeral 803 indicates a state where the phrase construction processing has been completed. Is represented as an area surrounding the tag phrase.} Reference numeral 804 denotes a state in which the phrase type determination processing has been completed, and each phrase type is represented by “<calcification”, “<confidence level”. , “<Number”, “<pathology>”, etc.
[0050] (第 2のステップ (S65):テンプレート照合による構造ィ匕処理) [0050] (Second step (S65): Structure matching processing by template matching)
続いて、構造化文書作成部 103によりテンプレートに基づく構造化処理が実行され る。 SRテンプレートは IDを持ち、部分木構造を持つ。部分木の構成要素としては、 前記したコンテクストグループ IDや SRテンプレート IDを持ち(コンテクストグループ識 別情報及びテンプレート識別情報のうちの少なくとも一方が付与され)、前記した文 節種情報や SR部分木情報と照合することで、適用可能な SRテンプレートおよび SR テンプレート中の対応箇所を特定することができる。  Subsequently, the structured document creation unit 103 executes a structuring process based on the template. An SR template has an ID and a subtree structure. The subtree has the above-described context group ID and SR template ID (at least one of context group identification information and template identification information is added), and includes the above-described clause type information and SR subtree information. By matching with, the applicable SR template and the corresponding part in the SR template can be specified.
S64で行われる文節種判定処理により、近い距離にある一連の文節種あるいは SR 部分木が、文節種として同じテンプレート IDを持つ場合、例えば図 9に示す例では、 符号 616と符号 617では、共通のテンプレート IDである" 4005"をもつような場合、共 通テンプレート IDに従った部分木を構成する。それらの文節および SR部分木群は、 必ずしも連続する必要はな ヽ。図 9の S65で示すテンプレート照合型構造ィ匕処理で 入力解析結果との照合対象となる SRテンプレート自身も、自身を要素として持つテ ンプレートの IDをあら力じめ計算しておくことができる。文節種判定処理結果出力の みならず、テンプレート照合型構造化処理により新たに構成された SR部分木も同様 に、テンプレート ID候補が同じである文節あるいは SR部分木力 連続するか至近距 離に並んでいる場合、それらの要素をまとめて、当該テンプレートに対応する部分木 を構成する。同一文節種候補が連続して並ぶ場合、それらの文節群を部分木として まとめる。まとめられた文節群は、各テンプレートの意味に従い、「サイズに関する記 述」、「腫瘍の形態、症状に関する記述」、「個数の記述」、「次回検査までのインター バルの記述」等の内容を表する SR部分木となる。 According to the phrase type determination process performed in S64, when a series of phrase types or SR subtrees at close distances have the same template ID as the phrase type, for example, in the example shown in FIG. If it has a template ID of "4005", a subtree is constructed according to the common template ID. Those clauses and SR subtrees do not have to be continuous. The SR template itself to be compared with the input analysis result in the template matching type structure processing shown in S65 of FIG. 9 can also calculate the ID of the template having itself as an element at a glance . Of the phrase type judgment processing result output Similarly, if the SR subtree newly constructed by the template matching type structuring process is similar to the phrase or SR subtree whose template ID candidates are the same, The elements are put together to form a subtree corresponding to the template. When the same phrase type candidates are arranged consecutively, those phrase groups are put together as a subtree. According to the meaning of each template, the grouped clauses include the following contents: `` description on size '', `` description on tumor morphology and symptoms '', `` description of number '', `` description of interval until next test '', etc. The SR subtree to be represented.
文節種判定処理において、自身を要素として含むテンプレート IDを計算する際、 D ICOM画像ヘッダや、 RIS、 HIS等の病院システムを介して得られる検査種別、撮影 部位情報を用いることで、出力を変更することができる。例えば、対称所見文がマン モグラフィであることが分かれば、内容に関する部分使用される SRテンプレートは、 テンプレート ID4000— 4100番に絞られる。  When calculating the template ID that includes itself as an element in the phrase type determination processing, the output is changed by using the D ICOM image header, the examination type obtained through the hospital system such as RIS, HIS, and imaging part information. can do. For example, if the symmetric finding is found to be mammography, the SR template used for the content part will be narrowed down to template ID 4000-4100.
[0051] (第 3のステップ (S66) :構造ィ匕文書の出力)  [0051] (Third Step (S66): Output of Structured Document)
構造ィ匕文書を出力装置 4から出力する。出力装置 4は、表示画面、プリンタ、スピー 力、ネットワークインタフェイスカード等が該当する。  The document is output from the output device 4. The output device 4 corresponds to a display screen, a printer, speed, a network interface card, and the like.
[0052] (第 4のステップ (S67):構造ィ匕レポートの蓄積)  [0052] (Fourth Step (S67): Accumulation of Structured Report)
作成された構造化レポート (構造化文書)は、編集入力取込み部 106により医師の フィードバックを経て正しいことを確認した後、構造化文書出力部 104が、構造化レ ポート蓄積モジュール 318を実行することにより、構造ィ匕文書 DB (318)に蓄積され る。  After confirming that the created structured report (structured document) is correct via the doctor's feedback by the editing and inputting unit 106, the structured document output unit 104 executes the structured report storage module 318. Thus, it is stored in the structure document DB (318).
蓄積された構造ィ匕レポートは、部分構造抽出モジュール 313により、再利用可能な 部分木に分解され、対応する部分文字列とペアで SR部分木バンク 314に登録され る。これらの部分木は、図 1に共有データとして示すように、各読影サイトで蓄積され た SR部分木で利用性の高いものについて同期を取り、共有することで、構造化レポ ート構築環境を向上し合うことが可能である。  The stored structure report is decomposed into a reusable subtree by the substructure extraction module 313 and registered in the SR subtree bank 314 as a pair with a corresponding substring. As shown in Figure 1 as shared data, these subtrees are synchronized and shared for highly available SR subtrees stored at each interpretation site, creating a structured report construction environment. It is possible to improve each other.
[0053] 次に、診断知識と時系列情報の利用について説明する。診断知識 DB320と構造 化レポートを組み合わせることで、医師の入力 ·鑑別漏れ、検査示唆等のアラート機 能や、研修医の教育的人力支援を行うことができる。 例えば、診断知識 DB320に、「線維線種とのぅ胞の鑑別は、超音波が容易」という 知識があれば、「繊維線種」や「嚢胞」の所見入力がなされた場合に、他の可能性は な!、か確認を求めることや、 SRテンプレートの Recommendation部分に「超音波検査 を進める記述を、デフォルト値として設定できる」という知識があった場合、「スピキユラ を伴う腫瘍→良性「手術後瘢痕 (postsurgical scar),放射状瘢痕 (radial scar),脂肪壊 死 (fat necrosis),硬化性腺症 (sclerosing adenosis),脳瘍」、悪性「硬癌、浸潤性小葉 癌。手術や外傷の既往がはっきりしていて、その部位が所見に一致している場合を 除いて、精密検査適用」「境界不明瞭→Breast Compositionの問題力、腫瘤の性状 かの区別が重要」、「スピキユラを伴う腫瘤に脂肪濃度をみた場合→脂肪を巻き込ん で浸潤する硬癌と放射性瘢痕 (radial scar)とは鑑別しましたか?」「のう胞内に増殖 性の部分があっても、その部分の X線吸収値と、液体成分のそれとを区別することは できない。」「腫瘤の形状と悪性の度合い。円形→多角形→分葉状→不整形」「平滑 であって浸潤である、という境界は矛盾した表現」「マンモグラムのみで良性石灰化と 診断できるもの→皮膚、血管の石灰化、線維線種や入管拡張症に伴う石灰化、円形 石灰化、中心透過性石灰化、石灰乳石灰化等」「びまん性、領域性の分布は良性が 多い。区域性、線状の分布をとるものは悪性を十分に考慮する必要がある」「鑑別が 必要なものと、明らかな良性とを区別することが大事」「脂肪を腫瘤の構成要素として 含む→脂肪腫、過誤腫、乳瘤、オイルシスト、リンパ節」「乳癌脂肪成分なし。周辺組 織に浸潤して脂肪を巻き込むことはある」といった入力支援が可能になる。これらは 項目値予測部 108により処理される。 Next, use of diagnostic knowledge and time-series information will be described. By combining the diagnostic knowledge DB320 with the structured report, it is possible to provide an alert function for doctor input and omission of discrimination, suggestion of a test, etc., and educational human power support for trainees. For example, if the diagnostic knowledge DB320 has the knowledge that "ultrasonics are easy to distinguish cells from fibrous line types", if findings of "fibrous line types" or "cysts" are input, other What is the possibility! If there is knowledge that “the description to proceed with the ultrasound examination can be set as the default value” in the recommendation part of the SR template, “tumor with spikyura → benign” “postsurgical scar” ), Radial scar, fat necrosis, sclerosing adenosis, cerebral ulcer, malignant `` hard carcinoma, invasive lobular carcinoma. Except for the case where the site is consistent with the findings, application of detailed examination, `` unclear boundaries → It is important to distinguish the problem of Breast Composition, and the nature of the mass '', `` When the fat concentration is observed in a mass with spikyura → Did you distinguish between hard cancer, which involves fat and infiltrate it, and radiological scar? "" Even if there is a proliferative part in the cyst, the X-ray absorption value of that part and that of the liquid component Cannot be distinguished "The shape of the mass and the degree of malignancy. Circular → polygon → lobulated → irregularly shaped." The boundary of smoothness and infiltration is inconsistent. "" Mammamogram alone can diagnose benign calcification. " Calcification of skin and blood vessels, calcification associated with fibrosis and invasive dilatation, circular calcification, central permeability calcification, lime milk calcification, etc. "" Diffuse and regional distribution is often benign. For those with a linear distribution, it is necessary to sufficiently consider malignancy. ”“ It is important to distinguish between those that require differentiation and those that are clearly benign. ”“ Including fat as a component of a mass → lipoma It is possible to provide input support such as "hamartoma, mammary mass, oil cysts, and lymph nodes" and "no breast cancer fat component. Fat may be involved by infiltrating surrounding tissues." These are processed by the item value prediction unit 108.
[0054] 部分木テンプレートを用いた構造ィ匕文書作成処理にぉ 、て、入力文で記述されな い内容については、「異常なし」あるいは、前回の読影レポートの対応する値を用いる ことで、デフォルト値を与え、医師の入力負担を減らすこともできる。逆に入力文に記 述されている内容と、検査値や過去の読影レポートの対応項目との矛盾点を指摘す ることち有効である。 In the process of creating a structured document using a partial tree template, for the content not described in the input sentence, “no abnormality” or the corresponding value of the previous interpretation report is used. A default value can be given to reduce the doctor's input burden. Conversely, it is effective to point out inconsistencies between the contents described in the input sentence and the corresponding values in the test values and past radiology reports.
[0055] 次に、時系列情報の表示について説明する。前記したように、レポートを構造化す ることにより、レポートを項目別に時系列配置することが可能である。  Next, display of time-series information will be described. As described above, by structuring the report, it is possible to arrange the report in a time series for each item.
具体的に、図 18において、符号 1400は読影レポート作成日時(時間軸)を示し、 符号 1401は、レポート対象の部位を示す。また、符号 1402は、各部位における、石 灰化、腫瘍、その他の所見 (症状)を表す。 Specifically, in FIG. 18, reference numeral 1400 indicates the date and time (time axis) at which the image interpretation report was created. Reference numeral 1401 indicates a site to be reported. Reference numeral 1402 indicates calcification, tumor, and other findings (symptoms) at each site.
各所見はさらに、重篤度の度合いに応じて分類する。例えば、石灰化であれば、皮 膚、血管の石灰化、繊維線種、円形石灰化、中心透過性石灰化等の明らかな良性 石灰化に関するものから、重篤度の高い石灰化症例を並べる。符号 1404— 1408 はレポートを表し、対応する時間、部位、症状の位置に配置することで、過去に行つ てきたレポートを俯瞰することができる。新たにレポートを作成する場合、適当な過去 レポートを参照する場合にも都合がよ!、。  Each finding is further classified according to its severity. For example, in the case of calcification, calcification cases of high severity are listed from those related to obvious benign calcification such as calcification of skin and blood vessels, fiber type, circular calcification, and central calcification. . Reference numerals 1404 to 1408 denote reports. By placing them at the corresponding time, site, and symptom location, it is possible to have a bird's-eye view of reports that have been performed in the past. It is convenient to create a new report or refer to an appropriate past report!
[0056] 読影医の側カゝら積極的に行う「構造化読影レポート作成支援システム」のカスタマィ ズ方法として、 SR部分木バンク 314 (図 4参照)への登録モジュール(SRブロック登 録モジュール 316)と、コンテクストグループ用の単語辞書 33 (図 3参照)への同義語 •新規用語登録モジュール(コンテクストグループモジュール 315)とがある。 As a method of customizing the “structured interpretation report creation support system” that is actively performed by the interpreting doctor, a registration module (SR block registration module 316) in the SR subtree bank 314 (see FIG. 4) is used. ) And synonyms to the word dictionary 33 for the context group (see Fig. 3). • There is a new term registration module (context group module 315).
前者のモジュールでは、読影医がよく利用する表現とその構造化結果を、 SR部分 木バンク 314に、 SRブロック登録モジュール 316を用いて登録する。これにより、入 力文字列への貼り付け処理において、読影医がよく利用する表現とその構造ィ匕結果 の再利用が可能である(第 5のステップ)。後者のモジュールでは、例えば石灰化の 種類に関するコンテクストグループの、 "ポップコーン石灰化〃と同義語であれば、 " し ode bcheme Designator 、 Version 、 "し oae Value 、 「SRT、 1.1、 F- 017ol」で あり、 code meaningの表現が異なるエントリーをコンテクストグループモジュール 315 により登録する。また、石灰化の種類ではある力 図 10におけるいずれのエントリー の同義語でもない場合は、「USER、 1.0、 F-01770」等のように、重複しない新しい ID で登録する。  In the former module, expressions frequently used by the radiologist and their structured results are registered in the SR subtree bank 314 using the SR block registration module 316. This makes it possible to reuse the expression frequently used by the radiologist and the result of the structure in the pasting process to the input character string (fifth step). In the latter module, for example, in the context group on the type of calcification, if it is a synonym for "popcorn calcification," it is "shi ode bcheme Designator, Version," "shi oae Value," "SRT, 1.1, F-017ol." Yes, entries with different code meaning expressions are registered by the context group module 315. If the type of calcification is not a synonym for any of the entries in Figure 10, register with a new ID that does not overlap, such as "USER, 1.0, F-01770".
[0057] 提示された構造ィ匕結果に対して、読影医の側力もフィードバックを返す手段力 編 集入力取込み部 106である。例えば、図 7の" Individual Calcification"フィールドの値 力 "円形石灰化"となっているが (符号 501)、実際には石灰化症例に関するコンテ タストグループ(DICOM— SRでは、図 10に示すように CID6132番として定義されて いる)中の別の要素であった場合、別の項目(例えば、 "石灰乳石灰化")を選択する 、あるいは直接値を入力する等して、利用者の側で出力結果にフィードバックするこ とがでさる。 [0057] The side force of the interpreting physician is a means for returning feedback on the presented structure result, and is the means for editing and inputting of input means 106. For example, the value of the “Individual Calcification” field in FIG. 7 is “Circular calcification” (reference numeral 501), but in fact, a content group on calcification cases (DICOM—SR, as shown in FIG. 10) If it is another element (defined as CID No. 61313), select another item (eg, “lime calcification”), or enter the value directly, Feedback to output results It comes out.
[0058] 読影医の作業を利用した自動カスタマイズの方法に、構造化レポートの修正履歴を 用いる方法がある。構造ィ匕レポート編集モジュール 317により記録された構造ィ匕レポ ート修正履歴 DB (311)を基に、文節種判定モジュール 304のカスタマイズを行う。 例えば、「S7」という文字列が出現した際、文節種判定モジュール 304で取り扱わ れるパターンマッチルールのうち、「がんの進行度」のパターンと、日付(「昭和 7年」) を表すパターンの両方に適合する。読影医が「がんの進行度」のつもりで所見入力を 行っている場合、もし、文節種判定モジュール 304が、もともと「がんの進行度」と判定 すれば、特にカスタマイズは行われない。逆に、「昭和 7年」と判定した場合は、通常、 構造ィ匕レポートの編集入力取込み部 106により、読影医のフィードバックでパターン マッチルールが修正され、その修正パターンがレポート修正履歴 DB (311)に記録さ れる。蓄積された修正ログを用いて、修正ログ解析モジュールが、ルールの適用順序 を変更し、あるいはパターンの適用重みを変更することにより、各読影医の書き方を 一部カスタマイズが可能である。  As a method of automatic customization using the work of an interpreting doctor, there is a method of using a correction history of a structured report. The phrase type determining module 304 is customized based on the structural modification report history record DB (311) recorded by the structural modification report editing module 317. For example, when the character string “S7” appears, the pattern match rules handled by the phrase type determination module 304 include the “cancer progress” pattern and the date (“Showa 7”) pattern. Suitable for both. If the radiologist is inputting a finding with the intention of “cancer progress”, no customization is performed if the phrase type determination module 304 originally determines the “cancer progress”. Conversely, when it is determined to be “Showa 7”, the pattern matching rule is usually corrected by the input of the editing / inputting unit 106 of the structure editing report based on the feedback of the radiologist, and the corrected pattern is stored in the report correction history DB (311 ). Using the accumulated correction log, the correction log analysis module can partially customize the writing style of each radiologist by changing the application order of rules or changing the application weight of patterns.
[0059] このように、本発明によれば、医師の所見文入力を、診療分野毎の構造化された定 型フォーマットに、医師の負担が少ない形で、効率的に変換して記録することができ る。  [0059] As described above, according to the present invention, it is possible to efficiently convert and record a doctor's opinion sentence input into a structured fixed format for each medical care field with a small burden on the doctor. Can be done.
今後、放射線科の読影レポートシステムの入力段階において使用される「DICOM —SR対応読影レポート」を標準仕様としてパンフレットに書くことが必須仕様として求 められるようになると考えられ、この場合、前記した本発明は有効なツールとして活用 されることが期待できる。  In the future, it is expected that "DICOM —SR compatible interpretation report" used in the input stage of the radiology interpretation report system will be required to be written as a standard specification in a pamphlet as an essential specification. The invention can be expected to be used as an effective tool.
産業上利用の可能性  Possibility of industrial use
[0060] 本発明によれば、医師に負担をかけることなぐ標準形式に従った利便性の高い読 影レポートを作成し、また、知識として蓄積することができる装置を具現できる。 According to the present invention, it is possible to implement a device that can create a highly convenient interpretation report in a standard format without putting a burden on a doctor and accumulate it as knowledge.

Claims

請求の範囲 The scope of the claims
[1] 入力装置によって入力されるテキスト文書を演算装置が演算して構造ィ匕文書を作 成し、当該作成された構造化文書を出力装置に出力する構造化文書作成方法であ つて、  [1] A structured document creating method in which a computing device computes a text document input by an input device to create a structured document, and outputs the created structured document to an output device,
前記演算装置は、  The arithmetic unit includes:
入力されたテキスト文書について、専門用語と当該専門用語が属するカテゴリーと を記憶した単語辞書を参照して形態素解析を行う第 1のステップと、  A first step of performing a morphological analysis on the input text document by referring to a word dictionary storing technical terms and categories to which the technical terms belong;
前記形態素解析して得られる単語と当該単語が属するカテゴリーとの相関関係を 抽出し、前記テキスト文書を、前記相関関係に基づいて規格ィ匕されたフォーマットに 従う構造ィ匕文書に変換する第 2のステップと、  Extracting a correlation between a word obtained by the morphological analysis and a category to which the word belongs, and converting the text document into a structured document according to a standardized format based on the correlation. Steps and
前記変換して得られる構造化文書を前記出力装置へ出力する第 3のステップと、 を実行することを特徴とする構造化文書作成方法。  Outputting a structured document obtained by the conversion to the output device.
[2] 前記単語辞書は、 [2] The word dictionary is
コンテクストアイテムをノードに持ち、リレーションシップをアークとして持つ木構造で 表現された構造化文書で表現され、前記コンテンクストアイテムとリレーションシップ の部分的な構造にっ 、て、それぞれにコンテクストグループ識別情報及びテンプレ ート識別情報のうちの少なくとも一方が付与された部分木構造を持つことを特徴とす る請求項 1に記載の構造化文書作成方法。  It is represented by a structured document that has a context item as a node and has a tree structure with relationships as arcs. According to the partial structure of the context item and the relationship, context group identification information and 2. The structured document creation method according to claim 1, wherein the structured document creation method has a partial tree structure to which at least one of the template identification information is assigned.
[3] 前記第 1のステップは、 [3] The first step is:
あら力じめ用意される文字列パターンマッチルールに従い前記入力されたテキスト を文集合の組合せに分割するサブステップと、  Sub-step of dividing the input text into a set of sentence sets according to a string pattern match rule prepared in advance;
前記分割された文書の文節解析を行!ヽ、前記単語辞書から前記コンテクストダル ープ識別情報を持つ文節種情報を判定するサブステップと、  Performing a phrase analysis of the divided document! ヽ, and determining phrase type information having the context group identification information from the word dictionary;
を有することを特徴とする請求項 1に記載の構造化文書作成方法。  2. The structured document creation method according to claim 1, comprising:
[4] 前記第 2のステップは、 [4] The second step is:
前記判定された文節種情報毎、前記単語辞書の部分木構造と照合して該当するコ ンテクストグループ識別情報を抽出し、当該抽出されたコンテクストグループ識別情 報に基づき適用可能な定型文書および当該定型文書中の対応箇所を特定するサブ ステップ、 For each of the determined phrase type information, the corresponding context group identification information is extracted by collating with the subtree structure of the word dictionary, and the fixed form document and the fixed form applicable based on the extracted context group identification information. Sub to identify the corresponding part in the document Steps,
を有することを特徴とする請求項 3に記載の構造化文書作成方法。  4. The structured document creation method according to claim 3, comprising:
[5] 前記演算装置は、 [5] The arithmetic unit comprises:
前記変換して得られる構造化文書を記憶装置に蓄積する第 4のステップと、 前記記憶装置に蓄積された構造化文書のうち、利用頻度の高い構造化文書と入 力文字列との照合を行!、、当該照合が得られた構造化文書を前記入力文字列へ貼 り付ける第 5のステップと、  A fourth step of storing the structured document obtained by the conversion in a storage device, and comparing a structured document frequently used among the structured documents stored in the storage device with an input character string. , A fifth step of pasting the structured document obtained by the matching to the input character string,
をさらに実行することを特徴とする請求項 4に記載の構造化文書作成方法。  5. The structured document creation method according to claim 4, further comprising:
[6] 前記テキスト文書は病気または疾患に関する所見文、前記構造化文書は読影レポ ートであり、前記読影レポートは、前記病気または疾患の症状もしくは処置について の特定の項目に対し時系列的に並べて出力されること、 [6] The text document is a commentary on a disease or a disease, the structured document is a radiological report, and the radiological report is a chronological order for a specific item about the symptom or treatment of the disease or disease. Output side by side,
を特徴とする請求項 1に記載の構造化文書作成方法。  2. The structured document creation method according to claim 1, wherein:
[7] 入力装置によって入力されるテキスト文書を演算装置が演算して構造ィ匕文書を作 成し、前記作成された構造化文書を出力装置に出力する構造化文書作成装置であ つて、 [7] A structured document creation device that creates a structured document by calculating a text document input by an input device, and outputs the created structured document to an output device,
入力されたテキスト文書について、専門用語と当該専門用語が属するカテゴリーと を記憶した単語辞書を参照して形態素解析を行う形態素解析部と、  A morphological analysis unit that performs a morphological analysis on the input text document by referring to a word dictionary storing technical terms and categories to which the technical terms belong;
前記形態素解析して得られる単語と当該単語が属するカテゴリーの相関関係を抽 出し、前記テキスト文書を、前記相関関係に基づいて規格化された定型フォーマット に従う構造ィ匕文書に変換する構造ィ匕文書変換部と、  The structure obtained by extracting a correlation between a word obtained by the morphological analysis and a category to which the word belongs, and converting the text document into a structure formed according to a standardized format standardized based on the correlation. A conversion unit;
前記変換して得られる構造化文書を前記出力装置に出力する構造化文書出力部 と、  A structured document output unit for outputting the structured document obtained by the conversion to the output device;
を具備することを特徴とする構造化文書作成装置。  A structured document creation device, comprising:
[8] 前記変換して得られる構造ィヒ文書が蓄積される構造ィヒ文書記憶装置と、 [8] a structured document storage device in which the structured document obtained by the conversion is stored;
前記構造化文書記憶装置に蓄積された構造化文書を参照し、その編集履歴を用 V、て前記単語辞書を更新する辞書情報更新部と、  A dictionary information updating unit that updates the word dictionary by referring to the structured document stored in the structured document storage device and using the editing history thereof V;
を具備することを特徴とする請求項 7に記載の構造化文書作成装置。  8. The structured document creation device according to claim 7, comprising:
[9] 前記蓄積された構造ィヒ文書に基づき作成される知識辞書が記憶される記憶装置と 当該知識辞書から読み出される情報と、入力され取り込まれる経過時間情報から症 状の進行を予測し、前記規格化された定型フォーマットの項目欄に当てはめる項目 値予測部と、 [9] a storage device for storing a knowledge dictionary created based on the stored structured document; An item value prediction unit that predicts the progress of a symptom from the information read from the knowledge dictionary and the elapsed time information input and captured, and applies the information to the item column of the standardized standard format;
を具備することを特徴とする請求項 7に記載の構造化文書作成装置。  8. The structured document creation device according to claim 7, comprising:
[10] 検索要求に付随する入力文字列から、前記記憶装置に蓄積された構造化文書を 検索して該当する構造化文書を検索出力する検索支援部と、 [10] A search support unit that searches for a structured document stored in the storage device from an input character string attached to a search request, and searches and outputs a corresponding structured document.
を具備することを特徴とする請求項 7に記載の構造化文書作成装置。  8. The structured document creation device according to claim 7, comprising:
[11] 入力装置によって入力されるテキスト文書を演算装置が演算して構造ィ匕文書を作 成し、当該作成された構造化文書を表示装置の表示画面に出力する構造化文書作 成装置であって、 [11] A structured document creation device that computes a text document input by an input device to create a structured document, and outputs the created structured document to a display screen of a display device. So,
専門用語と当該専門用語が属するカテゴリーを記憶した単語辞書を参照して、入 力されたテキスト文書の形態素解析を行う形態素解析部と、  A morphological analysis unit that performs a morphological analysis of an input text document by referring to a word dictionary storing technical terms and a category to which the technical terms belong;
前記形態素解析して得られる単語と当該単語が属するカテゴリーの相関関係を抽 出し、前記テキスト文書を、前記相関関係に基づいて規格ィ匕されたフォーマットに従 う構造化文書に変換する構造化文書変換部と、  A structured document that extracts a correlation between a word obtained by the morphological analysis and a category to which the word belongs, and converts the text document into a structured document that follows a standardized format based on the correlation. A conversion unit;
前記変換して得られる構造化文書を前記表示画面に視覚表示して編集入力を促 し、前記編集入力があつたときにその情報を取り込み、前記構造化文書変換部によ つて変換された結果が蓄積される記憶装置の内容を更新する構造化文書更新部と、 を具備することを特徴とする構造化文書作成装置。  The structured document obtained by the conversion is visually displayed on the display screen to prompt an edit input, and when the edit input is received, the information is taken in, and the result of the conversion by the structured document conversion unit is obtained. And a structured document updating unit for updating the content of the storage device in which the stored document is stored.
[12] 前記入力装置は、 [12] The input device,
音声入力される自由口述文力 テキスト文書を作成する音声認識エンジンを含むこ とを特徴とする請求項 7に記載の構造化文書作成装置。  8. The structured document creation device according to claim 7, further comprising a speech recognition engine for creating a text dictation-free dictation ability.
PCT/JP2005/000102 2004-06-07 2005-01-07 Structurized document creation method, and device thereof WO2005122002A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2006514408A JP4649405B2 (en) 2004-06-07 2005-01-07 Structured document creation method and apparatus

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2004-168948 2004-06-07
JP2004168948 2004-06-07

Publications (2)

Publication Number Publication Date
WO2005122002A1 WO2005122002A1 (en) 2005-12-22
WO2005122002A2 true WO2005122002A2 (en) 2005-12-22

Family

ID=35503794

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2005/000102 WO2005122002A2 (en) 2004-06-07 2005-01-07 Structurized document creation method, and device thereof

Country Status (2)

Country Link
JP (1) JP4649405B2 (en)
WO (1) WO2005122002A2 (en)

Cited By (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007193399A (en) * 2006-01-17 2007-08-02 Konica Minolta Medical & Graphic Inc Information presenting system and program
JP2008146220A (en) * 2006-12-07 2008-06-26 Fuji Xerox Co Ltd Document creation support system, method of supporting document creation, and computer program
JP2008204012A (en) * 2007-02-16 2008-09-04 Univ Nagoya Report preparation device and report preparation program
JP2009064258A (en) * 2007-09-06 2009-03-26 Toshiba Corp Diagnostic reading report support system and method thereof
JP2009075951A (en) * 2007-09-21 2009-04-09 Fujifilm Corp Evaluation management system, device, and method
JP2009082443A (en) * 2007-09-28 2009-04-23 Canon Inc Diagnosis support device and control method thereof
JP2009093544A (en) * 2007-10-11 2009-04-30 Toshiba Corp Medical report apparatus
JP2009271621A (en) * 2008-05-01 2009-11-19 Fujifilm Corp Medical document preparation support device, medical document preparation support method and medical document preparation support program
JP2013000347A (en) * 2011-06-16 2013-01-07 Toshiba Corp Medical image processor
JP2013149267A (en) * 2013-03-25 2013-08-01 Konica Minolta Inc Diagnosis support system
US8626508B2 (en) 2009-02-26 2014-01-07 National University Corporation Toyohashi University Of Technology Speech search device and speech search method
JP2014501012A (en) * 2010-12-10 2014-01-16 コーニンクレッカ フィリップス エヌ ヴェ Clinical document debugging decision support
JP2014124228A (en) * 2012-12-25 2014-07-07 Toshiba Corp Medical information management apparatus and ultrasonic diagnostic apparatus
JP2014161511A (en) * 2013-02-25 2014-09-08 Hitachi Aloka Medical Ltd Medical image diagnosis device
JP2015533241A (en) * 2012-08-13 2015-11-19 エムモーダル アイピー エルエルシー Maintaining discrete data representations corresponding to information contained in free-form text
JP2017509946A (en) * 2014-01-30 2017-04-06 コーニンクレッカ フィリップス エヌ ヴェKoninklijke Philips N.V. Context-dependent medical data entry system
JP2017167433A (en) * 2016-03-17 2017-09-21 株式会社東芝 Summary generation device, summary generation method, and summary generation program
US9940715B2 (en) 2013-07-17 2018-04-10 Canon Kabushiki Kaisha Diagnosis support apparatus, method for the same, and non-transitory computer-readable storage medium
US9996510B2 (en) 2011-06-19 2018-06-12 Mmodal Ip Llc Document extension in dictation-based document generation workflow
JP2018521394A (en) * 2015-06-12 2018-08-02 コーニンクレッカ フィリップス エヌ ヴェKoninklijke Philips N.V. Apparatus, system and method for displaying semantically categorized timelines
US10325296B2 (en) 2010-09-23 2019-06-18 Mmodal Ip Llc Methods and systems for selective modification to one of a plurality of components in an engine
CN110023927A (en) * 2016-11-30 2019-07-16 谷歌有限责任公司 For the system and method for being applied to document will to be laid out
US10628476B2 (en) 2015-02-16 2020-04-21 Canon Kabushiki Kaisha Information processing apparatus, information processing method, information processing system, and storage medium
JP2020074020A (en) * 2020-01-08 2020-05-14 株式会社東芝 Summary generation device, summary generating method and summary generation program
WO2020241857A1 (en) 2019-05-31 2020-12-03 富士フイルム株式会社 Medical document creation device, method, and program, learning device, method, and program, and learned model
US10950329B2 (en) 2015-03-13 2021-03-16 Mmodal Ip Llc Hybrid human and computer-assisted coding workflow
US11043306B2 (en) 2017-01-17 2021-06-22 3M Innovative Properties Company Methods and systems for manifestation and transmission of follow-up notifications
CN113050987A (en) * 2021-03-26 2021-06-29 北京沃东天骏信息技术有限公司 Interface document generation method and device, storage medium and electronic equipment
US11232402B2 (en) 2010-02-26 2022-01-25 3M Innovative Properties Company Clinical data reconciliation as part of a report generation solution
US11282596B2 (en) 2017-11-22 2022-03-22 3M Innovative Properties Company Automated code feedback system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001290801A (en) * 2000-02-04 2001-10-19 Fujitsu Ltd Structured document system, structured documentation program, and computer-readable storage medium
JP2002215796A (en) * 2001-01-18 2002-08-02 Hitachi Medical Corp Diagnostic report making method

Cited By (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007193399A (en) * 2006-01-17 2007-08-02 Konica Minolta Medical & Graphic Inc Information presenting system and program
JP2008146220A (en) * 2006-12-07 2008-06-26 Fuji Xerox Co Ltd Document creation support system, method of supporting document creation, and computer program
JP2008204012A (en) * 2007-02-16 2008-09-04 Univ Nagoya Report preparation device and report preparation program
JP2009064258A (en) * 2007-09-06 2009-03-26 Toshiba Corp Diagnostic reading report support system and method thereof
JP2009075951A (en) * 2007-09-21 2009-04-09 Fujifilm Corp Evaluation management system, device, and method
JP2009082443A (en) * 2007-09-28 2009-04-23 Canon Inc Diagnosis support device and control method thereof
JP2009093544A (en) * 2007-10-11 2009-04-30 Toshiba Corp Medical report apparatus
JP2009271621A (en) * 2008-05-01 2009-11-19 Fujifilm Corp Medical document preparation support device, medical document preparation support method and medical document preparation support program
US8626508B2 (en) 2009-02-26 2014-01-07 National University Corporation Toyohashi University Of Technology Speech search device and speech search method
US11922373B2 (en) 2010-02-26 2024-03-05 3M Innovative Properties Company Clinical data reconciliation as part of a report generation solution
US11232402B2 (en) 2010-02-26 2022-01-25 3M Innovative Properties Company Clinical data reconciliation as part of a report generation solution
US10325296B2 (en) 2010-09-23 2019-06-18 Mmodal Ip Llc Methods and systems for selective modification to one of a plurality of components in an engine
JP2014501012A (en) * 2010-12-10 2014-01-16 コーニンクレッカ フィリップス エヌ ヴェ Clinical document debugging decision support
JP2013000347A (en) * 2011-06-16 2013-01-07 Toshiba Corp Medical image processor
US9996510B2 (en) 2011-06-19 2018-06-12 Mmodal Ip Llc Document extension in dictation-based document generation workflow
JP2015533241A (en) * 2012-08-13 2015-11-19 エムモーダル アイピー エルエルシー Maintaining discrete data representations corresponding to information contained in free-form text
US10156956B2 (en) 2012-08-13 2018-12-18 Mmodal Ip Llc Maintaining a discrete data representation that corresponds to information contained in free-form text
JP2014124228A (en) * 2012-12-25 2014-07-07 Toshiba Corp Medical information management apparatus and ultrasonic diagnostic apparatus
JP2014161511A (en) * 2013-02-25 2014-09-08 Hitachi Aloka Medical Ltd Medical image diagnosis device
JP2013149267A (en) * 2013-03-25 2013-08-01 Konica Minolta Inc Diagnosis support system
US9940715B2 (en) 2013-07-17 2018-04-10 Canon Kabushiki Kaisha Diagnosis support apparatus, method for the same, and non-transitory computer-readable storage medium
JP2017509946A (en) * 2014-01-30 2017-04-06 コーニンクレッカ フィリップス エヌ ヴェKoninklijke Philips N.V. Context-dependent medical data entry system
US10628476B2 (en) 2015-02-16 2020-04-21 Canon Kabushiki Kaisha Information processing apparatus, information processing method, information processing system, and storage medium
US10950329B2 (en) 2015-03-13 2021-03-16 Mmodal Ip Llc Hybrid human and computer-assisted coding workflow
JP2018521394A (en) * 2015-06-12 2018-08-02 コーニンクレッカ フィリップス エヌ ヴェKoninklijke Philips N.V. Apparatus, system and method for displaying semantically categorized timelines
JP2017167433A (en) * 2016-03-17 2017-09-21 株式会社東芝 Summary generation device, summary generation method, and summary generation program
US10540987B2 (en) 2016-03-17 2020-01-21 Kabushiki Kaisha Toshiba Summary generating device, summary generating method, and computer program product
CN110023927A (en) * 2016-11-30 2019-07-16 谷歌有限责任公司 For the system and method for being applied to document will to be laid out
CN110023927B (en) * 2016-11-30 2023-12-15 谷歌有限责任公司 System and method for applying layout to document
US11727206B2 (en) 2016-11-30 2023-08-15 Google Llc Systems and methods for applying layout to documents
US11043306B2 (en) 2017-01-17 2021-06-22 3M Innovative Properties Company Methods and systems for manifestation and transmission of follow-up notifications
US11699531B2 (en) 2017-01-17 2023-07-11 3M Innovative Properties Company Methods and systems for manifestation and transmission of follow-up notifications
US11282596B2 (en) 2017-11-22 2022-03-22 3M Innovative Properties Company Automated code feedback system
WO2020241857A1 (en) 2019-05-31 2020-12-03 富士フイルム株式会社 Medical document creation device, method, and program, learning device, method, and program, and learned model
JP2020074020A (en) * 2020-01-08 2020-05-14 株式会社東芝 Summary generation device, summary generating method and summary generation program
CN113050987A (en) * 2021-03-26 2021-06-29 北京沃东天骏信息技术有限公司 Interface document generation method and device, storage medium and electronic equipment

Also Published As

Publication number Publication date
JP4649405B2 (en) 2011-03-09
JPWO2005122002A1 (en) 2008-07-31

Similar Documents

Publication Publication Date Title
JP4649405B2 (en) Structured document creation method and apparatus
US20220199230A1 (en) Context driven summary view of radiology findings
CN108475538B (en) Structured discovery objects for integrating third party applications in an image interpretation workflow
JP6749835B2 (en) Context-sensitive medical data entry system
Noumeir Benefits of the DICOM structured report
JP5517524B2 (en) Medical diagnosis support apparatus, control method and program for medical diagnosis support apparatus
US6366683B1 (en) Apparatus and method for recording image analysis information
US9317580B2 (en) Imaging protocol update and/or recommender
EP1943641B1 (en) Method and system for processing dictated information
EP2478833A1 (en) Medical image display device, method, and program
US7418120B2 (en) Method and system for structuring dynamic data
JP2017191457A (en) Report creation apparatus and control method thereof
US20180330820A1 (en) Content-driven problem list ranking in electronic medical records
US20100010806A1 (en) Storage system for symptom information of Traditional Chinese Medicine (TCM) and method for storing TCM symptom information
JP2020205070A (en) Medial report creation device and control method therefor, medical report creation system, and program
JP7473314B2 (en) Medical information management device and method for adding metadata to medical reports
JPWO2009025022A1 (en) Information processing system and program
Jamoulle et al. Mapping French terms in a Belgian guideline on heart failure to international classifications and nomenclatures: the devil is in the detail.
JP2014160482A (en) Information processing apparatus, information process system, information processing method and program
WO2024116828A1 (en) Analysis device, program, analysis system, and analysis method
US20240177818A1 (en) Methods and systems for summarizing densely annotated medical reports
Sonntag et al. Design and implementation of a semantic dialogue system for radiologists
Zheng et al. Semantic extraction and processing of medical records for patient-oriented visual index
JP2022162368A (en) Diagnosis assistance device, diagnosis assistance method and diagnosis assistance program
WO2023001372A1 (en) Data-based clinical decision-making utilising knowledge graph

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

DPEN Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed from 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2006514408

Country of ref document: JP

NENP Non-entry into the national phase in:

Ref country code: DE

WWW Wipo information: withdrawn in national office

Country of ref document: DE

122 Ep: pct application non-entry in european phase