CN106776533A - 用于分析一段文本的方法和系统 - Google Patents
用于分析一段文本的方法和系统 Download PDFInfo
- Publication number
- CN106776533A CN106776533A CN201510953092.XA CN201510953092A CN106776533A CN 106776533 A CN106776533 A CN 106776533A CN 201510953092 A CN201510953092 A CN 201510953092A CN 106776533 A CN106776533 A CN 106776533A
- Authority
- CN
- China
- Prior art keywords
- text
- unique
- module unit
- grade
- character
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 43
- 238000013500 data storage Methods 0.000 claims description 3
- 238000009434 installation Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 6
- 239000000203 mixture Substances 0.000 description 3
- 238000004891 communication Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000012358 sourcing Methods 0.000 description 2
- 108010022579 ATP dependent 26S protease Proteins 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003012 network analysis Methods 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B7/00—Electrically-operated teaching apparatus or devices working with questions and answers
- G09B7/02—Electrically-operated teaching apparatus or devices working with questions and answers of the type wherein the student is expected to construct an answer to the question which is presented or wherein the machine gives an answer to the question presented by a student
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/226—Validation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/237—Lexical tools
- G06F40/247—Thesauruses; Synonyms
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/253—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/20—Education
- G06Q50/205—Education administration or guidance
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Business, Economics & Management (AREA)
- Educational Technology (AREA)
- Educational Administration (AREA)
- Tourism & Hospitality (AREA)
- Databases & Information Systems (AREA)
- Strategic Management (AREA)
- Human Resources & Organizations (AREA)
- Economics (AREA)
- Data Mining & Analysis (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- General Business, Economics & Management (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
Description
Claims (17)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
HK15111493.2 | 2015-11-20 | ||
HK15111493.2A HK1210371A2 (zh) | 2015-11-20 | 2015-11-20 | 種分析文本的方法和系統 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106776533A true CN106776533A (zh) | 2017-05-31 |
CN106776533B CN106776533B (zh) | 2021-05-07 |
Family
ID=55747663
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510953092.XA Active CN106776533B (zh) | 2015-11-20 | 2015-12-16 | 用于分析一段文本的方法和系统 |
Country Status (10)
Country | Link |
---|---|
US (1) | US10755594B2 (zh) |
JP (1) | JP6693032B2 (zh) |
CN (1) | CN106776533B (zh) |
CA (1) | CA2926953C (zh) |
HK (1) | HK1210371A2 (zh) |
MY (1) | MY195702A (zh) |
PH (1) | PH12018550064A1 (zh) |
SG (1) | SG10201509744UA (zh) |
TW (1) | TWI686714B (zh) |
WO (1) | WO2017084238A1 (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190317979A1 (en) * | 2017-12-14 | 2019-10-17 | Sang C. Lee | Tripartite poetry paradigm |
CN111914093A (zh) * | 2019-05-09 | 2020-11-10 | 深圳中兴飞贷金融科技有限公司 | 数据处理方法和装置,存储介质和电子设备 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000148767A (ja) * | 1998-11-05 | 2000-05-30 | Nippon Telegr & Teleph Corp <Ntt> | 文書重要文ランキング方法、文書重要文ランキング装置、及び文書重要文ランキングプログラムを記録した記録媒体 |
CN1673996A (zh) * | 2004-03-24 | 2005-09-28 | 无敌科技股份有限公司 | 一种识别语言文本难易度的系统及其方法 |
US7165264B1 (en) * | 2001-07-26 | 2007-01-16 | Digeo, Inc. | Client-side tool for splitting or truncating text strings for interactive television |
CN101539923A (zh) * | 2008-03-18 | 2009-09-23 | 北京搜狗科技发展有限公司 | 从文档中提取正文片段的方法及装置 |
CN104615772A (zh) * | 2015-02-16 | 2015-05-13 | 重庆大学 | 一种用于电子商务的文本评价数据专业程度分析方法 |
Family Cites Families (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5870608A (en) * | 1994-06-03 | 1999-02-09 | Synopsys, Inc. | Method and apparatus for displaying text including context sensitive information derived from parse tree |
US5724498A (en) * | 1995-06-07 | 1998-03-03 | Adobe Systems Incorporated | Method and apparatus for concealing character modifications made for text composition purposes |
US5794177A (en) * | 1995-07-19 | 1998-08-11 | Inso Corporation | Method and apparatus for morphological analysis and generation of natural language text |
US6154757A (en) * | 1997-01-29 | 2000-11-28 | Krause; Philip R. | Electronic text reading environment enhancement method and apparatus |
TW364966B (en) * | 1997-07-15 | 1999-07-21 | Inventec Corp | Automatic syntax analysis method for Chinese |
US6120297A (en) * | 1997-08-25 | 2000-09-19 | Lyceum Communication, Inc. | Vocabulary acquistion using structured inductive reasoning |
US7069508B1 (en) * | 2000-07-13 | 2006-06-27 | Language Technologies, Inc. | System and method for formatting text according to linguistic, visual and psychological variables |
US6658377B1 (en) * | 2000-06-13 | 2003-12-02 | Perspectus, Inc. | Method and system for text analysis based on the tagging, processing, and/or reformatting of the input text |
US7103848B2 (en) * | 2001-09-13 | 2006-09-05 | International Business Machines Corporation | Handheld electronic book reader with annotation and usage tracking capabilities |
US7313513B2 (en) * | 2002-05-13 | 2007-12-25 | Wordrake Llc | Method for editing and enhancing readability of authored documents |
US20050069849A1 (en) * | 2003-09-30 | 2005-03-31 | Iode Design | Computer-based method of improving reading comprehension |
JP4304146B2 (ja) | 2004-12-01 | 2009-07-29 | 株式会社東芝 | 辞書登録装置、辞書登録方法および辞書登録プログラム |
US8608477B2 (en) * | 2006-04-06 | 2013-12-17 | Vantage Technologies Knowledge Assessment, L.L.C. | Selective writing assessment with tutoring |
JP2008129475A (ja) * | 2006-11-23 | 2008-06-05 | Osamu Asai | 音声教材 |
TW200825778A (en) * | 2006-12-12 | 2008-06-16 | Inventec Besta Co Ltd | Hand-held reading device and the reading assistant method thereof |
GB2446427A (en) * | 2007-02-07 | 2008-08-13 | Sharp Kk | Computer-implemented learning method and apparatus |
US20090228777A1 (en) * | 2007-08-17 | 2009-09-10 | Accupatent, Inc. | System and Method for Search |
US8306356B1 (en) * | 2007-09-28 | 2012-11-06 | Language Technologies, Inc. | System, plug-in, and method for improving text composition by modifying character prominence according to assigned character information measures |
US8136034B2 (en) * | 2007-12-18 | 2012-03-13 | Aaron Stanton | System and method for analyzing and categorizing text |
US8463594B2 (en) * | 2008-03-21 | 2013-06-11 | Sauriel Llc | System and method for analyzing text using emotional intelligence factors |
CN101540041B (zh) | 2008-03-21 | 2012-06-27 | 中国科学院计算技术研究所 | 一种扫描文档浏览适配方法 |
US8320674B2 (en) | 2008-09-03 | 2012-11-27 | Sony Corporation | Text localization for image and video OCR |
US8606796B2 (en) * | 2008-09-15 | 2013-12-10 | Kilac, LLC | Method and system for creating a data profile engine, tool creation engines and product interfaces for identifying and analyzing files and sections of files |
JP2010256821A (ja) * | 2009-04-28 | 2010-11-11 | Sci-Tec:Kk | 学習支援システム |
US20100311030A1 (en) * | 2009-06-03 | 2010-12-09 | Microsoft Corporation | Using combined answers in machine-based education |
US20110123967A1 (en) * | 2009-11-24 | 2011-05-26 | Xerox Corporation | Dialog system for comprehension evaluation |
US8892421B2 (en) * | 2010-12-08 | 2014-11-18 | Educational Testing Service | Computer-implemented systems and methods for determining a difficulty level of a text |
JP2012208143A (ja) * | 2011-03-29 | 2012-10-25 | Hideki Aikawa | オンライン学習システム |
CN102497270B (zh) | 2011-12-24 | 2014-07-16 | 桂林电子科技大学 | 一类规范化文档的加密方法 |
CN103186911B (zh) | 2011-12-28 | 2015-07-15 | 北大方正集团有限公司 | 一种处理扫描书数据的方法及装置 |
CN102662952B (zh) | 2012-03-02 | 2015-04-15 | 成都康赛信息技术有限公司 | 一种基于层次的中文文本并行数据挖掘方法 |
CN104462207B (zh) * | 2014-11-03 | 2017-07-11 | 陕西师范大学 | 面向分布式学习环境的多片段学习资源标注方法 |
RU2580424C1 (ru) * | 2014-11-28 | 2016-04-10 | Общество С Ограниченной Ответственностью "Яндекс" | Способ выявления незначащих лексических единиц в текстовом сообщении и компьютер |
US9563613B1 (en) * | 2015-01-23 | 2017-02-07 | Sprint Communications Company L.P. | System and method for dynamic portable document file generation |
CN107291683A (zh) * | 2016-04-11 | 2017-10-24 | 珠海金山办公软件有限公司 | 一种拼写检查方法及装置 |
US11615104B2 (en) * | 2016-09-26 | 2023-03-28 | Splunk Inc. | Subquery generation based on a data ingest estimate of an external data system |
US11250371B2 (en) * | 2016-09-26 | 2022-02-15 | Splunk Inc. | Managing process analytics across process components |
US11604795B2 (en) * | 2016-09-26 | 2023-03-14 | Splunk Inc. | Distributing partial results from an external data system between worker nodes |
US11106681B2 (en) * | 2018-09-28 | 2021-08-31 | Splunk Inc. | Conditional processing based on inferred sourcetypes |
-
2015
- 2015-11-20 HK HK15111493.2A patent/HK1210371A2/zh not_active IP Right Cessation
- 2015-11-26 SG SG10201509744UA patent/SG10201509744UA/en unknown
- 2015-12-01 TW TW104140236A patent/TWI686714B/zh active
- 2015-12-16 CN CN201510953092.XA patent/CN106776533B/zh active Active
-
2016
- 2016-04-11 JP JP2018525475A patent/JP6693032B2/ja active Active
- 2016-04-11 MY MYPI2018701910A patent/MY195702A/en unknown
- 2016-04-11 WO PCT/CN2016/079003 patent/WO2017084238A1/en active Application Filing
- 2016-04-12 CA CA2926953A patent/CA2926953C/en active Active
- 2016-04-15 US US15/130,761 patent/US10755594B2/en active Active
-
2018
- 2018-05-16 PH PH12018550064A patent/PH12018550064A1/en unknown
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000148767A (ja) * | 1998-11-05 | 2000-05-30 | Nippon Telegr & Teleph Corp <Ntt> | 文書重要文ランキング方法、文書重要文ランキング装置、及び文書重要文ランキングプログラムを記録した記録媒体 |
US7165264B1 (en) * | 2001-07-26 | 2007-01-16 | Digeo, Inc. | Client-side tool for splitting or truncating text strings for interactive television |
CN1673996A (zh) * | 2004-03-24 | 2005-09-28 | 无敌科技股份有限公司 | 一种识别语言文本难易度的系统及其方法 |
CN101539923A (zh) * | 2008-03-18 | 2009-09-23 | 北京搜狗科技发展有限公司 | 从文档中提取正文片段的方法及装置 |
CN104615772A (zh) * | 2015-02-16 | 2015-05-13 | 重庆大学 | 一种用于电子商务的文本评价数据专业程度分析方法 |
Also Published As
Publication number | Publication date |
---|---|
MY195702A (en) | 2023-02-06 |
TWI686714B (zh) | 2020-03-01 |
CA2926953A1 (en) | 2017-05-20 |
SG10201509744UA (en) | 2017-06-29 |
US10755594B2 (en) | 2020-08-25 |
HK1210371A2 (zh) | 2016-04-15 |
US20170148337A1 (en) | 2017-05-25 |
JP2018538615A (ja) | 2018-12-27 |
WO2017084238A1 (en) | 2017-05-26 |
JP6693032B2 (ja) | 2020-05-13 |
PH12018550064A1 (en) | 2018-11-12 |
CN106776533B (zh) | 2021-05-07 |
TW201719450A (zh) | 2017-06-01 |
CA2926953C (en) | 2022-08-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10755185B2 (en) | Rating difficulty of questions | |
US11226968B2 (en) | Providing search result content tailored to stage of project and user proficiency and role on given topic | |
US9633309B2 (en) | Displaying quality of question being asked a question answering system | |
Bergsma et al. | Stylometric analysis of scientific articles | |
CN110674271B (zh) | 一种问答处理方法及装置 | |
CN104471568A (zh) | 对自然语言问题的基于学习的处理 | |
CN109359290B (zh) | 试题文本的知识点确定方法、电子设备及存储介质 | |
CN110991163B (zh) | 一种文档比对分析方法、装置、电子设备及存储介质 | |
CN111159405B (zh) | 基于背景知识的讽刺检测方法 | |
Sijimol et al. | Handwritten short answer evaluation system (HSAES) | |
CN106776533A (zh) | 用于分析一段文本的方法和系统 | |
Rintyarna et al. | Automatic ranking system of university based on technology readiness level using LDA-Adaboost. MH | |
JP2017021523A (ja) | 用語意味コード判定装置、方法、及びプログラム | |
US10261993B1 (en) | Adaptable text analytics platform | |
Shweta et al. | Comparative study of feature engineering for automated short answer grading | |
Shekhar et al. | Computational linguistic retrieval framework using negative bootstrapping for retrieving transliteration variants | |
CN109933788B (zh) | 类型确定方法、装置、设备和介质 | |
Almuayqil et al. | Towards an ontology-based fully integrated system for student e-assessment | |
CN110717029A (zh) | 一种信息处理方法和系统 | |
Ardini et al. | Development of mobile application through the concept of artificial intelligence to enhance pronunciation skill in EFL | |
Netisopakul et al. | The state of knowledge extraction from text for thai language | |
EP4163815A1 (en) | Textual content evaluation using machine learned models | |
CN110083817A (zh) | 一种命名排歧方法、装置、计算机可读存储介质 | |
US20230281388A1 (en) | A Method and System for Analyzing a Piece of Text Comprising Chinese Characters | |
Long | A Grammatical Error Correction Model for English Essay Words in Colleges Using Natural Language Processing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1232633 Country of ref document: HK |
|
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Address after: Room G, 22 floor, 4 Hyun court, Hai Yi Peninsula, Hongkong, China. Applicant after: Fortune asset Company Limited Address before: 601 / F, Malaysia Building, 50 Gloucester Road, Wanchai, Hongkong, China 601 Applicant before: Fortune asset Company Limited |
|
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP02 | Change in the address of a patent holder | ||
CP02 | Change in the address of a patent holder |
Address after: Room 15a, 15 / F, building 1, Furong garden, Mongkok, Kowloon, Hong Kong, China Patentee after: Chrysus Intellectual Properties Ltd. Address before: Room G, 22 floor, 4 Hyun court, Hai Yi Peninsula, Hongkong, China. Patentee before: Chrysus Intellectual Properties Ltd. |