CN109388717A - Method and system for batch generation of corpora - Google Patents
- Publication number
- CN109388717A (application number CN201810803666.9A)
- Authority
- CN
- China
- Prior art keywords
- clause
- word
- corpus
- situation
- library
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Landscapes
- Machine Translation (AREA)
Abstract
Description
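Word name | Item | Word category | Type | Part of speech | Number of synonyms |
---|---|---|---|---|---|
小鹿叮叮 (brand name) | Project brand | 小鹿叮叮 | Entity word | Noun | 8 |
品牌 (brand) | Brand | All categories | Other word | Noun | 6 |
满送 (gift on minimum spend) | Promotion type | E-commerce | Entity word | Noun | 2 |
600 | Discount condition | E-commerce | Entity word | Quantity phrase | 4 |
满减券 (minimum-spend discount coupon) | Coupon details | E-commerce | Short-phrase word | Noun | 7 |
是否是 (whether it is) | Whether | All categories | Sentence-pattern word | --- | 3 |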
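The table above reads like a set of word-library records. As a minimal sketch only, it could be held in memory as shown below. The class name `WordEntry`, its field names, and the helper `group_by_type` are hypothetical illustrations and are not taken from the patent; the row values are copied from the table, and the `---` part-of-speech cell is modeled as `None`.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class WordEntry:
    """One row of the word table (field names are illustrative assumptions)."""
    name: str                      # word name
    item: str                      # item the word belongs to
    category: str                  # word category
    word_type: str                 # type, e.g. entity word or sentence-pattern word
    part_of_speech: Optional[str]  # None where the table shows '---'
    synonym_count: int             # number of synonyms


# Rows copied from the table, kept in the original Chinese.
ENTRIES: List[WordEntry] = [
    WordEntry("小鹿叮叮", "项目品牌", "小鹿叮叮", "实体词", "名词", 8),
    WordEntry("品牌", "品牌", "全分类", "其他词", "名词", 6),
    WordEntry("满送", "活动类型", "电商", "实体词", "名词", 2),
    WordEntry("600", "优惠条件", "电商", "实体词", "数量短语", 4),
    WordEntry("满减券", "优惠券详情", "电商", "短句词", "名词", 7),
    WordEntry("是否是", "是否", "全分类", "句式词", None, 3),
]


def group_by_type(entries: List[WordEntry]) -> Dict[str, List[str]]:
    """Group word names by their type column, preserving table order."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for entry in entries:
        groups[entry.word_type].append(entry.name)
    return dict(groups)


if __name__ == "__main__":
    for word_type, names in group_by_type(ENTRIES).items():
        print(word_type, names)
```

Grouping by the type column is just one plausible way to query such records; the source does not say how the word library is indexed.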
Claims (11)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810803666.9A CN109388717B (zh) | 2018-07-20 | 2018-07-20 | Method and system for batch generation of corpora |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810803666.9A CN109388717B (zh) | 2018-07-20 | 2018-07-20 | Method and system for batch generation of corpora |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109388717A (zh) | 2019-02-26 |
CN109388717B (zh) | 2021-04-20 |
Family
ID=65417470
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810803666.9A Active CN109388717B (zh) | Method and system for batch generation of corpora | 2018-07-20 | 2018-07-20 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109388717B (zh) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8433708B2 (en) * | 2008-09-16 | 2013-04-30 | Kendyl A. Román | Methods and data structures for improved searchable formatted documents including citation and corpus generation |
CN106649280A (zh) * | 2017-02-13 | 2017-05-10 | 长沙军鸽软件有限公司 | Method for creating a shared corpus |
CN106709072A (zh) * | 2017-02-13 | 2017-05-24 | 长沙军鸽软件有限公司 | Method for obtaining intelligent conversation reply content based on a shared corpus |
CN106874451A (zh) * | 2017-02-13 | 2017-06-20 | 长沙军鸽软件有限公司 | Method for automatically building a personal dedicated corpus |
CN107004000A (zh) * | 2016-06-29 | 2017-08-01 | 深圳狗尾草智能科技有限公司 | Corpus generation apparatus and method |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
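CN110399499A (zh) * | 2019-07-18 | 2019-11-01 | 珠海格力电器股份有限公司 | Corpus generation method and apparatus, electronic device, and readable storage medium |
CN110399499B (zh) * | 2019-07-18 | 2022-02-18 | 珠海格力电器股份有限公司 | Corpus generation method and apparatus, electronic device, and readable storage medium |
CN110491394A (zh) * | 2019-09-12 | 2019-11-22 | 北京百度网讯科技有限公司 | Method and apparatus for acquiring wake-up corpus |
CN110491394B (zh) * | 2019-09-12 | 2022-06-17 | 北京百度网讯科技有限公司 | Method and apparatus for acquiring wake-up corpus |
CN110750989A (zh) * | 2019-10-28 | 2020-02-04 | 北京金山数字娱乐科技有限公司 | Sentence analysis method and apparatus |
CN110750989B (zh) * | 2019-10-28 | 2023-09-19 | 北京金山数字娱乐科技有限公司 | Sentence analysis method and apparatus |
CN111027308A (zh) * | 2019-11-06 | 2020-04-17 | 厦门快商通科技股份有限公司 | Text generation method and system, mobile terminal, and storage medium |
CN113127610A (zh) * | 2019-12-31 | 2021-07-16 | 北京猎户星空科技有限公司 | Data processing method, apparatus, device, and medium |
CN113127610B (zh) * | 2019-12-31 | 2024-04-19 | 北京猎户星空科技有限公司 | Data processing method, apparatus, device, and medium |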
Also Published As
Publication number | Publication date |
---|---|
CN109388717B (zh) | 2021-04-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109388717A (zh) | | Method and system for batch generation of corpora |
Pavlick et al. | | Simple PPDB: A paraphrase database for simplification |
Nakayama et al. | | Is culture of origin associated with more expressions? An analysis of Yelp reviews on Japanese restaurants |
CN105095319B (zh) | | System for identifying, associating, searching, and presenting documents based on time serialization |
US20060078862A1 (en) | | Answer support system, answer support apparatus, and answer support program |
US20140136541A1 (en) | | Mining Semi-Structured Social Media |
Bosc et al. | | DART: A dataset of arguments and their relations on Twitter |
CN105339936A (zh) | | Text matching apparatus and method, and text classification apparatus and method |
Shi et al. | | Online public opinion during the first epidemic wave of COVID-19 in China based on Weibo data |
CN103984771B (zh) | | Method for extracting geographic points of interest from English microblogs and sensing their temporal trends |
WO2019200705A1 (zh) | | Method and apparatus for automatically generating cloze test questions |
CN110490686A (zh) | | Time-aware product rating model construction and recommendation method and system |
Ray et al. | | Utilizing emotion scores for improving classifier performance for predicting customer's intended ratings from social media posts |
Bayraktar et al. | | An analysis of English punctuation: The special case of comma |
Ji et al. | | Discussing environmental issues in Chinese social media: An analysis of Greenpeace China's Weibo posts and audience responses |
KR102140253B1 (ko) | | Method and system for providing user-customized public knowledge information based on chatbot communication |
SG193613A1 (en) | | Text analyzing device, problematic behavior extraction method, and problematic behavior extraction program |
CN109800418A (zh) | | Text processing method, apparatus, and storage medium |
JP2012248187A (ja) | | Search result providing system and search result providing method for a loanword pronunciation search service |
Krommyda et al. | | Emotion detection in Twitter posts: a rule-based algorithm for annotated data acquisition |
CN111190965B (zh) | | Ad hoc relationship analysis system and method based on text data |
Kathirvelu et al. | | Voice Recognition Chat bot for Consumer Product Applications |
CN103678720B (zh) | | Method and apparatus for processing user feedback data |
Suryawanshi et al. | | Sentiment analyzer using machine learning |
CN102346777A (zh) | | Method and apparatus for ranking example-sentence retrieval results |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
CB02 | Change of applicant information | Address after: Room 15a08, No. 6, Financial Third Street, Wuxi Economic Development Zone, Jiangsu Province. Applicant after: Smart point (Wuxi) Technology Co.,Ltd. Address before: 100084 SOHOB709, Zhongguancun, Haidian District, Beijing. Applicant before: BEIJING ABITAI TECHNOLOGY Co.,Ltd. |
TA01 | Transfer of patent application right | Effective date of registration: 20200714. Address after: 310051 15/F, main building, Hengxin building, 588 Jiangnan Avenue, Binjiang District, Hangzhou City, Zhejiang Province. Applicant after: Hangzhou Guangyun Technology Co.,Ltd. Address before: Room 15a08, No. 6, Financial Third Street, Wuxi Economic Development Zone, Jiangsu Province. Applicant before: Smart point (Wuxi) Technology Co.,Ltd. |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 20210608. Address after: Room 1207, 12/F, main building, Hengxin building, 588 Jiangnan Avenue, Changhe street, Binjiang District, Hangzhou City, Zhejiang Province 310051. Patentee after: Hangzhou kuaixiaozhi Technology Co.,Ltd. Address before: 310051 15/F, main building, Hengxin building, 588 Jiangnan Avenue, Binjiang District, Hangzhou City, Zhejiang Province. Patentee before: Hangzhou Guangyun Technology Co.,Ltd. |