CN101512521A - 基于概念对语音文档的跨媒体索引和检索 - Google Patents
基于概念对语音文档的跨媒体索引和检索 Download PDFInfo
- Publication number
- CN101512521A CN101512521A CNA200780020395XA CN200780020395A CN101512521A CN 101512521 A CN101512521 A CN 101512521A CN A200780020395X A CNA200780020395X A CN A200780020395XA CN 200780020395 A CN200780020395 A CN 200780020395A CN 101512521 A CN101512521 A CN 101512521A
- Authority
- CN
- China
- Prior art keywords
- document
- vector
- phoneme
- speech
- documents
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/685—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3343—Query execution using phonetics
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- General Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Library & Information Science (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Artificial Intelligence (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (13)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US81078606P | 2006-06-02 | 2006-06-02 | |
US60/810,786 | 2006-06-02 | ||
PCT/US2007/012965 WO2007143109A2 (en) | 2006-06-02 | 2007-06-01 | Concept based cross media indexing and retrieval of speech documents |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101512521A true CN101512521A (zh) | 2009-08-19 |
CN101512521B CN101512521B (zh) | 2013-01-16 |
Family
ID=38802089
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200780020395XA Active CN101512521B (zh) | 2006-06-02 | 2007-06-01 | 基于概念对语音文档的跨媒体索引和检索 |
Country Status (6)
Country | Link |
---|---|
US (1) | US7716221B2 (zh) |
EP (1) | EP2030132A4 (zh) |
JP (1) | JP2009540398A (zh) |
CN (1) | CN101512521B (zh) |
CA (1) | CA2653932C (zh) |
WO (1) | WO2007143109A2 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106663429A (zh) * | 2014-03-10 | 2017-05-10 | 韦利通公司 | 提供音频录音以供内容资源中使用的引擎、系统和方法 |
Families Citing this family (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8442197B1 (en) | 2006-03-30 | 2013-05-14 | Avaya Inc. | Telephone-based user interface for participating simultaneously in more than one teleconference |
KR100893629B1 (ko) * | 2007-02-12 | 2009-04-20 | 주식회사 이지씨앤씨 | 전자교재 컨텐츠의 구문에 식별코드를 부여하는 시스템 및방법, 전자교재 컨텐츠의 데이터 검색 시스템 및 방법,전자교재 컨텐츠의 사용과 제공에 관한 포인트 관리 시스템및 방법 |
US7923790B1 (en) * | 2007-03-09 | 2011-04-12 | Silicon Laboratories Inc. | Planar microshells for vacuum encapsulated devices and damascene method of manufacture |
EP2245555A1 (fr) * | 2008-01-30 | 2010-11-03 | France Telecom | Procede d'identification d'un document multimedia dans une base de reference, programme d'ordinateur, et dispositif d'identification correspondants |
US8229729B2 (en) * | 2008-03-25 | 2012-07-24 | International Business Machines Corporation | Machine translation in continuous space |
US8301619B2 (en) * | 2009-02-18 | 2012-10-30 | Avaya Inc. | System and method for generating queries |
JP4735726B2 (ja) * | 2009-02-18 | 2011-07-27 | ソニー株式会社 | 情報処理装置および方法、並びにプログラム |
US8621011B2 (en) | 2009-05-12 | 2013-12-31 | Avaya Inc. | Treatment of web feeds as work assignment in a contact center |
US8437266B2 (en) * | 2009-08-26 | 2013-05-07 | Avaya Inc. | Flow through call control |
US8756215B2 (en) * | 2009-12-02 | 2014-06-17 | International Business Machines Corporation | Indexing documents |
US8903794B2 (en) | 2010-02-05 | 2014-12-02 | Microsoft Corporation | Generating and presenting lateral concepts |
US8983989B2 (en) | 2010-02-05 | 2015-03-17 | Microsoft Technology Licensing, Llc | Contextual queries |
US8260664B2 (en) | 2010-02-05 | 2012-09-04 | Microsoft Corporation | Semantic advertising selection from lateral concepts and topics |
US8150859B2 (en) | 2010-02-05 | 2012-04-03 | Microsoft Corporation | Semantic table of contents for search results |
CN101984424A (zh) * | 2010-10-26 | 2011-03-09 | 浙江工商大学 | 海量跨媒体索引方法 |
KR101252397B1 (ko) * | 2011-06-02 | 2013-04-08 | 포항공과대학교 산학협력단 | 웹을 이용한 정보 검색 방법 및 이를 사용하는 음성 대화 방법 |
GB2494455B (en) * | 2011-09-09 | 2015-06-10 | British Broadcasting Corp | Processing audio-video data to produce metadata |
GB2505400B (en) * | 2012-07-18 | 2015-01-07 | Toshiba Res Europ Ltd | A speech processing system |
US9594542B2 (en) * | 2013-06-20 | 2017-03-14 | Viv Labs, Inc. | Dynamically evolving cognitive architecture system based on training by third-party developers |
US9633317B2 (en) * | 2013-06-20 | 2017-04-25 | Viv Labs, Inc. | Dynamically evolving cognitive architecture system based on a natural language intent interpreter |
US10083009B2 (en) | 2013-06-20 | 2018-09-25 | Viv Labs, Inc. | Dynamically evolving cognitive architecture system planning |
US10474961B2 (en) | 2013-06-20 | 2019-11-12 | Viv Labs, Inc. | Dynamically evolving cognitive architecture system based on prompting for additional user input |
KR102437689B1 (ko) * | 2015-09-16 | 2022-08-30 | 삼성전자주식회사 | 음성 인식 서버 및 그 제어 방법 |
CN106095893B (zh) * | 2016-06-06 | 2018-11-20 | 北京大学深圳研究生院 | 一种跨媒体检索方法 |
CN107273517B (zh) * | 2017-06-21 | 2021-07-23 | 复旦大学 | 基于图嵌入学习的图文跨模态检索方法 |
CN111339261A (zh) * | 2020-03-17 | 2020-06-26 | 北京香侬慧语科技有限责任公司 | 一种基于预训练模型的文档抽取方法及系统 |
CN113239237B (zh) * | 2021-07-13 | 2021-11-30 | 北京邮电大学 | 跨媒体大数据搜索方法及装置 |
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4839853A (en) | 1988-09-15 | 1989-06-13 | Bell Communications Research, Inc. | Computer information retrieval using latent semantic structure |
JPH02195400A (ja) * | 1989-01-24 | 1990-08-01 | Canon Inc | 音声認識装置 |
US5301109A (en) * | 1990-06-11 | 1994-04-05 | Bell Communications Research, Inc. | Computerized cross-language document retrieval using latent semantic indexing |
US5941944A (en) * | 1997-03-03 | 1999-08-24 | Microsoft Corporation | Method for providing a substitute for a requested inaccessible object by identifying substantially similar objects using weights corresponding to object features |
US5974412A (en) | 1997-09-24 | 1999-10-26 | Sapient Health Network | Intelligent query system for automatically indexing information in a database and automatically categorizing users |
CA2366057C (en) * | 1999-03-05 | 2009-03-24 | Canon Kabushiki Kaisha | Database annotation and retrieval |
US7151942B1 (en) * | 1999-05-04 | 2006-12-19 | Mci, Llc | Advertisement broadcasting for paging |
US6757646B2 (en) * | 2000-03-22 | 2004-06-29 | Insightful Corporation | Extended functionality for an inverse inference engine based web search |
US6615208B1 (en) | 2000-09-01 | 2003-09-02 | Telcordia Technologies, Inc. | Automatic recommendation of products using latent semantic indexing of content |
US7006969B2 (en) * | 2000-11-02 | 2006-02-28 | At&T Corp. | System and method of pattern recognition in very high-dimensional space |
US7113943B2 (en) * | 2000-12-06 | 2006-09-26 | Content Analyst Company, Llc | Method for document comparison and selection |
US7124081B1 (en) * | 2001-09-28 | 2006-10-17 | Apple Computer, Inc. | Method and apparatus for speech recognition using latent semantic adaptation |
US6985861B2 (en) * | 2001-12-12 | 2006-01-10 | Hewlett-Packard Development Company, L.P. | Systems and methods for combining subword recognition and whole word recognition of a spoken input |
US6877001B2 (en) * | 2002-04-25 | 2005-04-05 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for retrieving documents with spoken queries |
US7542966B2 (en) * | 2002-04-25 | 2009-06-02 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for retrieving documents with spoken queries |
US7152065B2 (en) | 2003-05-01 | 2006-12-19 | Telcordia Technologies, Inc. | Information retrieval and text mining using distributed latent semantic indexing |
JP2006048289A (ja) * | 2004-08-03 | 2006-02-16 | Sony Corp | 情報処理装置および方法、並びにプログラム |
US7765098B2 (en) * | 2005-04-26 | 2010-07-27 | Content Analyst Company, Llc | Machine translation using vector space representations |
US7475063B2 (en) * | 2006-04-19 | 2009-01-06 | Google Inc. | Augmenting queries with synonyms selected using language statistics |
-
2007
- 2007-06-01 EP EP07777361A patent/EP2030132A4/en not_active Ceased
- 2007-06-01 JP JP2009513300A patent/JP2009540398A/ja active Pending
- 2007-06-01 CA CA2653932A patent/CA2653932C/en active Active
- 2007-06-01 US US11/809,455 patent/US7716221B2/en active Active
- 2007-06-01 CN CN200780020395XA patent/CN101512521B/zh active Active
- 2007-06-01 WO PCT/US2007/012965 patent/WO2007143109A2/en active Application Filing
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106663429A (zh) * | 2014-03-10 | 2017-05-10 | 韦利通公司 | 提供音频录音以供内容资源中使用的引擎、系统和方法 |
Also Published As
Publication number | Publication date |
---|---|
CN101512521B (zh) | 2013-01-16 |
US20070299838A1 (en) | 2007-12-27 |
CA2653932C (en) | 2013-03-19 |
EP2030132A4 (en) | 2010-07-14 |
WO2007143109A3 (en) | 2009-05-07 |
CA2653932A1 (en) | 2007-12-13 |
JP2009540398A (ja) | 2009-11-19 |
US7716221B2 (en) | 2010-05-11 |
WO2007143109A2 (en) | 2007-12-13 |
EP2030132A2 (en) | 2009-03-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101512521B (zh) | 基于概念对语音文档的跨媒体索引和检索 | |
Larson et al. | Spoken content retrieval: A survey of techniques and technologies | |
US7292979B2 (en) | Time ordered indexing of audio data | |
US8126897B2 (en) | Unified inverted index for video passage retrieval | |
US20140136197A1 (en) | Accuracy improvement of spoken queries transcription using co-occurrence information | |
US20120131060A1 (en) | Systems and methods performing semantic analysis to facilitate audio information searches | |
JP2011529600A (ja) | 意味ベクトルおよびキーワード解析を使用することによるデータセットを関係付けるための方法および装置 | |
Clavel et al. | Spontaneous speech and opinion detection: mining call-centre transcripts | |
JP2007241888A (ja) | 情報処理装置および方法、並びにプログラム | |
Ogata et al. | Automatic transcription for a web 2.0 service to search podcasts | |
Koumpis et al. | Content-based access to spoken audio | |
Carrive et al. | Transdisciplinary analysis of a corpus of French newsreels: The ANTRACT Project | |
Vaiani et al. | Leveraging multimodal content for podcast summarization | |
Fersini et al. | Semantics and machine learning: A new generation of court management systems | |
Chen et al. | Exploring the use of unsupervised query modeling techniques for speech recognition and summarization | |
Hazen et al. | Speech-based annotation and retrieval of digital photographs. | |
Jong et al. | Access to recorded interviews: A research agenda | |
Sen et al. | Audio indexing | |
Naveen et al. | Abstractive text summarizer: A comparative study on dot product attention and cosine similarity | |
Nouza et al. | Large-scale processing, indexing and search system for Czech audio-visual cultural heritage archives | |
Gilbert et al. | Speech and language processing over the web | |
Tiwari et al. | Marathi speech database standardization: A review and work | |
Mizuno et al. | A similar content retrieval method for podcast episodes | |
Chien et al. | A spoken‐access approach for chinese text and speech information retrieval | |
Koržinek et al. | Automatic transcription of the polish newsreel |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
ASS | Succession or assignment of patent right |
Owner name: TELCORDIA TECH INC. Free format text: FORMER OWNER: TURLEKODIYA TECHNOLOGY CO., LTD. Effective date: 20100813 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20100813 Address after: new jersey Applicant after: Telcordia Tech Inc. Address before: new jersey Applicant before: Telcordia Technologies, Inc. |
|
ASS | Succession or assignment of patent right |
Owner name: TTI INVENTION A LLC Free format text: FORMER OWNER: TELCORDIA TECH INC. Effective date: 20120515 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20120515 Address after: Delaware Applicant after: Telcordia Tech Inc. (US) Address before: new jersey Applicant before: Telcordia Tech Inc. |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |