WO2002077898A2 - Method and apparatus for identifying sounds - Google Patents
Method and apparatus for identifying sounds Download PDFInfo
- Publication number
- WO2002077898A2 WO2002077898A2 PCT/FI2002/000239 FI0200239W WO02077898A2 WO 2002077898 A2 WO2002077898 A2 WO 2002077898A2 FI 0200239 W FI0200239 W FI 0200239W WO 02077898 A2 WO02077898 A2 WO 02077898A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sound
- sounds
- computer
- archive
- sound sample
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 14
- 230000001755 vocal effect Effects 0.000 claims 1
- 230000015654 memory Effects 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 206010011878 Deafness Diseases 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000005562 fading Methods 0.000 description 1
- 230000005923 long-lasting effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
Definitions
- the present invention relates to a method and apparatus for identifying sounds. Though the principal intention is to identify particularly birds on the basis of their calls, there is nothing to prevent the invention from being used to identify other sounds too.
- Birdcalls in their various forms for example, calls marking territories, calls searching for a mate, and other typical calls have been recorded in natural conditions and collected, for example, on commercial CD records.
- the idea is that when a birdwatcher hears an unknown birdcall, it will be possible to listen to the CD, to identify the call heard as being that of a specific bird.
- the present invention is intended to create a method and apparatus, by means of which the sounds of birds and possibly also other animals can be identified with a great degree of certainty, as quickly as possible, while also bringing the advantage that one interested in birds no longer needs to rely on fading memories of the nature and type of sounds.
- Figure 1 shows one embodiment of the invention as a schematic diagram
- Figure 2 shows another embodiment of the invention, also as a schematic diagram.
- the device according to Figure 1 comprises a device 2 for capturing sounds.
- This device is usually a directional microphone, which can be moved to search for the birdcall, or similar sound it is wished to monitor. This sound is then listened to using earphones 5, or any other suitable device and, once an apparently suitable sound of suitable quality is heard, it can be forwarded, using a cable 4 or similar and a device 6, to the apparatus 3, in which it is recorded and in which it can be further processed.
- the microphone apparatus or device 6 can also include a memory, which permits a sound sample to be recorded and to be sent at a later stage. It is then possible to exploit the fact that only once a completely satisfactory sound sample has been obtained is it sent for comparison.
- the data is sent with the aid of the device 6 to be processed elsewhere, in which case a powerful apparatus 3 located in suitable premises receives the data and carries out the analysis and returns the result of the analysis using the aforesaid transmission/reception channel.
- the apparatus 3 is naturally a computer-type apparatus and particularly a computer with a large processing capacity, in the memory of which samples of all the birdcalls that may be required are recorded.
- the device 6 could be, for example, a portable computer, which includes a suitable component for wireless data transfer.
- a cheaper device 6, which is available to every user is a mobile telephone, which has suitable properties for transmitting sound data wirelessly.
- the sound captured and recorded in the apparatus 3 is now compared with the sound samples of the sound archive and the result of the comparison is shown on a display device, either the display of a microcomputer or the display of a telephone.
- Figure 2 shows an alternative, in which, in place of the microcomputer 6 shown in Figure 1 , a mobile telephone is used to send and receive the data.
- the data can be send and received using any known protocol.
- the embodiments of Figures 1 and 2 do not differ from each other, because the sending and reception of data to and from the computer 6 will most probably take place using mobile telephone technology.
- the microphone 2 is connected using a suitable adapter to the mobile telephone 6, from which a connection is opened to the apparatus 3 containing the sound archive.
- the sound sample is sent to the apparatus 3, which analyses the sample and returns the identification data to the mobile telephone, either during the same connection, or, for instance, as a text message, once identification has been carried out.
- a suitable price is calculated for the aforesaid services, which is charged, for example, directly in the mobile telephone bill.
- the reception of image data is also entirely possible and indeed sensible.
- the server 3 not only sends information as to what bird is in question, but also sends one or several pictures of the said songster.
- it is also possible to send other information concerning the bird such as information on its living environment, habits and distribution, etc.
- it is also possible to send an image to a mobile telephone though the displays of present mobile telephones generally do not provide a sufficiently clear image to permit the identification of a bird.
- the system according to the invention can, as such, be used not only for direct identification, but also as an archive, from which it is possible to retrieve, for example, a sound sample of a bird and possibly a picture, without having to enter a sound sample in it. In that case, only a simple query according to a certain protocol need be made.
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Investigating Or Analyzing Materials By The Use Of Ultrasonic Waves (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2002242763A AU2002242763A1 (en) | 2001-03-27 | 2002-03-21 | Method and apparatus for identifying sounds |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FI20010632 | 2001-03-27 | ||
FI20010632A FI20010632A (fi) | 2001-03-27 | 2001-03-27 | Menetelmä ja laitteisto äänien tunnistamiseksi |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2002077898A2 true WO2002077898A2 (en) | 2002-10-03 |
WO2002077898A3 WO2002077898A3 (en) | 2003-06-26 |
Family
ID=8560854
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/FI2002/000239 WO2002077898A2 (en) | 2001-03-27 | 2002-03-21 | Method and apparatus for identifying sounds |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU2002242763A1 (fi) |
FI (1) | FI20010632A (fi) |
WO (1) | WO2002077898A2 (fi) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2422044A (en) * | 2005-01-11 | 2006-07-12 | Pariff Llc | Identifying bird vocalisation by hierarchical analysis of family and species |
WO2009155348A1 (en) | 2008-06-17 | 2009-12-23 | Pandion Systems, Inc. | System and method for detecting bats and their impact on wind facilities |
US10832672B2 (en) | 2018-07-13 | 2020-11-10 | International Business Machines Corporation | Smart speaker system with cognitive sound analysis and response |
US10832673B2 (en) | 2018-07-13 | 2020-11-10 | International Business Machines Corporation | Smart speaker device with cognitive sound analysis and response |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2089597A1 (en) * | 1993-02-16 | 1994-08-17 | Douglas G. Bain | Apparatus for audio identification of a bird |
EP0813186A2 (en) * | 1996-06-14 | 1997-12-17 | Masaomi Yamamoto | Animal's intention translational method |
US5956463A (en) * | 1993-06-15 | 1999-09-21 | Ontario Hydro | Audio monitoring system for assessing wildlife biodiversity |
WO2002006922A2 (en) * | 2000-07-19 | 2002-01-24 | Identity Concepts, Llc | Method and apparatus for identifying a subject |
US20020116195A1 (en) * | 2000-11-03 | 2002-08-22 | International Business Machines Corporation | System for selling a product utilizing audio content identification |
-
2001
- 2001-03-27 FI FI20010632A patent/FI20010632A/fi unknown
-
2002
- 2002-03-21 AU AU2002242763A patent/AU2002242763A1/en not_active Abandoned
- 2002-03-21 WO PCT/FI2002/000239 patent/WO2002077898A2/en not_active Application Discontinuation
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2089597A1 (en) * | 1993-02-16 | 1994-08-17 | Douglas G. Bain | Apparatus for audio identification of a bird |
US5956463A (en) * | 1993-06-15 | 1999-09-21 | Ontario Hydro | Audio monitoring system for assessing wildlife biodiversity |
EP0813186A2 (en) * | 1996-06-14 | 1997-12-17 | Masaomi Yamamoto | Animal's intention translational method |
WO2002006922A2 (en) * | 2000-07-19 | 2002-01-24 | Identity Concepts, Llc | Method and apparatus for identifying a subject |
US20020116195A1 (en) * | 2000-11-03 | 2002-08-22 | International Business Machines Corporation | System for selling a product utilizing audio content identification |
Non-Patent Citations (3)
Title |
---|
BRIGHT L ET AL: "Efficient remote data access in a mobile computing environment" PROCEEDINGS 2000. INTERNATIONAL WORKSHOP ON PARALLEL PROCESSING, PROCEEDINGS 2000. INTERNATIONAL WORKSHOP ON PARALLEL PROCESSING, TORONTO, ONT., CANADA, 21-24 AUG. 2000, pages 57-64, XP002222483 2000, Los Alamitos, CA, USA, IEEE Comput. Soc, USA ISBN: 0-7695-0771-9 * |
DATABASE INSPEC [Online] INSTITUTE OF ELECTRICAL ENGINEERS, STEVENAGE, GB; BAINBRIDGE D ET AL: "Towards a digital library of popular music" Database accession no. 7048293 XP002222485 & DIGITAL 99 LIBRARIES. FOURTH ACM CONFERENCE ON DIGITAL LIBRARIES, PROCEEDINGS OF 1999 CONFERENCE ON DIGITAL LIBRARIES, BERKLEY, CA, USA, 11-14 AUG. 1999, pages 161-169, 1999, New York, NY, USA, ACM, USA ISBN: 1-58113-145-3 * |
MCILRAITH A L ET AL: "Bird song identification using artificial neural networks and statistical analysis" CCECE '97. CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING. ENGINEERING INNOVATION: VOYAGE OF DISCOVERY. CONFERENCE PROCEEDINGS (CAT. NO.97TTH8244), CCECE '97. CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING. ENGINEERING INNOVAT, pages 63-66 vol.1, XP002222484 1997, New York, NY, USA, IEEE, USA ISBN: 0-7803-3716-6 * |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2422044A (en) * | 2005-01-11 | 2006-07-12 | Pariff Llc | Identifying bird vocalisation by hierarchical analysis of family and species |
US7377233B2 (en) | 2005-01-11 | 2008-05-27 | Pariff Llc | Method and apparatus for the automatic identification of birds by their vocalizations |
US7963254B2 (en) | 2005-01-11 | 2011-06-21 | Pariff Llc | Method and apparatus for the automatic identification of birds by their vocalizations |
WO2009155348A1 (en) | 2008-06-17 | 2009-12-23 | Pandion Systems, Inc. | System and method for detecting bats and their impact on wind facilities |
EP2316006A4 (en) * | 2008-06-17 | 2017-03-08 | Normandeau Associates, Inc. | System and method for detecting bats and their impact on wind facilities |
US10832672B2 (en) | 2018-07-13 | 2020-11-10 | International Business Machines Corporation | Smart speaker system with cognitive sound analysis and response |
US10832673B2 (en) | 2018-07-13 | 2020-11-10 | International Business Machines Corporation | Smart speaker device with cognitive sound analysis and response |
US11631407B2 (en) | 2018-07-13 | 2023-04-18 | International Business Machines Corporation | Smart speaker system with cognitive sound analysis and response |
Also Published As
Publication number | Publication date |
---|---|
FI20010632A0 (fi) | 2001-03-27 |
AU2002242763A1 (en) | 2002-10-08 |
FI20010632A (fi) | 2002-09-28 |
WO2002077898A3 (en) | 2003-06-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6404860B1 (en) | System and method for internet call management with text-to-speech messaging | |
KR101954550B1 (ko) | 음량조절 방법, 시스템, 디바이스 및 컴퓨터 저장매체 | |
US9721287B2 (en) | Method and system for interacting with a user in an experimental environment | |
CN102779179B (zh) | 一种信息关联的方法及终端 | |
US7920158B1 (en) | Individual participant identification in shared video resources | |
US20070160365A1 (en) | Image capture system, handheld terminal device, and image server | |
EP1587291A3 (en) | Enhanced caller ID information based on access device information via a broadband access gateway | |
EP0782296A3 (en) | Securing transmission and receipt of electronic data | |
US20070266092A1 (en) | Conferencing system with automatic identification of speaker | |
JP2008113418A (ja) | データを中央でストリングする方法 | |
EP1139663A3 (en) | Communication method, communication service apparatus, communication terminal device and communication system | |
CN111416758A (zh) | 智慧家居实时对讲系统及方法 | |
CN105389318B (zh) | 一种信息处理方法及电子设备 | |
CN111311774A (zh) | 基于语音识别的签到方法及系统 | |
WO2002077898A2 (en) | Method and apparatus for identifying sounds | |
CN114227702A (zh) | 一种基于机器人的会议智能指引方法、装置和机器人 | |
CN110062097A (zh) | 骚扰电话处理方法、装置、移动终端以及存储介质 | |
US20050239511A1 (en) | Speaker identification using a mobile communications device | |
US8514762B2 (en) | System and method for embedding text in multicast transmissions | |
CN105407409A (zh) | 进行远程即时通讯的方法及系统 | |
CN104917995A (zh) | 离线视频通讯的实现方法及装置 | |
US20050068183A1 (en) | Security system and security method | |
JP2004221736A (ja) | ドアホン装置 | |
CN108766486B (zh) | 一种控制方法、装置及电子设备 | |
CN108694388A (zh) | 基于智能摄像头的校园监控方法及设备 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase in: |
Ref country code: JP |
|
WWW | Wipo information: withdrawn in national office |
Country of ref document: JP |