WO2002077898A2 - Method and apparatus for identifying sounds - Google Patents

Method and apparatus for identifying sounds

Info

Publication number
WO2002077898A2
Authority
WO
WIPO (PCT)
Prior art keywords
sound
sounds
computer
archive
sound sample
Prior art date
Application number
PCT/FI2002/000239
Other languages
English (en)
French (fr)
Other versions
WO2002077898A3 (en)
Inventor
Jaakko Larjomaa
Original Assignee
Jaakko Larjomaa
Priority date
Filing date
Publication date
Application filed by Jaakko Larjomaa
Priority to AU2002242763A
Publication of WO2002077898A2
Publication of WO2002077898A3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/30 Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G10L17/26 Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices

Definitions

  • The present invention relates to a method and apparatus for identifying sounds. Although the principal intention is to identify birds on the basis of their calls, nothing prevents the invention from being used to identify other sounds as well.
  • Birdcalls in their various forms (for example, calls marking territories, calls seeking a mate, and other typical calls) have been recorded in natural conditions and collected, for example, on commercial CD recordings.
  • The idea is that when a birdwatcher hears an unknown birdcall, it will be possible to listen to the CD and identify the call heard as being that of a specific bird.
  • The present invention is intended to create a method and apparatus by means of which the sounds of birds, and possibly also of other animals, can be identified with a high degree of certainty and as quickly as possible, with the further advantage that a person interested in birds no longer needs to rely on fading memories of the nature and type of the sounds.
  • Figure 1 shows one embodiment of the invention as a schematic diagram.
  • Figure 2 shows another embodiment of the invention, also as a schematic diagram.
  • The arrangement according to Figure 1 comprises a device 2 for capturing sounds.
  • This device is usually a directional microphone, which can be moved to search for the birdcall or similar sound to be monitored. The sound is then listened to using earphones 5 or any other suitable device and, once a sound of suitable quality is heard, it can be forwarded, using a cable 4 or similar and a device 6, to the apparatus 3, in which it is recorded and can be further processed.
  • The microphone apparatus or device 6 can also include a memory, which permits a sound sample to be recorded and sent at a later stage. This makes it possible to send a sound sample for comparison only once a completely satisfactory one has been obtained.
  • The data is sent with the aid of the device 6 to be processed elsewhere: a powerful apparatus 3 located in suitable premises receives the data, carries out the analysis, and returns the result over the same transmission/reception channel.
  • The apparatus 3 is naturally a computer-type apparatus, in particular a computer with a large processing capacity, in the memory of which samples of all the birdcalls that may be required are recorded.
  • The device 6 could be, for example, a portable computer that includes a suitable component for wireless data transfer.
  • A cheaper device 6, and one available to every user, is a mobile telephone, which has suitable properties for transmitting sound data wirelessly.
  • The sound captured and recorded in the apparatus 3 is then compared with the sound samples of the sound archive, and the result of the comparison is shown on a display device, either the display of a microcomputer or the display of a telephone.
  • Figure 2 shows an alternative in which, in place of the microcomputer 6 shown in Figure 1, a mobile telephone is used to send and receive the data.
  • The data can be sent and received using any known protocol.
  • The embodiments of Figures 1 and 2 do not differ from each other, because the sending and reception of data to and from the computer 6 will most probably take place using mobile telephone technology.
  • The microphone 2 is connected using a suitable adapter to the mobile telephone 6, from which a connection is opened to the apparatus 3 containing the sound archive.
  • The sound sample is sent to the apparatus 3, which analyses the sample and returns the identification data to the mobile telephone, either during the same connection or, for instance, as a text message once identification has been carried out.
  • A suitable price is calculated for the aforesaid services and is charged, for example, directly to the mobile telephone bill.
  • The reception of image data is also entirely possible and indeed sensible.
  • The server 3 sends not only information as to which bird is in question, but also one or several pictures of the songster.
  • It is also possible to send other information concerning the bird, such as information on its habitat, habits, and distribution.
  • It is also possible to send an image to a mobile telephone, although the displays of present mobile telephones generally do not provide a sufficiently clear image to permit the identification of a bird.
  • The system according to the invention can be used not only for direct identification but also as an archive, from which it is possible to retrieve, for example, a sound sample of a bird and possibly a picture, without having to submit a sound sample. In that case, only a simple query according to a certain protocol need be made.
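
The comparison step performed by the apparatus 3 is not specified in the description above, so the following Python sketch is only one possible illustration, not the applicant's method. It assumes that each archived call is reduced to energies at a few probe frequencies and that the incoming sample is matched by cosine similarity. The names SoundArchive, band_energies, tone, the probe frequencies, the 8 kHz sample rate, and the species labels are all hypothetical; synthetic sine tones stand in for recorded calls.

```python
import math

SAMPLE_RATE = 8000  # Hz; assumed, roughly telephone-quality audio


def tone(freq_hz, seconds=0.1):
    """Synthetic stand-in for a recorded call: a pure sine tone."""
    n = int(SAMPLE_RATE * seconds)
    return [math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i in range(n)]


def band_energies(samples, bands=(500, 1000, 2000, 3000)):
    """Magnitude of the signal at a few probe frequencies (single-bin DFT)."""
    feats = []
    for f in bands:
        w = 2 * math.pi * f / SAMPLE_RATE
        re = sum(s * math.cos(w * i) for i, s in enumerate(samples))
        im = sum(s * math.sin(w * i) for i, s in enumerate(samples))
        feats.append(math.hypot(re, im))
    return feats


def cosine(a, b):
    """Cosine similarity; insensitive to overall loudness of the sample."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class SoundArchive:
    """Hypothetical model of the sound archive held by the apparatus 3."""

    def __init__(self):
        self._entries = {}  # species name -> (feature vector, extra info)

    def add(self, name, samples, info=""):
        self._entries[name] = (band_energies(samples), info)

    def identify(self, samples):
        """Return the best-matching archive entry for a submitted sample."""
        feats = band_energies(samples)
        name, (ref, info) = max(
            self._entries.items(), key=lambda kv: cosine(feats, kv[1][0])
        )
        return {"species": name, "score": cosine(feats, ref), "info": info}

    def lookup(self, name):
        """Archive-only query: no sound sample needs to be submitted."""
        return {"species": name, "info": self._entries[name][1]}


if __name__ == "__main__":
    archive = SoundArchive()
    archive.add("bird_a", tone(1000), "habitat and distribution notes for bird_a")
    archive.add("bird_b", tone(2000), "habitat and distribution notes for bird_b")
    # A quieter recording of the same call should still match bird_a.
    print(archive.identify([0.5 * s for s in tone(1000)]))
```

The identify call corresponds to sending a sample and receiving the identification data back, while lookup corresponds to using the system purely as an archive. A real system would need far more robust features than four probe frequencies; that choice is made here only to keep the sketch short.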

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Investigating Or Analyzing Materials By The Use Of Ultrasonic Waves (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
PCT/FI2002/000239 2001-03-27 2002-03-21 Method and apparatus for identifying sounds WO2002077898A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2002242763A AU2002242763A1 (en) 2001-03-27 2002-03-21 Method and apparatus for identifying sounds

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FI20010632 2001-03-27
FI20010632A FI20010632A (fi) 2001-03-27 2001-03-27 Method and apparatus for identifying sounds (Menetelmä ja laitteisto äänien tunnistamiseksi)

Publications (2)

Publication Number Publication Date
WO2002077898A2 (en) 2002-10-03
WO2002077898A3 (en) 2003-06-26

Family

ID=8560854

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/FI2002/000239 WO2002077898A2 (en) 2001-03-27 2002-03-21 Method and apparatus for identifying sounds

Country Status (3)

Country Link
AU (1) AU2002242763A1 (fi)
FI (1) FI20010632A (fi)
WO (1) WO2002077898A2 (fi)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2089597A1 (en) * 1993-02-16 1994-08-17 Douglas G. Bain Apparatus for audio identification of a bird
EP0813186A2 (en) * 1996-06-14 1997-12-17 Masaomi Yamamoto Animal's intention translational method
US5956463A (en) * 1993-06-15 1999-09-21 Ontario Hydro Audio monitoring system for assessing wildlife biodiversity
WO2002006922A2 (en) * 2000-07-19 2002-01-24 Identity Concepts, Llc Method and apparatus for identifying a subject
US20020116195A1 (en) * 2000-11-03 2002-08-22 International Business Machines Corporation System for selling a product utilizing audio content identification

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
BRIGHT L ET AL: "Efficient remote data access in a mobile computing environment" PROCEEDINGS 2000. INTERNATIONAL WORKSHOP ON PARALLEL PROCESSING, TORONTO, ONT., CANADA, 21-24 AUG. 2000, pages 57-64, XP002222483 2000, Los Alamitos, CA, USA, IEEE Comput. Soc, USA ISBN: 0-7695-0771-9 *
DATABASE INSPEC [Online] INSTITUTE OF ELECTRICAL ENGINEERS, STEVENAGE, GB; BAINBRIDGE D ET AL: "Towards a digital library of popular music" Database accession no. 7048293 XP002222485 & DIGITAL 99 LIBRARIES. FOURTH ACM CONFERENCE ON DIGITAL LIBRARIES, PROCEEDINGS OF 1999 CONFERENCE ON DIGITAL LIBRARIES, BERKELEY, CA, USA, 11-14 AUG. 1999, pages 161-169, 1999, New York, NY, USA, ACM, USA ISBN: 1-58113-145-3 *
MCILRAITH A L ET AL: "Bird song identification using artificial neural networks and statistical analysis" CCECE '97. CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING. ENGINEERING INNOVATION: VOYAGE OF DISCOVERY. CONFERENCE PROCEEDINGS (CAT. NO.97TTH8244), pages 63-66 vol.1, XP002222484 1997, New York, NY, USA, IEEE, USA ISBN: 0-7803-3716-6 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2422044A (en) * 2005-01-11 2006-07-12 Pariff Llc Identifying bird vocalisation by hierarchical analysis of family and species
US7377233B2 (en) 2005-01-11 2008-05-27 Pariff Llc Method and apparatus for the automatic identification of birds by their vocalizations
US7963254B2 (en) 2005-01-11 2011-06-21 Pariff Llc Method and apparatus for the automatic identification of birds by their vocalizations
WO2009155348A1 (en) 2008-06-17 2009-12-23 Pandion Systems, Inc. System and method for detecting bats and their impact on wind facilities
EP2316006A4 (en) * 2008-06-17 2017-03-08 Normandeau Associates, Inc. System and method for detecting bats and their impact on wind facilities
US10832672B2 (en) 2018-07-13 2020-11-10 International Business Machines Corporation Smart speaker system with cognitive sound analysis and response
US10832673B2 (en) 2018-07-13 2020-11-10 International Business Machines Corporation Smart speaker device with cognitive sound analysis and response
US11631407B2 (en) 2018-07-13 2023-04-18 International Business Machines Corporation Smart speaker system with cognitive sound analysis and response

Also Published As

Publication number Publication date
FI20010632A0 (fi) 2001-03-27
AU2002242763A1 (en) 2002-10-08
FI20010632A (fi) 2002-09-28
WO2002077898A3 (en) 2003-06-26

Similar Documents

Publication Publication Date Title
US6404860B1 (en) System and method for internet call management with text-to-speech messaging
KR101954550B1 (ko) Volume adjustment method, system, device and computer storage medium
US9721287B2 (en) Method and system for interacting with a user in an experimental environment
CN102779179B (zh) Method and terminal for associating information
US7920158B1 (en) Individual participant identification in shared video resources
US20070160365A1 (en) Image capture system, handheld terminal device, and image server
EP1587291A3 (en) Enhanced caller ID information based on access device information via a broadband access gateway
EP0782296A3 (en) Securing transmission and receipt of electronic data
US20070266092A1 (en) Conferencing system with automatic identification of speaker
JP2008113418A (ja) Method for centrally storing data
EP1139663A3 (en) Communication method, communication service apparatus, communication terminal device and communication system
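CN111416758A (zh) Smart home real-time intercom system and method
CN105389318B (zh) Information processing method and electronic device
CN111311774A (zh) Sign-in method and system based on speech recognition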
WO2002077898A2 (en) Method and apparatus for identifying sounds
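CN114227702A (zh) Robot-based intelligent conference guidance method and device, and robot
CN110062097A (zh) Harassing call processing method and device, mobile terminal, and storage medium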
US20050239511A1 (en) Speaker identification using a mobile communications device
US8514762B2 (en) System and method for embedding text in multicast transmissions
CN105407409A (zh) Method and system for remote instant messaging
CN104917995A (zh) Method and device for implementing offline video communication
US20050068183A1 (en) Security system and security method
JP2004221736A (ja) Door phone device
CN108766486B (zh) Control method, device and electronic equipment
CN108694388A (zh) Campus monitoring method and device based on smart camera

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP