JP7188589B2 - 復元装置、復元方法、およびプログラム - Google Patents

復元装置、復元方法、およびプログラム Download PDF

Info

Publication number
JP7188589B2
JP7188589B2 JP2021528089A JP2021528089A JP7188589B2 JP 7188589 B2 JP7188589 B2 JP 7188589B2 JP 2021528089 A JP2021528089 A JP 2021528089A JP 2021528089 A JP2021528089 A JP 2021528089A JP 7188589 B2 JP7188589 B2 JP 7188589B2
Authority
JP
Japan
Prior art keywords
signal
clipped
restoration
neural network
clipped signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2021528089A
Other languages
English (en)
Japanese (ja)
Other versions
JPWO2020255242A1 (https=
Inventor
暁 江村
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NTT Inc
NTT Inc USA
Original Assignee
Nippon Telegraph and Telephone Corp
NTT Inc USA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Telegraph and Telephone Corp, NTT Inc USA filed Critical Nippon Telegraph and Telephone Corp
Publication of JPWO2020255242A1 publication Critical patent/JPWO2020255242A1/ja
Application granted granted Critical
Publication of JP7188589B2 publication Critical patent/JP7188589B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Image Processing (AREA)
JP2021528089A 2019-06-18 2019-06-18 復元装置、復元方法、およびプログラム Active JP7188589B2 (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2019/024058 WO2020255242A1 (ja) 2019-06-18 2019-06-18 復元装置、復元方法、およびプログラム

Publications (2)

Publication Number Publication Date
JPWO2020255242A1 JPWO2020255242A1 (https=) 2020-12-24
JP7188589B2 true JP7188589B2 (ja) 2022-12-13

Family

ID=74037011

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2021528089A Active JP7188589B2 (ja) 2019-06-18 2019-06-18 復元装置、復元方法、およびプログラム

Country Status (3)

Country Link
US (1) US20220375489A1 (https=)
JP (1) JP7188589B2 (https=)
WO (1) WO2020255242A1 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11765011B2 (en) 2021-07-06 2023-09-19 Huawei Technologies Co., Ltd. Method and apparatus for transmitting and receiving data

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005275410A (ja) 2004-03-23 2005-10-06 Herman Becker Automotive Systems-Wavemakers Inc ニューラルネットワークを利用してスピーチ信号を分離する。
JP2013162347A (ja) 2012-02-06 2013-08-19 Sony Corp 画像処理装置、画像処理方法、プログラム、および装置

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150032449A1 (en) * 2013-07-26 2015-01-29 Nuance Communications, Inc. Method and Apparatus for Using Convolutional Neural Networks in Speech Recognition
KR102565447B1 (ko) * 2017-07-26 2023-08-08 삼성전자주식회사 청각 인지 속성에 기반하여 디지털 오디오 신호의 이득을 조정하는 전자 장치 및 방법
US10699700B2 (en) * 2018-07-31 2020-06-30 Tencent Technology (Shenzhen) Company Limited Monaural multi-talker speech recognition with attention mechanism and gated convolutional networks
US20190149134A1 (en) * 2019-01-14 2019-05-16 Intel Corporation Filter optimization to improve computational efficiency of convolution operations

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005275410A (ja) 2004-03-23 2005-10-06 Herman Becker Automotive Systems-Wavemakers Inc ニューラルネットワークを利用してスピーチ信号を分離する。
JP2013162347A (ja) 2012-02-06 2013-08-19 Sony Corp 画像処理装置、画像処理方法、プログラム、および装置

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
IIZUKA,Satoshi et al.,Globally and Locally Consistent Image Completion,[onlone],米国,ACM,2017年07月,pages:1-14,[Retrieved from the Internet]<URL:https://dl.acm.org/doi/pdf/10.1145/3072959.3073659>
YU,Jiahui et al.,Free-Form Image Inpainting with Gated Convolution,[online],米国,arXiv,2018年06月10日,pages:1-12,[retrieved on 2022.8.24],Retrieved from the Internet:<URL:https://arxiv.org/pdf/1806.03589v1.pdf>

Also Published As

Publication number Publication date
WO2020255242A1 (ja) 2020-12-24
JPWO2020255242A1 (https=) 2020-12-24
US20220375489A1 (en) 2022-11-24

Similar Documents

Publication Publication Date Title
US20240161251A1 (en) Image denoising method and apparatus based on wavelet high-frequency channel synthesis
Chen et al. Graph unrolling networks: Interpretable neural networks for graph signal denoising
JP7007488B2 (ja) ハードウェアベースのプーリングのシステムおよび方法
EP3340129B1 (en) Artificial neural network class-based pruning
Chen et al. Signal recovery on graphs: Variation minimization
Wei et al. Deep unfolding with normalizing flow priors for inverse problems
WO2021101864A1 (en) Forecasting time-series data in a network environment
JP2020532777A (ja) ディープニューラルネットワークの実行方法、実行装置、学習方法、学習装置及びプログラム
CN113255437A (zh) 滚动轴承深度卷积稀疏自动编码器故障诊断方法
Maus et al. Evaluating Lyapunov exponent spectra with neural networks
JP2019079305A (ja) データ分析装置、データ分析方法、およびデータ分析プログラム
WO2020003434A1 (ja) 機械学習方法、機械学習装置、及び機械学習プログラム
EP3637327A1 (en) Computing device and method
Ulfarsson et al. Sparse variable PCA using geodesic steepest descent
KR20210043295A (ko) 뉴럴 네트워크의 데이터를 양자화하는 방법 및 장치
US20210004681A1 (en) Data processing apparatus, training apparatus, method of detecting an object, method of training, and medium
CN111783938A (zh) 时间序列的预测方法和装置
CN111461862B (zh) 为业务数据确定目标特征的方法及装置
KR102885647B1 (ko) 어텐션 기반 잠재 영역에서 음성 향상 기술을 결합한 음성인식 시스템
JP7188589B2 (ja) 復元装置、復元方法、およびプログラム
CN113496228A (zh) 一种基于Res2Net、TransUNet和协同注意力的人体语义分割方法
JP7047665B2 (ja) 学習装置、学習方法及び学習プログラム
KR20210038027A (ko) 신경망 압축 훈련 방법 및 압축된 신경망을 이용하는 방법
CN119206098B (zh) 一种基于clip的性质到微结构逆向生成方法
CN112749845A (zh) 模型训练方法、资源数据预测方法、装置和计算设备

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20211011

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20220830

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20220929

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20221101

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20221114

R150 Certificate of patent or registration of utility model

Ref document number: 7188589

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

S533 Written request for registration of change of name

Free format text: JAPANESE INTERMEDIATE CODE: R313533

R350 Written notification of registration of transfer

Free format text: JAPANESE INTERMEDIATE CODE: R350