IEEE ICASSP 2020 Virtual Conference May 2020

Thu, 16 July, 2020

Showing 401 - 450 of 1951

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Effects Of Spectral Tilt On Listeners' Preferences And Intelligibility

00:12:34

0 views

High intelligibility can be achieved when listening to synthetic or artificially-produced speech under adverse conditions. But can listener preferences reveal any extra information when intelligibility is at ceiling? This paper describes a real-time speec

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

The Picasso Algorithm For Bayesian Localization Via Paired Comparisons In A Union Of Subspaces Model

00:09:54

551 views

We develop a framework for localizing an unknown point $\w$ using paired comparisons of the form ``$\w$ is closer to point $\x_i$ than to $\x_j$'' when the points lie in a union of known subspaces. This model, which extends a broad class of existing metho

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Polyphonic Sound Event Detection Using Transposed Convolutional Recurrent Neural Network

00:17:30

1 view

In this paper we propose a Transposed Convolutional Recurrent Neural Network (TCRNN) architecture for polyphonic sound event recognition. Transposed convolution layer, which caries out a regular convolution operation but reverts the spatial transformation

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Precise Performance Analysis Of The Box-Elastic Net Under Matrix Uncertainties

00:17:19

784 views

In this letter, we consider the problem of recovering an unknown sparse signal from noisy linear measurements, using an enhanced version of the popular Elastic-Net (EN) method.We modify the EN by adding a box-constraint, and we call it the Box-Elastic Net

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Real-Time Epileptic Seizure Detection During Sleep Using Passive Infrared (Pir) Sensors

00:13:17

0 views

According to World Health Organization (WHO), millions of people suffer from epilepsy, which is a chronic disorder of the brain. Sudden Unexplained Death in Epilepsy (SUDEP) is considered as one of the most dangerous threats to the patients who suffer fro

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Self-Tuning Algorithms For Multisensor-Multitarget Tracking Using Belief Propagation

00:14:21

1 view

Situation-aware technologies enabled by multitarget tracking algorithms will create new services and applications in emerging fields such as autonomous navigation and maritime surveillance. The system models underlying multitarget tracking algorithms ofte

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Single-Wavelength Real-Time Material-Sensing Camera Based On Time-Of-Flight Measurements

00:13:49

0 views

Time-of-Flight (ToF) cameras provide a fast and robust way of acquiring the 3D shape of real scenes. Dense depth images can be generated at tens of frame per second. 3D shapes can be then segmented and objects classified, but can we directly sense the obj

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Effective Pipeline For Compressing Deep Object Detectors

00:12:58

0 views

To alleviate the deployment of deep object detectors with large model capacity and complex computation, an effective model compression pipeline is designed in this paper. Firstly, attributed to the refined soft filter pruning, 3D filters of each convoluti

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Noninvasive Method To Detect Diabetes Mellitus And Lung Cancer Using The Stacked Sparse Autoencoder

00:13:33

0 views

Diabetes mellitus and lung cancer are two of the most common fatal diseases in the world, causing considerable deaths every year. However, it is not easy to detect diabetes mellitus and lung cancer efficiently--needing professional medical instruments suc

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

On The Use Of RéNyi Entropy For Optimal Window Size Computation In The Short-Time Fourier Transform

00:13:50

1 view

This paper investigates the determination of an optimal window length associated with the computation of the short time Fourier transform of multicomponent signals. For that purpose, the minimum of the Rényi entropy has been widely used in recent years. H

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Steganography And Its Detection In Jpeg Images Obtained With The "trunc"

00:11:28

0 views

Many portable imaging devices use the operation of "trunc" (rounding towards zero) instead of rounding as the final quantizer for computing DCT coefficients during JPEG compression. We show that this has rather profound consequences for steganography and

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Reflectance-Guided, Contrast-Accumulated Histogram Equalization

00:14:28

0 views

Existing image enhancement methods fall short of expectations because with them it is difficult to improve global and local image contrast simultaneously. To address this problem, we propose a histogram equalization-based method that adapts to the data-de

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Joint Training Of Deep Neural Networks For Multi-Channel Dereverberation And Speech Source Separation

00:12:50

0 views

In this paper, we propose a joint training of two deep neural networks (DNNs) for dereverberation and speech source separation. The proposed method connects the first DNN, the dereverberation part, the second DNN, and the speech source separation part in

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Hidden Markov Models For Sepsis Detection In Preterm Infants

00:14:42

0 views

We explore the use of traditional and contemporary hidden Markov models (HMMs) for sequential physiological data analysis and sepsis prediction in preterm infants. We investigate the use of classical Gaussian mixture model based HMM, and a recently propos

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Hybrid Structural Sparse Error Model For Image Deblocking

00:14:02

0 views

Inspired by the image nonlocal self-similarity (NSS) prior, structural sparse representation (SSR) models exploit each group as the basic unit for sparse representation, which have achieved promising results in various image restoration applications. Howe

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Manet: Multi-Scale Aggregated Network For Light Field Depth Estimation

00:14:02

0 views

We present a novel end-to-end network, MANet, for light field depth estimation. MANet is a parameter-effective and efficient multi-scale aggregated network, which is about 3 times smaller and 3 times faster than the current top-performing method Epinet. T

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Deep Learning For Robust Power Control For Wireless Networks

00:15:44

0 views

Robust optimization is an important task in wireless communications, because due to fading and feedback delay there is inherent uncertainty in channel state information in a wireless environment. This paper aims to show that a deep learning approach for n

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Continual Learning Through One-Class Classification Using Vae

00:14:25

0 views

Artificial neural networks (ANNs) suffer from catastrophic forgetting, a sharp decrease in performance on previously learned tasks, when trained on a new task without constant rehearsal. In this paper, we propose a new method for overcoming this phenomeno

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Coincidence, Categorization, And Consolidation: Learning To Recognize Sounds With Minimal Supervision

00:14:33

0 views

Humans do not acquire perceptual abilities in the way we train machines. While machine learning algorithms typically operate on large collections of randomly-chosen, explicitly-labeled examples, human acquisition relies more heavily on multimodal unsuperv

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Assimilation-Based Learning Of Chaotic Dynamical Systems From Noisy And Partial Data

00:14:56

0 views

Despite some promising results under ideal conditions (i.e. noise-free and complete observation), learning chaotic dynamical systems from real life data is still a very challenging task. We propose a novel framework, which combines data assimilation schem

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Transformer-Based Acoustic Modeling For Hybrid Speech Recognition

[2 Videos ]

We propose and evaluate transformer-based acoustic models for hybrid speech recognition. Several modeling choices are discussed in this work, including various positional embedding methods and an iterated loss to enable training deep transformers. We also

Show videos in this product

Transformer-Based Acoustic Modeling For Hybrid Speech Recognition

00:18:28

0 views

We propose and evaluate transformer-based acoustic models for hybrid speech recognition. Several modeling choices are discussed in this work, including various positional embedding methods and an iterated loss to enable training deep transformers. We also
Transformer-Based Acoustic Modeling For Hybrid Speech Recognition

00:00:00

0 views

We propose and evaluate transformer-based acoustic models for hybrid speech recognition. Several modeling choices are discussed in this work, including various positional embedding methods and an iterated loss to enable training deep transformers. We also

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Speaker Embeddings Incorporating Acoustic Conditions For Diarization

00:14:59

0 views

We present our work on training speaker embeddings, especially effective for speaker diarization. For various speaker recognition tasks, extracting speaker embeddings using Deep Neural Networks (DNNs) has become major methods. These embeddings are general

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Learning Perception And Planning With Deep Active Inference

00:08:17

0 views

Active inference is a process theory of the brain that states that all living organisms infer actions in order to minimize their (expected) free energy. However, current experiments are limited to predefined, often discrete, state spaces. In this paper we

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Hearing Aid Research Data Set For Acoustic Environment Recognition

00:13:21

1 view

State-of-the-art hearing aids (HA) are limited in recognizing acoustic environments. Much effort is spent on research to improve listening experience for HA users in every acoustic situation. There is, however, no dedicated public database to train acoust

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Recurrent Neural Audiovisual Word Embeddings For Synchronized Speech And Real-Time Mri

00:14:21

0 views

In this paper, the use of word embeddings for the segments found in audio and real-time magnetic resonance imaging (rtMRI) videos is addressed. In this study, word embeddings are created to store and retrieve data efficiently, and their representation pow

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

1.5Gbit/S 4.9W Hyperspectral Image Encoders On A Low-Power Parallel Heterogeneous Processing Platform

00:11:18

3 views

This work explores the utilization of low-power heterogeneous devices for parallelizing the compute-intensive hyper-spectral and multispectral image compression CCSDS-123 entropy encoders. Multithread processing allows for the near-optimal system?s bandwi

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Improving Music Transcription By Pre-Stacking A U-Net

00:10:00

1 view

We propose to pre-stack a U-Net as a way of improving the polyphonic music transcription performance of various baseline Convolutional Neural Networks (CNNS). The U-Net, a network architecture based on skip-connections between layers acts as a transformat

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Privacy Aware Acoustic Scene Synthesis Using Deep Spectral Feature Inversion

00:11:24

0 views

Gathering information about the acoustic environment of urban areas is now possible and studied in many major cities in the world. Part of the research is to find ways to inform the citizen about its sound environment while ensuring her privacy. We study

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Cloud-Driven Multi-Way Multiple-Antenna Relay Systems: Best-User-Link Selection And Joint Mmse Detection

00:18:03

0 views

In this work, we present a cloud-driven uplink framework for multi-way multiple-antenna relay systems which facilitates joint linear Minimum Mean Square Error (MMSE) symbol detection in the cloud and where users are selected to simultaneously transmit to

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Sparse Branch And Bound For Exact Optimization Of L0-Norm Penalized Least Squares

00:12:01

0 views

We propose a global optimization approach to solve l_0-norm penalized least-squares problems, using a dedicated branch-and-bound methodology. A specific tree search strategy is built, with branching rules inspired from greedy exploration techniques. We sh

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

An Lstm Based Architecture To Relate Speech Stimulus To Eeg

00:13:48

0 views

Modeling the relationship between natural speech and a recorded electroencephalogram (EEG) helps us understand how the brain processes speech and has various applications in neuroscience and brain-computer interfaces. In this context, so far mainly linear

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Composite Dnn Architecture For Speech Enhancement

00:14:16

0 views

In speech enhancement, the use of supervised algorithms in the form of deep neural networks (DNNs) has become tremendously popular in recent years. The target function of the DNN (and the associated estimators) is often either a masking function applied t

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Low-Rank Gradient Approximation For Memory-Efficient On-Device Training Of Deep Neural Network

00:12:20

0 views

Training machine learning models on mobile devices has the potential of improving both privacy and accuracy of the models. However, one of the major obstacles to achieving this goal is the memory limitation of mobile devices. Reducing training memory enab

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Image De-Raining Via Rdl: When Reweighted Convolutional Sparse Coding Meets Deep Learning

00:12:19

0 views

Over the past few decades, image de-raining has witnessed substantial progress due to the development of priors and deep learning based methods. However, few studies combine the merits of both. In this paper, we argue that domain expertise of conventional

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Minimum Latency Training Strategies For Streaming Sequence-To-Sequence Asr

00:16:19

0 views

Recently, a few novel streaming attention-based sequence-to-sequence (S2S) models have been proposed to perform online speech recognition with linear-time decoding complexity. However, in these models, the decisions to generate tokens are delayed compared

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Lqaid: Localized Quality Aware Image Denoising Using Deep Convolutional Neural Networks

00:12:01

0 views

In this paper we propose the Localized Quality Aware Image Denoising (LQAID) technique for image denoising using deep convolutional neural networks (CNNs). LQAID relies on local quality estimates over global cues like noise standard deviation since the pe

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Decomposed Cyclegan For Single Image Deraining With Unpaired Data

00:14:19

0 views

Most previous learning-based methods required paired rain image data. In practice, however, paired rain data cannot be collected. Inspired by adopting unpaired data in task of translation, in this paper we present a new method for rain removal using unpai

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

One-Bit Normalized Scatter Matrix Estimation For Complex Elliptically Symmetric Distributions

00:14:51

0 views

One-bit quantization has attracted attention in massive MIMO, radar, and array processing, due to its simplicity, low cost, and capability of parameter estimation. Specifically, the shape of the covariance of the unquantized data can be estimated from the

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Proximal Multitask Learning Over Distributed Networks With Jointly Sparse Structure

00:15:34

0 views

Modeling relations between local optimum parameter vectors in multitask networks has attracted much attention over the last years. This work considers a distributed optimization problem for parameter vectors with a jointly sparse structure among nodes, th

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Generating Multilingual Voices Using Speaker Space Translation Based On Bilingual Speaker Data

[2 Videos ]

We present progress towards bilingual Text-to-Speech which is able to transform a monolingual voice to speak a second language while preserving speaker voice quality. We demonstrate that a bilingual speaker embedding space contains a separate distribution

Show videos in this product

Generating Multilingual Voices Using Speaker Space Translation Based On Bilingual Speaker Data

00:13:15

0 views

We present progress towards bilingual Text-to-Speech which is able to transform a monolingual voice to speak a second language while preserving speaker voice quality. We demonstrate that a bilingual speaker embedding space contains a separate distribution
Generating Multilingual Voices Using Speaker Space Translation Based On Bilingual Speaker Data

00:13:15

0 views

We present progress towards bilingual Text-to-Speech which is able to transform a monolingual voice to speak a second language while preserving speaker voice quality. We demonstrate that a bilingual speaker embedding space contains a separate distribution

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Neural Percussive Synthesis Parameterised By High-Level Timbral Features

00:12:10

0 views

We present a deep neural network-based methodology for synthesising percussive sounds with control over high-level timbral characteristics of the sounds. This approach allows for intuitive control of a synthesizer, enabling the user to shape sounds withou

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Robustness Of Sparse Bayesian Learning In Correlated Environments

00:14:22

0 views

In this paper we explore the robustness of Sparse Bayesian Learning (SBL) in an environment with correlated sources. We provide two new perspectives to understand SBL's strategy for handling correlated sources. Using a Minimum Power Distortionless Respons

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

End-To-End Multi-Person Audio/Visual Automatic Speech Recognition

00:15:17

0 views

Traditionally, audio-visual automatic speech recognition has been studied under the assumption that the speaking face on the visual signal is the face matching the audio. However, in a more realistic setting, when multiple faces are potentially on screen

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Bband Index: A No-Reference Banding Artifact Predictor

00:13:47

0 views

Banding artifact, or false contouring, is a common video compression impairment that tends to appear on large flat regions in encoded videos. These staircase-shaped color bands can be very noticeable in high-definition videos. Here we study this artifact,

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Gender Differences On The Perception And Production Of Utterances With Willingness And Reluctance In Chinese

00:13:11

0 views

This study intends to explore the effects of gender differences on the perception and production of emotional intonation with willingness and reluctance. In the perceptual study, 20 native Mandarin listeners were instructed to rate perceived degree of wil

All Channels page: Communities submenu block

Communities

All Channels page: Societies submenu block

Societies

Events Showcase: ES submenu block

Event showcases

Recently Added Speakers

Events Hub Submenu block

Education: Education submenu block

Education Activity

2020 EAB AWARDS

2020 EAB AWARDS

IEEE ICASSP 2020 Virtual Conference May 2020