IEEE ICASSP 2020 Virtual Conference May 2020

Thu, 16 July, 2020

IEEE Member: US $11.00
Society Member: US $0.00
IEEE Student Member: US $11.00
Non-IEEE Member: US $15.00

A Hierarchical Tracker For Multi-Domain Dialogue State Tracking

00:12:55

0 views

The goal of Dialogue State Tracking (DST) is to estimate the current dialogue state given all the preceding conversation. Due to the increased number of state candidates, the data sparsity problem is still a major hurdle for multi-domain DST. Existing methods

Frequency Diverse Array Radar: A Closed-Form Solution To Design Weights For Desired Beampattern

00:13:37

1 view

In contrast to phased-array radar, frequency-diverse-array (FDA) radar transmits signals of linearly increasing frequencies across the array. As a consequence, the beampattern of an FDA radar becomes range, angle, and time dependent, which is different fr

M-Estimators Of Scatter With Eigenvalue Shrinkage

00:16:46

0 views

A popular regularized (shrinkage) covariance estimator is the shrinkage sample covariance matrix (SCM) which shares the same set of eigenvectors as the SCM but shrinks its eigenvalues toward its grand mean. In this paper, a more general approach is consid

Speech Intelligibility Enhancement By Equalization For In-Car Applications

00:13:35

0 views

In this paper, we propose a speech intelligibility enhancement method for typical in-car applications in noisy environments. While traditional speech enhancement algorithms aim at increasing the Signal to Noise Ratio (SNR), the goal here is to increase in

Content Vs Context: How About "Walking Hand-In-Hand" For Image Clustering?

00:11:52

0 views

Image clustering has been one of the most important issues in the field of pattern recognition. However, most existing methods focus on utilizing either content or context information of images, failing to consider both. In fact, the power

Weakly Supervised Semantic Segmentation For Remote Sensing Hyperspectral Imaging

00:12:06

0 views

This paper studies the problem of training a semantic segmentation neural network with weak annotations, for application to aerial vegetation images from Teide National Park. It proposes a Deep Seeded Region Growing system which consists of trainin

On End-To-End Multi-Channel Time Domain Speech Separation In Reverberant Environments

00:13:24

0 views

This paper introduces a new method for multi-channel time domain speech separation in reverberant environments. A fully-convolutional neural network structure has been used to directly separate speech from multiple microphone recordings, with no need of c

Portfolio Cuts: A Graph-Theoretic Framework To Diversification

00:19:00

0 views

Investment returns naturally reside on irregular domains; however, standard multivariate portfolio optimization methods are agnostic to data structure. To this end, we investigate ways for domain knowledge to be conveniently incorporated into the analysis

HI-MIA: A Far-Field Text-Dependent Speaker Verification Database And The Baselines

00:11:54

0 views

This paper presents a large far-field text-dependent speaker verification database named HI-MIA. We aim to meet the data requirement for far-field microphone array based speaker verification since most of the publicly available databases are single channe

Characterization Of A Snapshot Fourier Transform Imaging Spectrometer Based On An Array Of Fabry-Perot Interferometers

00:14:21

0 views

This study focuses on a novel snapshot Fourier Transform imaging spectrometer based on an array of Fabry-Perot interferometers. This device fully relies on signal processing in order to provide intelligible outputs and thus requires a precise characterisa

Maximally Energy-Concentrated Differential Window For Phase-Aware Signal Processing Using Instantaneous Frequency

00:13:39

0 views

The short-time Fourier transform (STFT) is widely employed in nonstationary signal analysis, and its properties depend on the window function. Instantaneous frequency in STFT, the time-derivative of phase, has recently been used in many applications including spec

Deep Multi-Region Hashing

00:13:55

0 views

Hashing has been widely used for large-scale approximate nearest neighbor retrieval owing to its high efficiency. Among existing hashing methods, deep supervised hashing methods have achieved the best performance by utilizing the semantic labels on data w

Beam Elimination Based On Sequentially Estimated A Posteriori Probabilities Of Winning

00:16:51

0 views

A robust and adaptive variable length beam selection strategy based on M-ary sequential competition was proposed in [1]. It was enhanced by the elimination of inauspicious beams during the ongoing competition to improve the efficiency and speed of the tra

Acoustic Scene Classification For Mismatched Recording Devices Using Heated-Up Softmax And Spectrum Correction

[2 Videos ]

Deep neural networks (DNNs) are successful in applications with matching inference and training distributions. In real-world scenarios, DNNs have to cope with truly new data samples during inference, potentially coming from a shifted data distribution. Th

Acoustic Scene Classification For Mismatched Recording Devices Using Heated-Up Softmax And Spectrum Correction

00:14:28

0 views

Deep neural networks (DNNs) are successful in applications with matching inference and training distributions. In real-world scenarios, DNNs have to cope with truly new data samples during inference, potentially coming from a shifted data distribution. Th

Multiple Points Input For Convolutional Neural Networks In Replay Attack Detection

00:14:39

0 views

Models based on convolutional neural networks (CNNs) have shown remarkable performance in spoofing detection for automatic speaker verification. In order to input data into CNN-based models in mini-batch units, the shape of all data in each mini-batch mu

Improving Auditory Attention Decoding Performance Of Linear And Non-Linear Methods Using State-Space Model

00:14:21

680 views

Identifying the target speaker in hearing aid applications is crucial to improve speech understanding. Recent advances in electroencephalography (EEG) have shown that it is possible to identify the target speaker from single-trial EEG recordings using aud

Meta-Learning To Communicate: Fast End-To-End Training For Fading Channels

00:16:19

0 views

When a channel model is available, learning how to communicate on fading noisy channels can be formulated as the (unsupervised) training of an autoencoder consisting of the cascade of encoder, channel, and decoder. An important limitation of the approach

Lipreading Using Temporal Convolutional Networks

00:13:49

0 views

Lip-reading has attracted a lot of research attention lately thanks to advances in deep learning. The current state-of-the-art model for recognition of isolated words in-the-wild consists of a residual network and Bidirectional Gated Recurrent Unit (BGRU)

Improving Cross-Dataset Performance Of Face Presentation Attack Detection Systems Using Face Recognition Datasets

00:14:36

848 views

Presentation attack detection (PAD) is now considered critically important for any face-recognition (FR) based access-control system. Current deep-learning based PAD systems show excellent performance when they are tested in intra-dataset scenarios. Under

Orthogonal Training For Text-Independent Speaker Verification

00:13:26

0 views

In this paper we propose orthogonal training schemes to improve the effectiveness of cosine similarity measurements in text-independent speaker verification (SV) tasks. Compared to the PLDA backend, cosine similarity is simple to compute, and it does not

Multi-Layer Content Interaction Through Quaternion Product For Visual Question Answering

00:03:39

0 views

Multi-modality fusion technologies have greatly improved the performance of neural network-based Video Description/Caption, Visual Question Answering (VQA) and Audio Visual Scene-aware Dialog (AVSD) over the recent years. Most previous approaches only exp

On The Choice Of Graph Neural Network Architectures

00:13:16

1 view

Seminal works on graph neural networks have primarily targeted semi-supervised node classification problems with few observed labels and high-dimensional signals. With the development of graph networks, this setup has become a de facto benchmark for a sig

Griffin–Lim Like Phase Recovery Via Alternating Direction Method Of Multipliers

00:12:58

0 views

Recovering a signal from its amplitude spectrogram, or phase recovery, has many applications in acoustic signal processing. When only an amplitude spectrogram is available and no explicit information is given for the phases, the Griffin-Lim algorithm

Bilevel Optimization Using Stationary Point Of Lower-Level Objective Function

00:14:56

590 views

In this letter, we address an audio signal separation problem and propose a new effective algorithm for solving a bilevel optimization in discriminative nonnegative matrix factorization (NMF). Recently, discriminative training of NMF bases has been develo

DeepJSCC: The Future Of Wireless Video Transmission

00:08:40

717 views

We propose a demonstration of a joint source-channel coding (JSCC) scheme, called DeepJSCC, for wireless video transmission. Unlike conventional digital communication systems, which rely on separate source and channel coding, DeepJSCC is a purely data-dri

A Random Gossip BMUF Process For Neural Language Modeling

00:14:10

0 views

A neural network language model (NNLM) is an essential component of industrial ASR systems. One important challenge of training an NNLM is to balance scaling the learning process against handling big data. Conventional approaches such as block momentum

Consistency-Aware Multi-Channel Speech Enhancement Using Deep Neural Networks

00:13:54

0 views

This paper proposes a deep neural network (DNN)-based multi-channel speech enhancement system in which a DNN is trained to maximize the quality of the enhanced time-domain signal. DNN-based multi-channel speech enhancement is often conducted in the time-

Unsupervised Training For Deep Speech Source Separation With Kullback-Leibler Divergence Based Probabilistic Loss Function

00:14:19

0 views

In this paper, we propose a multi-channel speech source separation method with a deep neural network (DNN) which is trained under the condition that no clean signal is available. As an alternative to a clean signal, the proposed method adopts an estimated

A Deep Gradient Boosting Network For Optic Disc And Cup Segmentation

00:12:24

0 views

Segmentation of the optic disc (OD) and optic cup (OC) is critical in automated fundus image analysis systems. Existing state-of-the-art methods focus on designing deep neural networks with one or multiple dense prediction branches. Such designs ignore connect

Feature Selection Under Orthogonal Regression With Redundancy Minimizing

00:14:51

0 views

Various supervised embedded methods have been proposed to select discriminative features from original ones, such as Feature Selection with Orthogonal Regression (FSOR) and Robust Feature Selection. Compared with embedded methods based on the least square

Audio Codec Enhancement With Generative Adversarial Networks

00:15:26

0 views

Audio codecs are typically transform-domain based and efficiently code stationary audio signals, but they struggle with speech and signals containing dense transient events such as applause. Specifically, with these two classes of signals as examples, we

Differentiable Branching In Deep Networks For Fast Inference

00:09:45

0 views

In this paper, we consider the design of deep neural networks augmented with multiple auxiliary classifiers departing from the main (backbone) network. These classifiers can be used to perform early-exit from the network at various layers, making them con

Cross Image Cubic Interpolator For Spatially Varying Exposures

00:07:03

0 views

Spatially varying exposures via rolling shutter is an efficient way to capture differently exposed images for high dynamic range (HDR) scenes. Neither camera movement nor moving objects is an issue for such a capture method. However, a possible issue is

Beyond The DCASE 2017 Challenge On Rare Sound Event Detection: A Proposal For A More Realistic Training And Test Framework

00:13:15

0 views

There are many ways to evaluate rare sound event detection (SED) approaches, e.g., the DCASE 2017 challenge provides a widely employed framework. This paper proposes a rare SED training and test framework, which reflects an SED application in a more

Lie Group State Estimation Via Optimal Transport

00:12:52

0 views

Many applications in science and engineering involve tracking the state of a stochastic differential equation (SDE) evolving in a Lie group. This has been tackled by particle filtering although some existing schemes fail to satisfy geometric constraints.

Distributed Quantization For Sparse Time Sequences

00:15:27

0 views

Analog signals processed in digital hardware are quantized into a discrete bit-constrained representation. Quantization is typically carried out using analog-to-digital converters (ADCs), operating in a serial scalar manner. In some applications, a set of

Maximum Likelihood Estimation Of The Interference-Plus-Noise Cross Power Spectral Density Matrix For Own Voice Retrieval

00:14:59

1 view

In headset and hearing aid applications, it is of interest to retrieve the user's own voice in a noisy environment, e.g. for telephony applications. To do so, the cross power spectral density (CPSD) of the noise is required. In this paper, a novel maximum

Multitask Learning And Multistage Fusion For Dimensional Audiovisual Emotion Recognition

00:13:07

0 views

Due to its ability to accurately predict emotional state using multimodal features, audiovisual emotion recognition has recently gained more interest from researchers. This paper proposes two methods to predict emotional attributes from audio and visual d

Joint Resource Allocation And Routing For Service Function Chaining With In-Subnetwork Processing

00:14:53

0 views

Network Function Virtualization (NFV) is an efficient approach to simplify and accelerate the deployment of diverse network services. A critical challenge lies in mapping Virtual Network Functions (VNFs) to high-volume servers, resource allocation, and tr

On-The-Fly Feature Selection And Classification With Application To Civic Engagement Platforms

00:14:07

0 views

Online feature selection and classification is crucial for time-sensitive decision making. Existing work, however, either assumes that features are independent or produces a fixed number of features for classification. Instead, we propose an optimal framewo

Attention Mechanism Enhanced Kernel Prediction Networks For Denoising Of Burst Images

00:13:02

0 views

Deep learning based image denoising methods have been extensively investigated. In this paper, attention mechanism enhanced kernel prediction networks (AME-KPNs) are proposed for burst image denoising, in which, nearly cost-free attention modules are adop

Speech Enhancement Using Self-Adaptation And Multi-Head Self-Attention

[2 Videos ]

This paper investigates a self-adaptation method for speech enhancement using auxiliary speaker-aware features; we extract a speaker representation used for adaptation directly from the test utterance. Conventional studies of deep neural network (DNN)--ba

Speech Enhancement Using Self-Adaptation And Multi-Head Self-Attention

00:14:36

1 view

This paper investigates a self-adaptation method for speech enhancement using auxiliary speaker-aware features; we extract a speaker representation used for adaptation directly from the test utterance. Conventional studies of deep neural network (DNN)--ba

Speech Enhancement Using Self-Adaptation And Multi-Head Self-Attention

00:14:38

1 view

This paper investigates a self-adaptation method for speech enhancement using auxiliary speaker-aware features; we extract a speaker representation used for adaptation directly from the test utterance. Conventional studies of deep neural network (DNN)--ba

A DSP Acceleration Framework For Software-Defined Radios On x86_64

00:12:58

0 views

This paper presents a DSP acceleration and assessment framework targeting SDR platforms on x86_64 architectures. Driven by the potential of rapid prototyping and evaluation of breakthrough concepts that these platforms provide, our work builds upon the we

Constant Envelope Massive MIMO-OFDM Precoding: An Improved Formulation And Solution

00:13:07

0 views

Constant Envelope (CE) precoding is an efficient technique for systems based on massive antenna arrays since the constant amplitude of the transmit signal facilitates the use of power efficient non-linear transmitter circuitry, such as power amplifiers (P

A Cross-Task Transfer Learning Approach To Adapting Deep Speech Enhancement Models To Unseen Background Noise Using Paired Senone Classifiers

00:11:02

1 view

We propose an environment adaptation approach that improves deep speech enhancement models via minimizing the Kullback-Leibler divergence between posterior probabilities produced by a multi-condition senone classifier (teacher) fed with noisy speech feat

Optimum Kernel Particle Filter For Asymmetric Laplace Noise

00:12:05

0 views

In this paper we present on-line Bayesian filtering methods for time series models corrupted by asymmetric Laplace noise. An optimum kernel particle filter is designed for the general asymmetric case, and its performance is compared to that of a tradition
