IEEE ICASSP 2020 Virtual Conference May 2020 | IEEETV

Thu, 16 July, 2020

IEEE Member: US $11.00
Society Member: US $0.00
IEEE Student Member: US $11.00
Non-IEEE Member: US $15.00

A Comprehensive Study Of Residual CNNs For Acoustic Modeling In ASR

00:13:32

1 view

Long short-term memory (LSTM) networks are the dominant architecture for large vocabulary continuous speech recognition (LVCSR) acoustic modeling due to their good performance. However, LSTMs are hard to tune and computationally expensive. To build a syst

Effective Approximation Of Bandlimited Signals And Their Samples

00:13:49

0 views

Shannon's sampling theorem is of high importance in signal processing, because it links the continuous-time and discrete-time worlds. For bandlimited signals we can switch from one domain into the other without losing information. In this paper we analyz

Enhanced Mixture Population Monte Carlo Via Stochastic Optimization And Markov Chain Monte Carlo Sampling

00:13:42

0 views

The population Monte Carlo (PMC) algorithm is a popular adaptive importance sampling (AIS) method used for approximate computation of intractable integrals. Over the years, many advances have been made in the theory and implementation of PMC schemes. The

A Monte Carlo Search-Based Triplet Sampling Method For Learning Disentangled Representation Of Impulsive Noise On Steering Gear

00:13:45

0 views

The classification of impact noise in vehicle steering systems mainly addresses the issue of modeling its transient and impulsive nature. Though various deep learning models, including triplet networks, have been developed, the existing triplet network b

An Optimal Symmetric Threshold Strategy For Remote Estimation Over The Collision Channel

00:14:14

0 views

A wireless sensing system with n sensors, observing independent and identically distributed continuous random variables with a symmetric probability density function, and one non-collocated estimator acting as a fusion center is considered. The sensors tr

Depth Map Fingerprinting And Splicing Detection

00:12:26

0 views

With the ubiquity of social networks, images have become crucial in today's exchange of information. Most of these images are taken by smartphones. For forensic approaches relying on fixed image formation pipelines, the capabilities of smartphones using co

Signal-Aware Broadband DOA Estimation Using Attention Mechanisms

00:15:00

0 views

We refer to direction-of-arrival (DOA) estimation of a user-defined subset of directional (desired) sound sources as signal-aware DOA estimation. Source selection, thereby, can be achieved with time-frequency masks to apply attention to TF bins dominate

Subspace-Based Speech Correlation Vector Estimation For Single-Microphone Multi-Frame MVDR Filtering

00:14:59

2 views

Aiming at exploiting the speech correlation across consecutive time-frames in the short-time Fourier transform domain, the multi-frame minimum variance distortionless response (MFMVDR) filter for single-microphone speech enhancement has been proposed. Thi

Revisit Of Estimate Sequence For Accelerated Gradient Method

00:14:53

0 views

In this paper, we revisit the problem of minimizing a convex function $f(\mathbf{x})$ with Lipschitz continuous gradient via accelerated gradient methods (AGM). To do so, we consider the so-called estimate sequence (ES), a useful analysis tool for establi

Feature Drift Resilient Tracking Of The Carotid Artery Wall Using Unscented Kalman Filtering With Data Fusion

00:14:37

0 views

An analysis of the motion of the common carotid artery (CCA) provides effective indicators for cardiovascular diseases. Here, we propose a method for tracking CCA wall motion from a B-mode ultrasound video sequence. An unscented Kalman filter based on a s

Modeling The Environment In Deep Reinforcement Learning: The Case Of Energy Harvesting Base Stations

00:13:20

0 views

In this paper, we focus on the design of energy self-sustainable mobile networks by enabling intelligent energy management that allows the base stations to mostly operate off-grid by using renewable energy. We propose a centralized control algorithm based

Spherical Video Coding With Geometry And Region Adaptive Transform Domain Temporal Prediction

00:13:07

0 views

Many virtual and augmented reality applications depend critically on efficient compression of spherical videos. Current approaches apply a projection geometry to map a spherical video onto the plane(s), wherein a standard codec can be used for compression

Domain Robust, Fast, And Compact Neural Language Models

00:12:58

1 view

Despite advances in neural language modeling, obtaining a good model on a large scale multi-domain dataset still remains a difficult task. We propose training methods for building neural language models for such a task, which are not only domain robust, b

Generalized Linear Bandits With Safety Constraints

00:20:05

0 views

The classical multi-armed bandit is a class of sequential decision making problems where selecting actions incurs costs that are sampled independently from an unknown underlying distribution. Bandit algorithms have many applications in safety critical sys

A Hybrid Approach For Thermographic Imaging With Deep Learning

00:13:30

0 views

We propose a hybrid method for reconstructing thermographic images by combining the recently developed virtual wave concept with deep neural networks. The method can be used to detect defects inside materials in a non-destructive way. We propose two archi

Learning Differentiable Sparse And Low Rank Networks For Audio-Visual Object Localization

00:12:06

0 views

Parsimonious modelling, including sparsity and low rankness, has become a cornerstone in modern machine learning and signal processing. However, these modelling techniques have limited capability to learn from large-scale data, and often require some pre-d

Deep Learning-Based Beam Alignment In mmWave Vehicular Networks

00:15:00

0 views

Millimeter wave channels exhibit structure that allows beam alignment with fewer channel measurements than exhaustive beam search. From a compressed sensing (CS) perspective, the received channel measurements are usually obtained by multiplying a CS matri

Deep CASA For Talker-Independent Monaural Speech Separation

00:13:19

0 views

Monaural speech separation is the task of separating target speech from interference in single-channel recordings. Although substantial progress has been made recently in deep learning based speech separation, previous studies usually focus on a single ty

Stabilizing Multi-Agent Deep Reinforcement Learning By Implicitly Estimating Other Agents’ Behaviors

00:11:13

0 views

Deep reinforcement learning (DRL) is able to learn control policies for many complicated tasks, but its power has not been unleashed to handle multi-agent circumstances. Independent learning, where each agent treats others as part of the environment and

Learning Connectivity And Higher-Order Interactions In Radial Distribution Grids

00:12:56

1 view

To perform any meaningful optimization task, distribution grid operators need to know the topology of their grids. Although power grid topology identification and verification have recently been studied, discovering instantaneous interplay among subsets of

Classification Of Depth And Surface Edges With Deep Features

00:13:20

0 views

Edges in 2D images fall into two categories: depth edges and surface edges, depending on whether the edge corresponds to an abrupt change in depth (the distance from the camera). This edge type is efficient, robust, and effective information in many applica

Tensor-To-Vector Regression For Multi-Channel Speech Enhancement Based On Tensor-Train Network

00:09:18

0 views

We propose a tensor-to-vector regression approach to multi-channel speech enhancement in order to address the issue of input size explosion and hidden-layer size expansion. The key idea is to cast the conventional deep neural network (DNN) based vector-to

A Dual-Staged Context Aggregation Method Towards Efficient End-To-End Speech Enhancement

00:14:02

0 views

In speech enhancement, an end-to-end deep neural network converts a noisy speech signal to a clean speech directly in time domain without time-frequency transformation or mask estimation. However, aggregating contextual information from a high-resolution

Resting-State EEG-Based Biometrics With Signals Features Extracted By Multivariate Empirical Mode Decomposition

00:14:46

0 views

EEG-based biometrics has gained great attention in recent years due to its superiority over traditional biometrics in terms of its resistance to circumvention. While there are numerous choices of data acquisition protocol, the present study is carried out

Training Deep Spiking Neural Networks For Energy-Efficient Neuromorphic Computing

00:24:04

1 view

Spiking Neural Networks (SNNs) encode input information temporally using sparse spiking events, which can be harnessed to achieve higher computational efficiency. However, considering the rapid strides in accuracy enabled by Analog Neural Networks (ANNs),

Speech Emotion Recognition With Local-Global Aware Deep Representation Learning

00:12:57

0 views

Convolutional neural network (CNN) based deep representation learning methods for speech emotion recognition (SER) have demonstrated great success. The basic design of a CNN restricts it to modeling only local information well. Capsule network (CapsN

Multichannel Active Noise Control With Spatial Derivative Constraints To Enlarge The Quiet Zone

00:09:56

0 views

Active noise control is an efficient approach in dealing with unwanted acoustic disturbances. However, most of the active noise control algorithms aim to control the signal of the error sensor leading to local noise attenuation only around the error micro

Speaker Diarization With Session-Level Speaker Embedding Refinement Using Graph Neural Networks

00:12:47

0 views

Deep speaker embedding models have been commonly used as a building block for speaker diarization systems; however, the speaker embedding model is usually trained according to a global loss defined on the training data, which could be sub-optimal for dist

Korean Singing Voice Synthesis Based On Auto-Regressive Boundary Equilibrium GAN

00:10:19

0 views

Singing voice synthesis is a generative task that involves not only multidimensional controls of a singer model such as phonetic modulation by lyrics and pitch control by music score but also expressive elements such as breath sounds and vibrato. Recently

K-Space Trajectory Design For Reduced MRI Scan Time

00:12:13

0 views

The development of compressed sensing (CS) techniques for magnetic resonance imaging (MRI) is enabling a speedup of MRI scanning. To increase the incoherence in the sampling, a random selection of points on the k-space is deployed and a continuous traject

Intelligent Student Behavior Analysis System For Real Classrooms

00:13:27

0 views

In this paper, we design an intelligent student behavior analysis system for recorded classrooms, which automatically detects hand-raising, standing, and sleeping behaviors of students. Detecting these behaviors is quite challenging mainly due to various

Neural Network Training With Approximate Logarithmic Computations

00:24:23

0 views

The high computational complexity associated with training deep neural networks limits online and real-time training on edge devices. This paper proposes an end-to-end training and inference scheme that eliminates multiplications by approximate operations

Multitaper Spectral Granger Causality With Application To SSVEP

00:13:32

0 views

The traditional parametric approach to Granger causality (GC), based on linear vector autoregressive modeling, suffers from difficulties related to the inaccurate modeling of the generative process. These limits can be solved by using non-parametric spect

Learning To Generate Diverse Questions From Keywords

00:11:34

0 views

Diverse text generation has been emerging as an important topic of natural language generation. Traditional studies on question generation mainly investigate how to generate one question based on a given input (one-to-one). In this paper, we focus on a mo

Synchronous Transformers For End-To-End Speech Recognition

00:12:15

0 views

For most of the attention-based sequence-to-sequence models, the decoder predicts the output sequence conditioned on the entire input sequence processed by the encoder. The asynchronous problem between the encoding and decoding makes these models difficul

Multi-Constraint Spectral Co-Design For Colocated MIMO Radar And MIMO Communications

00:13:10

2 views

Single waveform design for automotive joint radar-communications (JRC) is increasingly being considered, as it addresses the problem of spectrum sharing between the two systems. The paper addresses the challenge of designing a waveform in MIMO-ra

From Video Game To Real Robot: The Transfer Between Action Spaces

00:13:21

0 views

Deep reinforcement learning has proven to be successful for learning tasks in simulated environments, but applying the same techniques to robots in the real world is more challenging, as they require hours of training. To address this, transfer learning c

Spatial-Temporal Feature Aggregation Network For Video Object Detection

00:12:41

0 views

Video object detection is a challenging problem in computer vision. In this paper, we propose a novel spatial-temporal feature aggregation network to deal with this issue. Specifically, we present a novel instance-level feature aggregation module as compl

Steepening Squared Error Function Facilitates Online Adaptation Of Gaussian Scales

00:14:11

0 views

We previously proposed a joint learning scheme of Gaussian parameters (scales and centers) and coefficients for online nonlinear estimation. The instantaneous squared error cost in terms of the Gaussian scales, however, tends to have shallow slopes when t

Information Flow Optimization In Inference Networks

00:13:16

0 views

The problem of maximizing the information flow through a sensor network tasked with an inference objective at the fusion center is considered. The sensor nodes take observations, compress and send them to the fusion center through a network of relays. The

Fractional Fourier Transform Based QRS Complex Detection In ECG Signal

00:14:22

0 views

By exploiting fractional-Fourier-transform (FrFT), a novel technique for the QRS complex detection is proposed. The application of the FrFT rotates the Electrocardiograph (ECG) signal in the time-frequency plane. We claim this rotation can give simple and

Singing Voice Conversion With Disentangled Representations Of Singer And Vocal Technique Using Variational Autoencoders

00:14:52

0 views

We propose a flexible framework that deals with both singer conversion and singers' vocal technique conversion. The proposed model is trained on non-parallel corpora, accommodates many-to-many conversion, and leverages recent advances of variational autoen

Federated Truth Inference Over Distributed Crowdsourcing Platforms

00:11:37

0 views

This work examines the truth inference problem in a distributed crowdsourcing scenario. Labeling tasks are outsourced to workers associated with different platforms, and truth inference is to be performed without sharing the workers' individual responses

Forecasting Multi-Dimensional Processes Over Graphs

00:14:59

0 views

The forecasting of multi-variate time processes through graph-based techniques has recently been addressed under the graph signal processing framework. However, problems in the representation and the processing arise when each time series carries a vector

Principal Angle Detector For Subspace Signal With Structured Unknown Interference

00:19:25

0 views

Detecting subspace signals is an important problem in radar and sonar signal processing, hyperspectral image processing, wireless communication, and other fields. Among these problems, a typical scenario is that one needs to detect a signal lying in a giv