IEEE ICASSP 2020 Virtual Conference May 2020

Thu, 16 July, 2020

IEEE Member: US $11.00
Society Member: US $0.00
IEEE Student Member: US $11.00
Non-IEEE Member: US $15.00

Training Code-Switching Language Model With Monolingual Data

00:12:05

0 views

A lack of code-switching data complicates the training of code-switching (CS) language models. We propose an approach to train such CS language models on monolingual data only. By constraining and normalizing the output projection matrix in RNN-based lang

Graph Metric Learning Via Gershgorin Disc Alignment

00:16:52

4 views

We propose a fast general projection-free metric learning framework, where the minimization objective $\min_{M \in \mathcal{S}} Q(M)$ is a convex differentiable function of the metric matrix $M$, and $M$ resides in the set $\mathcal{S}$ of generalized graph Laplacian

Pose Refinement: Bridging The Gap Between Unsupervised Learning And Geometric Methods For Visual Odometry

00:12:55

0 views

Unsupervised learning-based monocular visual odometry (VO) has lately drawn significant attention owing to its potential for label-free learning and its robustness to camera parameters and environmental variations. However, due to the lack of pose optim

Spatial Attentional Bilinear 3D Convolutional Network For Video-Based Autism Spectrum Disorder Detection

00:11:07

0 views

Video-based Autism Spectrum Disorder (ASD) detection is a challenge to most video classification networks due to the high degree of similarity between categories. Bilinear pooling is a second-order method, which is widely used in fine-grained visual recog

Efficient Constrained Encoders Correcting A Single Nucleotide Edit In DNA Storage

00:15:48

0 views

A nucleotide substitution is said to occur when a base in {A, T} is substituted for a base in {C, G}, or vice versa. A recent experiment (Heckel et al. 2019) showed that a nucleotide substitution occurs with a significantly higher probability than other sub

Transformer-Based Text-To-Speech With Weighted Forced Attention

00:14:01

0 views

This paper investigates state-of-the-art Transformer- and FastSpeech-based high-fidelity neural text-to-speech (TTS) with full-context label input for pitch accent languages. The aim is to realize faster training than conventional Tacotron-based models. I

Redundant Convolutional Network With Attention Mechanism For Monaural Speech Enhancement

00:13:41

0 views

The redundant convolutional encoder decoder network has proven useful in speech enhancement tasks. It can capture localized time-frequency details of speech signals through both the fully convolutional network structure and feature selection capability re

Predicting Word Error Rate For Reverberant Speech

00:12:52

0 views

Reverberation negatively impacts the performance of automatic speech recognition (ASR). Prior work on quantifying the effect of reverberation has shown that clarity (C50), a parameter that can be estimated from the acoustic impulse response, is correlated

Multi-Label Consistent Convolutional Transform Learning: Application To Non-Intrusive Load Monitoring

00:08:21

0 views

Convolutional transform learning is an unsupervised framework we introduced recently, for feature generation based on learnt convolutions. In this work, we propose a supervised formulation for convolutional transform so as to address the multi-label class

Unsupervised Multiple Source Localization Using Relative Harmonic Coefficients

00:13:50

2 views

This paper presents an unsupervised multi-source localization algorithm using a recently introduced feature called the relative harmonic coefficients. We derive a closed-form expression of the feature and briefly summarize its unique properties. We then e

Data Augmentation Using Empirical Mode Decomposition On Neural Networks To Classify Impact Noise In Vehicle

00:13:18

0 views

In a vehicle, impact noise may occur during steering due to clearance between parts of the steering system. The noise reaches the driver's ears via a structural path and can be the cause of a repair campaign. It is important to know where th

Learning Blind Denoising Network For Noisy Image Deblurring

00:12:12

0 views

Noisy image deblurring aims to restore a blurry image in the presence of random noise. The key to this problem is to know the noise level in each iteration. The existing methods manually adjust the regularization parameter for varying noise levels, wh

Global Traffic State Recovery Via Local Observations With Generative Adversarial Networks

00:14:00

0 views

Traffic signal control for a large-scale traffic network is a challenging problem in intelligent transportation systems (ITS). High communication overheads are typically required to achieve optimal control of the traffic signals in multiple road int

Low-Rank Approximation Of Matrices Via A Rank-Revealing Factorization With Randomization

00:14:43

0 views

Given a matrix A with numerical rank k, the two-sided orthogonal decomposition (TSOD) computes a factorization A = UDV^T, where U and V are unitary, and D is (upper/lower) triangular. TSOD is rank-revealing as the middle factor D reveals the rank of A. T

In-Network Caching For Hybrid Satellite-Terrestrial Networks Using Deep Reinforcement Learning

00:15:17

0 views

The large number of redundant requests in wireless networks has led to hybrid satellite-terrestrial networks, where a satellite is used for content placement at edge caches at base stations (BSs), thereby reducing backhaul link usage. In this paper, we

Multi-Task Center-Of-Pressure Metrics Estimation From Skeleton Using Graph Convolutional Network

00:11:43

0 views

Center of pressure (COP) is an important measurement of postural and gait control in human biomechanical studies. A vision-based estimation of COP metrics offers a way to obtain these gold-standard metrics for the detection of balance and gait problems. I

An Ensemble Based Approach For Generalized Detection Of Spoofing Attacks To Automatic Speaker Recognizers

00:10:45

0 views

As automatic speaker recognizer systems become mainstream, voice spoofing attacks are on the rise. Common attack strategies include replay, the use of text-to-speech synthesis, and voice conversion systems. While previously-proposed end-to-end detection f

Selective Convolutional Network: An Efficient Object Detector With Ignoring Background

00:12:08

0 views

It is well known that attention mechanisms can effectively improve the performance of many CNNs including object detectors. Instead of refining feature maps prevalently, we reduce the prohibitive computational complexity by a novel attempt at attention. T

Gaussian Process Imputation Of Multiple Financial Series

00:14:32

0 views

In Financial Signal Processing, multiple time series such as financial indicators, stock prices and exchange rates are strongly coupled due to their dependence on the latent state of the market and therefore they are required to be jointly analysed. We fo

Low-Rank mmWave MIMO Channel Estimation In One-Bit Receivers

00:14:15

0 views

Receivers with one-bit analog-to-digital converters (ADCs) are promising for high bandwidth millimeter wave (mmWave) systems as they consume less power than their full resolution counterparts. The extreme quantization in one-bit receivers and the use of l

Just Noticeable Distortion Based Perceptually Lossless Intra Coding

00:11:04

0 views

Perceptual video coding plays a very important role in video codec optimization aiming at removing the perceptual redundancies in video content. In this paper, a just noticeable distortion (JND) guided perceptually lossless coding framework is proposed fo

Forecasting Sparse Traffic Congestion Patterns Using Message-Passing RNNs

00:20:47

0 views

The ability to forecast traffic congestion ahead of time given road conditions has remained a prominent problem in road traffic analysis. In this work, we leverage mobility traces of public transport vehicles tracked by the New York City MTA and formulate

Attentive Modality Hopping Mechanism For Speech Emotion Recognition

00:16:18

0 views

In this work, we explore the impact of visual modality in addition to speech and text for improving the accuracy of the emotion detection system. The traditional approaches tackle this task by independently fusing the knowledge from the various modalities

Low-Complexity LSTM-Assisted Bit-Flipping Algorithm For Successive Cancellation List Polar Decoder

00:12:39

0 views

Polar codes have attracted much attention in the past decade due to their capacity-achieving performance. Higher decoding capacity is required for 5G and beyond 5G (B5G). Although the cyclic redundancy check (CRC)-assisted successive cancellation lis

Subjective Quality Estimation Using PESQ For Hands-Free Terminals

00:15:07

0 views

Previous reports have mentioned the possibility that subjective quality of the echo-suppressed speech signal can be estimated based on perceptual evaluation of speech quality (PESQ), but there are few experimental results. We propose third-party listening

Prediction Of Voicing And The F0 Contour From Electromagnetic Articulography Data For Articulation-To-Speech Synthesis

00:13:36

0 views

Articulation-to-speech synthesis based solely on supraglottal articulation requires some sort of intonation control. This paper examines to what extent the f0 contour of an utterance can be predicted from such supraglottal articulation data. To that end,

Bayesian Multiple Change-Point Detection With Limited Communication

00:14:54

0 views

Several modern applications involve large-scale sensor networks for statistical inference. For example, such sensor networks are of significant interest for Internet of Things applications. In this paper, we consider Bayesian multiple change-point detecti

Hand-3D-Studio: A New Multi-View System For 3D Hand Reconstruction

00:12:18

0 views

This paper proposes a new system named Hand-3D-Studio to capture 3D hand pose and shape information. Our system includes 15 synchronized DSLR cameras, which can acquire high-quality multi-view 4K resolution color images in a circular manner. We the

Source Enumeration Via Toeplitz Matrix Completion

00:13:48

5 views

This paper addresses the problem of source enumeration by an array of sensors in the presence of noise whose spatial covariance structure is a diagonal matrix with possibly different variances, referred to as non-iid noise hereafter, when the sources are unc

Multimodal Transformer Fusion For Continuous Emotion Recognition

00:13:29

0 views

Multimodal fusion increases the performance of emotion recognition because of the complementarity of different modalities. Compared with decision level and feature level fusion, model level fusion makes better use of the advantages of deep neural networks

End-To-End Training Of Time Domain Audio Separation And Recognition

00:13:52

0 views

The rising interest in single-channel multi-speaker speech separation has sparked the development of End-to-End (E2E) approaches to multi-speaker speech recognition. However, up until now, state-of-the-art neural network-based time domain source separation has not

Interrupted And Cascaded Permutation Invariant Training For Speech Separation

00:11:52

0 views

Permutation Invariant Training (PIT) has long been a stepping stone method for training speech separation models to handle the label ambiguity problem. With PIT selecting the minimum cost label assignments dynamically, very few studies considered the sep

DNN-Based Distributed Multichannel Mask Estimation For Speech Enhancement In Microphone Arrays

00:13:06

0 views

Multichannel processing is widely used for speech enhancement, but several limitations appear when trying to deploy these solutions in the real world. Distributed sensor arrays that consider several devices with a few microphones are a viable solution which

Performance Analysis For Path Attenuation Estimation Of Microwave Signals Due To Rainfall And Beyond

00:14:03

0 views

The attenuation of microwave signals can be used for meteorological observations. For example, the received signal level (RSL) of backhaul links of cellular systems, which usually has quantization error of 0.1 dB or more for commercial systems, has been u

A Bidirectional Context Propagation Network For Urine Sediment Particle Detection In Microscopic Images

00:12:27

0 views

Microscopic urine sediment examination is a crucial part of the evaluation of renal and urinary tract diseases. Recently, CNN-based detectors have emerged that detect urine sediment particles in an end-to-end manner. However, it is not very c

Hierarchical Attention Transfer Networks For Depression Assessment From Speech

00:10:39

0 views

A growing area of mental health research is the search for speech-based objective markers for conditions such as depression. However, when combined with machine learning, this search can be challenging due to a limited amount of annotated training data. I

Group-Utility Metric For Efficient Sensor Selection And Removal In LCMV Beamformers

00:16:28

0 views

In sensor arrays or sensor networks, tracking each sensor's utility helps in excluding those which do not sufficiently contribute to the task at hand, thereby reducing energy consumption or avoiding model overfitting. In a linearly-constrained minimum var

Optimal Sampling Rate And Bandwidth Of Bandlimited Signals - An Algorithmic Perspective

00:13:08

0 views

The bandwidth of a bandlimited signal is a key quantity that is relevant in numerous applications. For example, it determines the minimum sampling rate that is necessary to reconstruct a bandlimited signal from its samples. In this paper we study if it is

Learning Spatio-Temporal Representations With Temporal Squeeze Pooling

00:09:31

0 views

In this paper, we propose a new video representation learning method, named Temporal Squeeze (TS) pooling, which can extract the essential movement information from a long sequence of video frames and map it into a set of few images, named Squeezed Images

Improving Sample-Efficiency In Reinforcement Learning For Dialogue Systems By Using Trainable-Action-Mask

00:16:04

0 views

By interacting with humans and learning from reward signals, reinforcement learning is an ideal way to build conversational AI. Given the expense of real users' responses, improving sample-efficiency has been the key issue when applying reinforcement

Encoding Temporal Information For Automatic Depression Recognition From Facial Analysis

00:17:48

0 views

Depression is a mental illness that may be harmful to an individual's health. Using deep learning models to recognize the facial expressions of individuals captured in videos has shown promising results for automatic depression detection. Typically, depre

Cross Lingual Transfer Learning For Zero-Resource Domain Adaptation

00:16:53

0 views

We propose a method for zero-resource domain adaptation of DNN acoustic models, for use in low-resource situations where the only in-language training data available may be poorly matched to the intended target domain. Our method uses a multi-lingual mode

Joint Scheduling And Beamforming For Delay Sensitive Traffic With Priorities And Deadlines

00:13:54

0 views

Packet scheduling in 5G networks can significantly affect the performance of beamforming techniques since the allocation of multiple users to the same time-frequency block causes interference between users. A combination of beamforming and scheduling ca

Self-Driven Graph Volterra Models For Higher-Order Link Prediction

00:14:23

0 views

Link prediction is one of the core problems in network and data science with widespread applications. While predicting pairwise nodal interactions (links) in network data has been investigated extensively, predicting higher-order interactions (higher-orde

Blind Bounded Source Separation Using Neural Networks With Local Learning Rules

00:13:00

1 view

An important problem encountered by both natural and engineered signal processing systems is blind source separation. In many instances of the problem, the sources are bounded by their nature and known to be so, even though the particular bound may not be

Robust Covariance Matrix Estimation And Portfolio Allocation: The Case Of Non-Homogeneous Assets

00:14:12

0 views

This paper presents how the most recent improvements made on covariance matrix estimation and model order selection can be applied to the portfolio optimisation problem. The particular case of the Maximum Variety Portfolio is treated but the same improvem

A Deep Learning Approach To Object Affordance Segmentation

00:12:41

0 views

Learning to understand and infer object functionalities is an important step towards robust visual intelligence. Significant research efforts have recently focused on segmenting the object parts that enable specific types of human-object interaction, the

Alignment-Length Synchronous Decoding For RNN Transducer

00:15:46

0 views

We present a beam decoding strategy for recurrent neural network transducers which has the characteristic that all competing hypotheses within the beam have the same alignment length (number of output symbols plus BLANK symbols). We contrast the proposed

Generalized Spatial Modulation For Wireless Terabits Systems Under Sub-THz Channel With RF Impairments

00:12:05

0 views

Multiple-Input Multiple-Output (MIMO) techniques with Index Modulation (IM) over sub-TeraHertz (sub-THz) bands represent a promising solution for designing new wireless ultra-high data rate systems. However, the system design over sub-THz bands suffers from ma
