IEEE ICASSP 2020 Virtual Conference May 2020

Thu, 16 July, 2020

Showing 1901 - 1950 of 1951

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Ertis: Real-Time 3D Acoustic Sonar Imaging Using Sparse Microphone Arrays

00:09:06

1 view

In recent years, our research group has developed state of the art 3D sonar sensors which use a low-cost MEMS microphone array for real-time acoustic imaging in air. Using this sensor, various robotic applications have been developed, including obstacle a

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Novel Method For Obtaining Diffuse Field Measurements For Microphone Calibration

00:05:48

0 views

NOVELTY OF THE DEMO: Is it possible to obtain a diffused field response of a microphone array and perform calibration in under a minute? If such a method exists, is it possible to achieve an accuracy of half a dB from the expected response? The answer to

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

From Compressed Sensing to Deep Learning: Tasks, Structures, and Models

00:56:48

4 views

From Compressed Sensing to Deep Learning: Tasks, Structures, and Models.
Presenter: Yonina Eldar, ICASSP 2020.

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Attentive Item2Vec: Neural Attentive User Representations

00:14:04

0 views

Factorization methods for recommender systems tend to represent users as a single latent vector. However, user behavior and interests may change in the context of the recommendations that are presented to the user. For example, in the case of movie recomm

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Supervised Canonical Correlation Analysis Of Data On Symmetric Positive Definite Manifolds By Riemannian Dimensionality Reduction

00:14:10

0 views

Most computer vision problems entail data that reside on Riemannian manifolds. Canonical correlation analysis (CCA) is a powerful method that captures correlations between any two sets of matrices. In this paper, we propose a framework for a supervised CC

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Dynamic Oversampling In 1-Bit Quantized Asynchronous Large-Scale Multiple-Antenna Systems For Sustainable Iot Networks

00:21:13

0 views

In this paper, we propose a dynamic oversampling technique for asynchronous large-scale multiple-antenna systems with 1-bit analog-to-digital converters at the base station that is suitable for sustainable internet of things and cellular networks. To the

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Conditional Density Driven Grid Design In Point-Mass Filter

00:13:18

0 views

The paper is devoted to the state estimation of nonlinear stochastic dynamic systems. The stress is laid on a grid-based numerical solution to the Bayesian recursive relations using the point-mass filter (PMF). In the paper, a novel conditional density dr

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Camera Configuration Design In Cooperative Active Visual 3D Reconstruction: A Statistical Approach

00:13:20

0 views

Visual 3D reconstruction is an essential technique in computer vision which restores the 3D model of the scene from multi-view images. In this paper, we propose a statistical framework for the active visual 3D reconstruction. We first derive a closed-form

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Real Time Implementation Of A Bayer Domain Image Deblurring Core For Optical Blur Compensation

00:12:36

0 views

In this letter, we present an implementation of deblurring hardware to mitigate blur incurred by optical aberrations in a real-time manner to increase resolution for mobile camera modules. As optical aberrations tend to be variant according to spatial loc

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Trace Norm Generative Adversarial Networks For Sensor Generation And Feature Extraction

00:12:42

0 views

Generative Adversarial Networks (GANs) have been shown effective to generate realistic enough sensor data for industrial failure prediction. Compared to computer vision problems, where it is very common to have more than 1000 classes, the number of classe

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Multichannel Kalman-Based Wiener Filter Approach For Speaker Interference Reduction In Meetings

00:14:47

0 views

Recording a meeting and obtaining clean speech signals of each speaker is a challenging task. Even with a multichannel recording, in which all speakers are equipped with a close-talk microphone, speech of an active speaker still couples not only into his

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Simplified Dynamic Sc-Flip Polar Decoding

00:14:47

0 views

SC-Flip (SCF) decoding is a low-complexity polar code decoding algorithm alternative to SC-List (SCL) algorithm with small list sizes. To achieve the performance of the SCL algorithm with large list sizes, the Dynamic SC-Flip (DSCF) algorithm was proposed

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Full Reference Video Quality Measures Improvement Using Neural Networks

00:12:39

0 views

The accuracy of video quality metrics (VQMs) is an important issue for several applications. In this work, first we observe that the accuracy of several video quality metrics (VQMs) is strongly related to the spatial complexity index (SI) of the source. I

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Non-Uniform Video Time-Lapse Method Based On Motion Scenario And Stabilization Constraint

00:13:29

0 views

Time-lapse of user captured video becomes popular in many applications recently, non-uniform sampling and digital video stabilization (VS) are usually two independent steps to keep meaningful contents and provide stabilized output. However, non-uniform sa

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Federated Learning With Quantization Constraints

00:15:55

0 views

Traditional deep learning models are trained on centralized servers using labeled sample data collected from edge devices. This data often includes private information, which the users may not be willing to share. Federated learning (FL) is an emerging ap

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Estimating The Degree Of Sleepiness By Integrating Articulatory Feature Knowledge In Raw Waveform Based Cnns

00:13:21

0 views

Speech-based degree of sleepiness estimation is an emerging research problem. This paper investigates an end-to-end approach, where given raw waveform as input, a convolutional neural network (CNN) estimates at its output the degree of sleepiness. Within

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Triplet Loss Feature Aggregation For Scalable Hash

00:14:43

0 views

The increasing demands of high resolution and quality aggravate the status of heavy burden of cluster storage side and restricted bandwidth resources. Hence, video de-duplication in storage and transmission is becoming an important feature for video cloud

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Sequential Semi-Orthogonal Multi-Level Nmf With Negative Residual Reduction For Network Embedding

00:13:00

0 views

Network embedding is intended to produce low-dimensional vector representations of nodes in a network to preserve and extract the latent network structure, which has higher robustness to noise, outliers, and redundant data. Although a recently proposed mu

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Improved Probability Modelling For Exception Handling In Lossless Screen Content Coding

[2 Videos ]

Competitive methods for lossless screen content coding are based on modelling of probability distributions. The most effective approach for losslessly compressing images with up to 90000 colours is known as `soft context formation' (SCF). It scans the ima

Show videos in this product

Improved Probability Modelling For Exception Handling In Lossless Screen Content Coding

00:13:44

0 views

Competitive methods for lossless screen content coding are based on modelling of probability distributions. The most effective approach for losslessly compressing images with up to 90000 colours is known as `soft context formation' (SCF). It scans the ima
Improved Probability Modelling For Exception Handling In Lossless Screen Content Coding

00:00:00

0 views

Competitive methods for lossless screen content coding are based on modelling of probability distributions. The most effective approach for losslessly compressing images with up to 90000 colours is known as `soft context formation' (SCF). It scans the ima

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Ensemble Network For Ranking Images Based On Visual Appeal

00:12:04

0 views

We propose a computational framework for ranking images (group photos) taken at the same event within a short time span. The ranking is expected to correspond with human perception of overall appeal of the images. We hypothesize (and provide evidence thro

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Framework For The Robust Evaluation Of Sound Event Detection

00:15:00

0 views

This work defines a new framework for performance evaluation of polyphonic sound event detection (SED) systems, which overcomes the limitations of the conventional collar-based event decisions, event F-scores and event error rates. The proposed framework

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Compressing Flow Fields With Edge-Aware Homogeneous Diffusion Inpainting

00:13:25

0 views

In spite of the fact that efficient compression methods for dense two-dimensional flow fields would be very useful for modern video codecs, hardly any research has been performed in this area so far. Our paper addresses this problem by proposing the first

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Audio Feature Extraction For Vehicle Engine Noise Classification

00:13:50

0 views

In this paper we propose a new scheme for vehicle engine noise classification as a more privacy-preserving alternative to classifying vehicles based on video recordings. We establish two scenarios: diesel vs. petrol and heavy goods vehicle vs. personal ca

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Training Code-Switching Language Model With Monolingual Data

00:12:05

0 views

A lack of code-switching data complicates the training of code-switching (CS) language models. We propose an approach to train such CS language models on monolingual data only. By constraining and normalizing the output projection matrix in RNN-based lang

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Graph Metric Learning Via Gershgorin Disc Alignment

00:16:52

4 views

We propose a fast general projection-free metric learning framework, where the minimization objective $min_{M in cS} Q(M)$ is a convex differentiable function of the metric matrix $M$, and $M$ resides in the set $cS$ of generalized graph Laplacian

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Pose Refinement: Bridging The Gap Between Unsupervised Learning And Geometric Methods For Visual Odometry

00:12:55

0 views

Unsupervised Learning based monocular visual odometry (VO) has lately drawn significant attention owing to its potential in label-free leaning ability and robustness to camera parameters and environmental variations. However, due to the lack of pose optim

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Spatial Attentional Bilinear 3D Convolutional Network For Video-Based Autism Spectrum Disorder Detection

00:11:07

0 views

Video-based Autism Spectrum Disorder (ASD) detection is a challenge to most video classification networks due to the high degree of similarity between categories. Bilinear pooling is a second-order method, which is widely used in fine-grained visual recog

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Efficient Constrained Encoders Correcting A Single Nucleotide Edit In Dna Storage

00:15:48

0 views

A nucleotide substitution is said to occur when a base in {A, T} is substituted for a base in {C, G}, or vice versa. Recent experiment (Heckel et al. 2019) showed that a nucleotide substitution occurs with a significantly higher probability that other sub

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Transformer-Based Text-To-Speech With Weighted Forced Attention

00:14:01

0 views

This paper investigates state-of-the-art Transformer- and FastSpeech-based high-fidelity neural text-to-speech (TTS) with full-context label input for pitch accent languages. The aim is to realize faster training than conventional Tacotron-based models. I

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Redundant Convolutional Network With Attention Mechanism For Monaural Speech Enhancement

00:13:41

0 views

The redundant convolutional encoder decoder network has proven useful in speech enhancement tasks. It can capture localized time-frequency details of speech signals through both the fully convolutional network structure and feature selection capability re

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Predicting Word Error Rate For Reverberant Speech

00:12:52

0 views

Reverberation negatively impacts the performance of automatic speech recognition (ASR). Prior work on quantifying the effect of reverberation has shown that clarity (C50), a parameter that can be estimated from the acoustic impulse response, is correlated

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Multi-Label Consistent Convolutional Transform Learning: Application To Non-Intrusive Load Monitoring

00:08:21

0 views

Convolutional transform learning is an unsupervised framework we introduced recently, for feature generation based on learnt convolutions. In this work, we propose a supervised formulation for convolutional transform so as to address the multi-label class

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Unsupervised Multiple Source Localization Using Relative Harmonic Coefficients

00:13:50

2 views

This paper presents an unsupervised multi-source localization algorithm using a recently introduced feature called the relative harmonic coefficients. We derive a closed-form expression of the feature and briefly summarize its unique properties. We then e

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Data Augmentation Using Empirical Mode Decomposition On Neural Networks To Classify Impact Noise In Vehicle

00:13:18

0 views

In a vehicle, impact noise may occur during steering action due to clearance between parts of steering systems. Via structural path the noise is perceived by the drivers? ears and it can be the cause of a repair campaign. It is importatnt to know where th

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Learning Blind Denoising Network For Noisy Image Deblurring

00:12:12

0 views

Noisy image deblurring is to recover the blurry image in the presence of the random noise. The key to this problem is to know the noise level in each iteration. The existing methods manually adjust the regularization parameter for varying noise levels, wh

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Global Traffic State Recovery Via Local Observations With Generative Adversarial Networks

00:14:00

0 views

Traffic signal control for a large-scale traffic network is one challenging problem in intelligent transportation systems (ITS). High communication overheads are typically required to achieve the optimal control of the traffic signals in multiple road int

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Low-Rank Approximation Of Matrices Via A Rank-Revealing Factorization With Randomization

00:14:43

0 views

Given a matrix A with numerical rank k, the two-sided orthogonal decomposition (TSOD) computes a factorization A = UDV^T , where U and V are unitary, and D is (upper/lower) triangular. TSOD is rank-revealing as the middle factor D reveals the rank of A. T

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

In-Network Caching For Hybrid Satellite-Terrestrial Networks Using Deep Reinforcement Learning

00:15:17

0 views

Large number of redundant requests in wireless networks have lead to the hybrid satellite-terrestrial networks, where a satellite is used for content placement at edge caches at base stations (BSs), thereby reducing backhaul link usage. In this paper, we

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Multi-Task Center-Of-Pressure Metrics Estimation From Skeleton Using Graph Convolutional Network

00:11:43

0 views

Center of pressure (COP) is an important measurement of postural and gait control in human biomechanical studies. A vision-based estimation of COP metrics offers a way to obtain these gold-standard metrics for the detection of balance and gait problems. I

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

An Ensemble Based Approach For Generalized Detection Of Spoofing Attacks To Automatic Speaker Recognizers

00:10:45

0 views

As automatic speaker recognizer systems become mainstream, voice spoofing attacks are on the rise. Common attack strategies include replay, the use of text-to-speech synthesis, and voice conversion systems. While previously-proposed end-to-end detection f

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Selective Convolutional Network: An Efficient Object Detector With Ignoring Background

00:12:08

0 views

It is well known that attention mechanisms can effectively improve the performance of many CNNs including object detectors. Instead of refining feature maps prevalently, we reduce the prohibitive computational complexity by a novel attempt at attention. T

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Gaussian Process Imputation Of Multiple Financial Series

00:14:32

0 views

In Financial Signal Processing, multiple time series such as financial indicators, stock prices and exchange rates are strongly coupled due to their dependence on the latent state of the market and therefore they are required to be jointly analysed. We fo

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Low-Rank Mmwave Mimo Channel Estimation In One-Bit Receivers

00:14:15

0 views

Receivers with one-bit analog-to-digital converters (ADCs) are promising for high bandwidth millimeter wave (mmWave) systems as they consume less power than their full resolution counterparts. The extreme quantization in one-bit receivers and the use of l

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Just Noticeable Distortion Based Perceptually Lossless Intra Coding

00:11:04

0 views

Perceptual video coding plays a very important role in video codec optimization aiming at removing the perceptual redundancies in video content. In this paper, a just noticeable distortion (JND) guided perceptually lossless coding framework is proposed fo

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Forecasting Sparse Traffic Congestion Patterns Using Message-Passing Rnns

00:20:47

0 views

The ability to forecast traffic congestion ahead of time given road conditions has remained a prominent problem in road traffic analysis. In this work, we leverage mobility traces of public transport vehicles tracked by the New York City MTA and formulate

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Attentive Modality Hopping Mechanism For Speech Emotion Recognition

00:16:18

0 views

In this work, we explore the impact of visual modality in addition to speech and text for improving the accuracy of the emotion detection system. The traditional approaches tackle this task by independently fusing the knowledge from the various modalities

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Low-Complexity Lstm-Assisted Bit-Flipping Algorithm For Successive Cancellation List Polar Decoder

00:12:39

0 views

Polar codes have attracted much attention in the past decade due to their capacity-achieving performance. The higher decoding capacity is required for 5G and beyond 5G (B5G). Although the cyclic redundancy check (CRC)- assisted successive cancellation lis

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Subjective Quality Estimation Using Pesq For Hands-Free Terminals

00:15:07

0 views

Previous reports have mentioned the possibility that subjective quality of the echo-suppressed speech signal can be estimated based on perceptual evaluation of speech quality (PESQ), but there are few experimental results. We propose third-party listening

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Prediction Of Voicing And The F0 Contour From Electromagnetic Articulography Data For Articulation-To-Speech Synthesis

00:13:36

0 views

Articulation-to-speech synthesis based solely on supraglottal articulation requires some sort of intonation control. This paper examines to what extent the f0 contour of an utterance can be predicted from such supraglottal articulation data. To that end,

All Channels page: Communities submenu block

Communities

All Channels page: Societies submenu block

Societies

Events Showcase: ES submenu block

Event showcases

Recently Added Speakers

Events Hub Submenu block

Education: Education submenu block

Education Activity

2020 EAB AWARDS

2020 EAB AWARDS

IEEE ICASSP 2020 Virtual Conference May 2020