IEEE ICASSP 2020 Virtual Conference May 2020

Thu, 16 July, 2020

Showing 1601 - 1650 of 1951

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Video Frame Interpolation Via Residue Refinement

00:12:06

0 views

Video frame interpolation achieves temporal super-resolution by generating smooth transitions between frames. Although great success has been achieved by deep neural networks, the synthesized images stills suffer from poor visual appearance and unsatisfie

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Single-Channel Speech Separation Integrating Pitch Information Based On A Multi Task Learning Framework

00:14:58

0 views

Pitch is a critical cue for speech separation in humans? auditory perception. Although the technology of tracking pitch in single-talker speech succeeds in many applications, it?s still a challenging problem to extract pitch information from speech mixtur

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Low Mutual And Average Coherence Dictionary Learning Using Convex Approximation

00:14:05

0 views

In dictionary learning, a desirable property for the dictionary is to be of low mutual and average coherences. Mutual coherence is defined as the maximum absolute correlation between distinct atoms of the dictionary, whereas the average coherence is a mea

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Robust Online Mirror Saddle-Point Method For Constrained Resource Allocation

00:13:34

0 views

Online-learning literature has focused on designing algorithms that ensure sub-linear growth of the cumulative long-term constraint violations. The drawback of this guarantee is that strictly feasible actions may cancel out constraint violations on other

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Adaptive Subspace Detectors For Off-Grid Mismatched Targets

00:15:19

1 view

Abstract In classical detection framework, the parameter space is usually discretized, so that in reality received parameter dependent signals are never perfectly aligned with the signal model under test: it leads to the off-grid signal mismatch. In a Gau

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Corrgan: Sampling Realistic Financial Correlation Matrices Using Generative Adversarial Networks

00:15:08

0 views

We propose a novel approach for sampling realistic financial correlation matrices. This approach is based on generative adversarial networks. Experiments demonstrate that generative adversarial networks are able to recover most of the known stylized facts

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Triggerless Random Interleaved Sampling

00:15:00

0 views

A single short sequence of samples taken at sub-Nyquist rate rarely allows for periodic signal recovery. If there is more than one such sequence and time offsets between these sequences are given, the signal approximation is possible and is known as equiv

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Ava Active Speaker: An Audio-Visual Dataset For Active Speaker Detection

00:14:36

0 views

Active speaker detection is an important component in video analysis algorithms for applications such as speaker diarization, video re-targeting for meetings, speech enhancement, and human-robot interaction. The absence of a large, carefully labeled audio

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Self-Supervised Deep Learning For Fisheye Image Rectification

00:13:31

0 views

To rectify fisheye distortion from a single image, we advance self-supervised learning strategies and propose a unique deep learning model of Fisheye GAN (FE-GAN). Our FEGAN learns pixel-level distortion flow from sets of fisheye distorted images and dist

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Efficient Techniques For In-Band System Information Broadcast In Multi-Cell Massive Mimo

00:14:46

0 views

In this paper we consider joint beamforming of data to scheduled terminals (STs) and broadcast of system information (SI) to idle terminals (ITs) on the same time-frequency resource in multi-cell multi-user massive MIMO systems. We propose two different m

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Speech Emotion Recognition With Dual-Sequence Lstm Architecture

00:15:00

0 views

Speech Emotion Recognition (SER) has emerged as a critical component of the next generation of human-machine interfacing technologies. In this work, we propose a new dual-level model that predicts emotions based on both MFCC features and mel-spectrograms

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Online Community Detection By Spectral Cusum

00:15:00

0 views

We present an online community change detection algorithm called {it spectral CUSUM} to detect the emergence of a community using a subspace projection procedure based on a Gaussian model setting. Theoretical analysis is provided to characterize the aver

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Channel-Attention Dense U-Net For Multichannel Speech Enhancement

00:12:40

0 views

Supervised deep learning has gained significant attention for speech enhancement recently. The state-of-the-art deep learning methods perform the task by learning a ratio/binary mask that is applied to the mixture in the time-frequency domain to produce c

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Neural Lattice Search For Speech Recognition

00:12:15

1 view

To improve the accuracy of automatic speech recognition, a two-pass decoding strategy is widely adopted. The first-pass model generates compact word lattices, which are utilized by the second-pass model to perform rescoring. Currently, the most popular re

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Beam-Tasnet: Time-Domain Audio Separation Network Meets Frequency-Domain Beamformer

00:14:56

0 views

Recent studies have shown that acoustic beamforming using a microphone array plays an important role in the construction of high-performance automatic speech recognition (ASR) systems, especially for noisy and overlapping speech conditions. In parallel wi

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Flexibly-Tunable Bitcube-Based Perceptual Encryption Within Jpeg Compression

[2 Videos ]

We propose a perceptual encryption within JPEG compression (EWJ). Although some of the conventional EWJs have the `tunability,' which is a property of how many perceptual degradation levels can be provided with the single encryption technique, it is insuf

Show videos in this product

Flexibly-Tunable Bitcube-Based Perceptual Encryption Within Jpeg Compression

00:13:40

0 views

We propose a perceptual encryption within JPEG compression (EWJ). Although some of the conventional EWJs have the `tunability,' which is a property of how many perceptual degradation levels can be provided with the single encryption technique, it is insuf
Flexibly-Tunable Bitcube-Based Perceptual Encryption Within Jpeg Compression

00:00:00

0 views

We propose a perceptual encryption within JPEG compression (EWJ). Although some of the conventional EWJs have the `tunability,' which is a property of how many perceptual degradation levels can be provided with the single encryption technique, it is insuf

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Processing Convolutional Neural Networks On Cache

00:13:00

0 views

With the advent of Big Data application domains, several Machine Learning (ML) signal-processing algorithms such as Convolutional Neural Networks (CNNs) are required to process progressively larger datasets at a great cost in terms of both compute power a

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Vimo: Vital Sign Monitoring Using Commodity Millimeter Wave Radio

00:14:37

0 views

Accurate monitoring of human vital signs (e.g. breathing and heart rates) is crucial in detecting medical problems. In this paper, we propose ViMo, a calibration-free remote Vital sign Monitoring system that can simultaneously monitor multiple users by le

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Recursive Bayesian Solution For The Excess Over Threshold Distribution With Stochastic Parameters

00:16:28

0 views

In this paper, we propose a new approach for analyzing extreme values that are witnessed in financial markets. Our goal is to compute the predictive distribution of extreme events that are clustered in time and, as opposed to modeling just the maximum of

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Computing Hilbert Transform And Spectral Factorization For Signal Spaces Of Smooth Functions

00:14:51

0 views

Although the Hilbert transform and the spectral factorization are of central importance in signal processing, both operations can generally not be calculated in closed form. Therefore, algorithmic solutions are prevalent which provide an approximation of

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Content Based Singing Voice Extraction From A Musical Mixture

00:12:39

0 views

We present a deep learning based methodology for extracting the singing voice signal from a musical mixture based on the underlying linguistic content. Our model follows an encoder decoder architecture and takes as input the magnitude component of the spe

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Line Spectral Estimation With Palindromic Kernels

00:10:59

0 views

Estimation of line spectra is a classical problem in signal processing and arises in many applications. The problem is to estimate the frequencies and corresponding amplitudes of a sum of (possibly complex-valued) sinusoidal components from noisy measurem

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Confidence Estimation For Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks

00:14:47

0 views

Recently, there has been growth in providers of speech transcription services enabling others to leverage technology they would not normally be able to use. As a result, speech-enabled solutions have become commonplace. Their success critically relies on

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Clutter Identification Based On Sparse Recovery And L1-Type Probabilistic Distance Measures

00:18:01

0 views

Cognitive radar framework has recently been proposed in radar signal processing to develope algorithms for target detection, tracking, and waveform design in the presence of nonstationary environmental (clutter) characteristics. In this framework, there a

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Spoken Document Retrieval Leveraging Bert-Based Modeling And Query Reformulation

00:14:06

0 views

Spoken document retrieval (SDR) has long been deemed a fundamental and important step towards efficient organization of, and access to multimedia associated with spoken content. In this paper, we present a novel study of SDR leveraging the Bidirectional E

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Efficient Image Super Resolution Via Channel Discriminative Deep Neural Network Pruning

00:06:48

0 views

Deep convolutional neural networks (CNN) have demonstrated superior performance in image super-resolution (SR) problem.However, CNNs are known to be heavily over-parameterized, and suffer from abundant redundancy. The growing size ofCNNs may be incompatib

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Attention Driven Fusion For Multi-Modal Emotion Recognition

00:14:40

0 views

Deep learning has emerged as a powerful alternative to hand-crafted methods for emotion recognition on combined acoustic and text modalities. Baseline systems model emotion information in text and acoustic modes independently using Deep Convolutional Neur

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Emet: Embeddings From Multilingual-Encoder Transformer For Fake News Detection

00:13:20

0 views

In the last few years, social media networks have changed human life experience and behavior as it has broken down communication barriers, allowing ordinary people to actively produce multimedia content on a massive scale. On this wise, the information di

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Statistics Pooling Time Delay Neural Network Based On X-Vector For Speaker Verification

00:12:50

0 views

This paper aims to improve speaker embedding representation based on x-vector for extracting more detailed information for speaker verification. We propose a statistics pooling time delay neural network (TDNN), in which the TDNN structure integrates stati

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Large-Scale Fading Precoding For Maximizing The Product Of Sinrs

00:15:01

0 views

This paper considers the large-scale fading precoding design for mitigating the pilot contamination in the downlink of multi-cell massive MIMO (multiple-input multiple-output) systems. Rician fading with spatially correlated channels are considered where

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Adrn: Attention-Based Deep Residual Network For Hyperspectral Image Denoising

00:12:31

0 views

Hyperspectral image (HSI) denoising is of crucial importance for many subsequent applications, such as HSI classification and interpretation. In this paper, we propose an attention-based deep residual network to directly learn a mapping from noisy HSI to

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Vapar Synth - A Variational Parametric Model For Audio Synthesis

00:15:14

0 views

With the advent of data-driven statistical modeling and abundant computing power, researchers are turning increasingly to deep learning for audio synthesis. These methods try to model audio signals directly in the time or frequency domain. In the interest

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Sequential Joint Detection And Estimation With An Application To Joint Symbol Decoding And Noise Power Estimation

00:13:32

0 views

Jointly testing multiple hypotheses and estimating a random parameter of the underlying model is investigated in a sequential setup. The optimal scheme is designed such that it minimizes the expected number of used samples while keeping the probabilities

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Automatic Epileptic Seizure Onset-Offset Detection Based On Cnn In Scalp Eeg

00:14:26

1 view

We establish a deep learning-based method to automatically detect the epileptic seizure onsets and offsets in multi-channel electroencephalography (EEG) signals. A convolutional neural network (CNN) is designed to identify occurrences of seizures in EEG e

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Robust Fundamental Frequency Estimation In Coloured Noise

00:14:23

0 views

Most parametric fundamental frequency estimators make the implicit assumption that any corrupting noise is additive, white Gaussian. Under this assumption, the maximum likelihood (ML) and the least squares estimators are the same, and statistically effici

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Saliency-Based Image Contrast Enhancement With Reversible Data Hiding

00:13:57

0 views

Reversible data hiding (RDH) has become a hot research area in the recent years due to its wide applications such as authentication. Among all the RDH methods proposed, contrast enhancement based reversible data hiding is one that was recently proposed. H

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Spectrum Allocation In Wireless Networks For Crowd Labelling

00:11:57

0 views

The massive sensing data generated by Internet-of-Things will provide fuel for ubiquitous artificial intelligence (AI), while tremendous labels are required for AI model training via supervised learning. To tackle this challenge, a novel framework of wire

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Staged Training Strategy And Multi-Activation For Audio Tagging With Noisy And Sparse Multi-Label Data

00:11:18

0 views

Audio tagging aims to predict whether certain acoustic events occur in the audio clips. Due to the difficulty and huge cost of obtaining manually labeled data with high confidence, researchers begin to focus on audio tagging using a small set of manually-

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Encoding And Decoding Mixed Bandlimited Signals Using Spiking Integrate-And-Fire Neurons

00:12:17

0 views

Conventional sampling focuses on encoding and decoding bandlimited signals by recording signal amplitudes at known time points. Alternately, sampling can be approached using biologically-inspired schemes. Among these are integrate-and-fire time encoding m

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Asr Is All You Need: Cross-Modal Distillation For Lip Reading

00:12:22

0 views

The goal of this work is to train strong models for visual speech recognition without requiring human annotated ground truth data. We achieve this by distilling from an Automatic Speech Recognition (ASR) model that has been trained on a large-scale audio-

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Learning To Rank Music Tracks Using Triplet Loss

00:13:09

0 views

Most music streaming services rely on automatic recommendation algorithms to exploit their large music catalogs. These algorithms aim at retrieving a ranked list of music tracks based on their similarity with a target music track. In this work, we propose

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Probabilistic Filter And Smoother For Variational Inference Of Bayesian Linear Dynamical Systems

00:14:16

0 views

Variational inference of a Bayesian linear dynamical system is a powerful method for estimating latent variable sequences and learning sparse dynamic models in domains ranging from neuroscience to audio processing. The hardest part of the method is inferr

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Deliberation Model Based Two-Pass End-To-End Speech Recognition

00:15:44

0 views

End-to-end (E2E) models have made rapid progress in automatic speech recognition (ASR) and perform competitively relative to conventional models. To further improve the quality, a two-pass model has been proposed to rescore streamed hypotheses using the n

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Regularized Fast Multichannel Nonnegative Matrix Factorization With Ilrma-Based Prior Distribution Of Joint-Diagonalization Process

00:13:00

0 views

In this paper, we address a convolutive blind source separation (BSS) problem and propose a new extended framework of FastMNMF by introducing prior information for joint diagonalization of the spatial covariance matrix model. Recently, FastMNMF has been p

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Epi-Neighborhood Distribution Based Light Field Depth Estimation

00:13:59

0 views

In this paper, a novel depth estimation algorithm tackling foreground occlusion is proposed based on the neighborhood distribution in the sheared epipolar images (EPIs). First, the EPI is sheared to perform refocusing. Next a series of sheared EPI?s neigh

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Multi Image Depth From Defocus Network With Boundary Cue For Dual Aperture Camera

00:12:32

0 views

In this paper, we estimate depth information using two defocused images from dual aperture camera. Recent advances in deep learning techniques have increased the accuracy of depth estimation. Besides, methods of using a defocused image in which an object

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Defense Against Adversarial Attacks On Spoofing Countermeasures Of Asv

00:12:47

0 views

Various forefront countermeasure methods for automatic speaker verification (ASV) with considerable performance in anti-spoofing are proposed in the ASVspoof 2019 challenge. However, previous work has shown that countermeasure models are vulnerable to adv

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Improving Device Directedness Classification Of Utterances With Semantic Lexical Features

00:14:58

0 views

User interactions with personal assistants like Alexa, Google Home and Siri are typically initiated by a wake term or wakeword. Several personal assistants feature "follow-up" modes that allow users to make additional interactions without the need of a wa

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Comparison Of Glottal Closure Instants Detection Algorithms For Emotional Speech

00:16:51

0 views

In production of voiced speech, epochs or glottal closure instants (GCIs) refer to the instants of significant excitation of the vocal tract. Extraction of GCIs is used as a pre-processing stage in many areas of speech technology, such as in prosody modif

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

An Ontology-Aware Framework For Audio Event Classification

00:13:50

0 views

Recent advancements in audio event classification often ignore the structure and relation between the label classes available as prior information. This structure can be defined by ontology and augmented in the classifier as a form of domain knowledge. To

All Channels page: Communities submenu block

Communities

All Channels page: Societies submenu block

Societies

Events Showcase: ES submenu block

Event showcases

Recently Added Speakers

Events Hub Submenu block

Education: Education submenu block

Education Activity

2020 EAB AWARDS

2020 EAB AWARDS

IEEE ICASSP 2020 Virtual Conference May 2020