IEEE ICASSP 2020 Virtual Conference May 2020

Thu, 16 July, 2020

IEEE Member: US $11.00
Society Member: US $0.00
IEEE Student Member: US $11.00
Non-IEEE Member: US $15.00

Deliberation Model Based Two-Pass End-To-End Speech Recognition

00:15:44

0 views

End-to-end (E2E) models have made rapid progress in automatic speech recognition (ASR) and perform competitively relative to conventional models. To further improve the quality, a two-pass model has been proposed to rescore streamed hypotheses using the n

Regularized Fast Multichannel Nonnegative Matrix Factorization With ILRMA-Based Prior Distribution Of Joint-Diagonalization Process

00:13:00

0 views

In this paper, we address a convolutive blind source separation (BSS) problem and propose a new extended framework of FastMNMF by introducing prior information for joint diagonalization of the spatial covariance matrix model. Recently, FastMNMF has been p

EPI-Neighborhood Distribution Based Light Field Depth Estimation

00:13:59

0 views

In this paper, a novel depth estimation algorithm tackling foreground occlusion is proposed based on the neighborhood distribution in the sheared epipolar images (EPIs). First, the EPI is sheared to perform refocusing. Next, a series of sheared EPI's neigh

Multi Image Depth From Defocus Network With Boundary Cue For Dual Aperture Camera

00:12:32

0 views

In this paper, we estimate depth information using two defocused images from a dual aperture camera. Recent advances in deep learning techniques have increased the accuracy of depth estimation. Besides, methods of using a defocused image in which an object

Defense Against Adversarial Attacks On Spoofing Countermeasures Of ASV

00:12:47

0 views

Various forefront countermeasure methods for automatic speaker verification (ASV) with considerable performance in anti-spoofing are proposed in the ASVspoof 2019 challenge. However, previous work has shown that countermeasure models are vulnerable to adv

Improving Device Directedness Classification Of Utterances With Semantic Lexical Features

00:14:58

0 views

User interactions with personal assistants like Alexa, Google Home and Siri are typically initiated by a wake term or wakeword. Several personal assistants feature "follow-up" modes that allow users to make additional interactions without the need of a wa

Comparison Of Glottal Closure Instants Detection Algorithms For Emotional Speech

00:16:51

0 views

In production of voiced speech, epochs or glottal closure instants (GCIs) refer to the instants of significant excitation of the vocal tract. Extraction of GCIs is used as a pre-processing stage in many areas of speech technology, such as in prosody modif

Indoor Altitude Estimation Of Unmanned Aerial Vehicles Using A Bank Of Kalman Filters

00:14:29

0 views

Altitude estimation is important for successful control and navigation of unmanned aerial vehicles (UAVs). UAVs do not have indoor access to GPS signals and can only use on-board sensors for reliable estimation of altitude. Unfortunately, most existing na

Speaker-Aware Training Of Attention-Based End-To-End Speech Recognition Using Neural Speaker Embeddings

00:14:46

0 views

In speaker-aware training, a speaker embedding is appended to DNN input features. This allows the DNN to effectively learn representations, which are robust to speaker variability. We apply speaker-aware training to attention-based end-to-end speech recog

Learning Eating Environments Through Scene Clustering

00:09:14

0 views

It is well known that dietary habits have a significant influence on health. While many studies have been conducted to understand this relationship, little is known about the relationship between eating environments and health. Yet researchers and health

Open Set Video Camera Model Verification

00:15:51

0 views

We introduce a new open set video forensics problem called video camera model verification. The video camera model verification task is to determine if two query videos were captured by the same camera model. Importantly, verification must be reliable on

Solving Non-Convex Non-Differentiable Min-Max Games Using Proximal Gradient Method

00:11:40

0 views

Min-max saddle point games appear in a wide range of applications in machine learning and signal processing. Despite their wide applicability, theoretical studies are mostly limited to the special convex-concave structure. While some recent works generaliz

Time Difference Of Arrival Estimation From Frequency-Sliding Generalized Cross-Correlations Using Convolutional Neural Networks

00:10:55

0 views

The interest in deep learning methods for solving traditional signal processing tasks has been steadily growing in recent years. Time delay estimation (TDE) in adverse scenarios is a challenging problem, where classical approaches based on generalized c

Metric Representations Of Networks: A Uniqueness Result

00:12:44

0 views

In this paper, we consider the problem of projecting networks onto metric spaces. Networks are structures that encode relationships between pairs of elements or nodes. However, these relationships can be independent of each other, and need not be defined

Phonetic Feedback For Speech Enhancement With And Without Parallel Speech Data

00:13:34

0 views

While deep learning systems have gained significant ground in speech enhancement research, these systems have yet to make use of the full potential of deep learning systems to provide high-level feedback. In particular, phonetic feedback is rare in speech

Overlap-Aware Diarization: Resegmentation Using Neural End-To-End Overlapped Speech Detection

00:14:06

0 views

We address the problem of effectively handling overlapping speech in a diarization system. First, we detail a neural Long Short-Term Memory-based architecture for overlap detection. Secondly, detected overlap regions are exploited in conjunction with a fr

Hybrid Precoding For Secure Transmission In Reflect-Array-Assisted Massive MIMO Systems

00:12:49

0 views

Recently, a hybrid analog-digital architecture has been proposed for multiuser MIMO transmission in the millimeter-wave spectrum using reflect-arrays. The architecture exhibits scalability and high energy-efficiency while keeping the transmitter cost-effi

D-SLAM: Diffusion Source Localization And Trajectory Mapping

00:15:33

0 views

We consider physical fields induced by a finite number of instantaneous diffusion sources, which we sample using a mobile sensor, along unknown trajectories composed of multiple linear segments. We address the problem of estimating the sources, as well as

Revealing Hidden Drawings In Leonardo's 'The Virgin Of The Rocks' From Macro X-Ray Fluorescence Scanning Data Through Element Line Localisation

00:14:54

1 view

Macro X-Ray Fluorescence (XRF) scanning is an increasingly widely used imaging technique for the non-invasive detection and mapping of chemical elements in Old Master paintings. Existing approaches for XRF signal analysis require varying degrees of expert

Preconditioning ADMM For Fast Decentralized Optimization

00:16:22

0 views

In this work, we consider the distributed optimization problem using networked computing machines. Specifically, we are interested in solving this problem using the alternating direction method of multipliers (ADMM) while accounting for edge weights. Exis

Multi-Resolution Overlapping Stripes Network For Person Re-Identification

00:11:54

0 views

This paper addresses the person re-identification (PReID) problem by combining global and local information at multiple feature resolutions with different loss functions. Many previous studies address this problem using either part-based features or globa

Selective Attention Encoders By Syntactic Graph Convolutional Networks For Document Summarization

00:12:16

1 view

Abstractive text summarization is a challenging task, and one needs to design a mechanism to effectively extract salient information from the source text and then generate a summary. A parsing process of the source text contains critical syntactic or seman

A Probabilistic Scheme For Representation Learning With Radial Transform Images

00:15:36

0 views

Data representation can facilitate training of deep neural networks when limited data is available. We have previously proposed the radial transform sampling method as a data representation technique for training neural networks. In this paper, a probabili

Joint Blind Calibration And Time-Delay Estimation For Multiband Ranging

00:13:45

0 views

In this paper, we focus on the problem of blind joint calibration of multiband transceivers and time-delay (TD) estimation of multipath channels. We show that this problem can be formulated as a particular case of covariance matching. Although this proble

Addressing Challenges In Building Web-Scale Content Classification Systems

00:15:49

0 views

Understanding the semantic meaning of content on the web through the lens of a taxonomy has many practical advantages. However, when building large-scale content classification systems, practitioners are faced with unique challenges involving finding the

Low-Latency Single Channel Speech Enhancement Using U-Net Convolutional Neural Networks

00:15:27

0 views

Single-channel speech enhancement (SE) can be described, in its simplest terms, as learning a transformation from single-channel noisy speech to the clean speech. To do this, we propose a simple but effective U-Net convolutional neural network (CNN) based

A Generalization Of Principal Component Analysis

00:14:50

0 views

Conventional principal component analysis (PCA) finds a principal vector that maximizes the sum of second powers of principal components. We consider a generalized PCA that aims at maximizing the sum of an arbitrary convex function of principal components

An Improved Selective Active Noise Control Algorithm Based On Empirical Wavelet Transform

00:12:21

0 views

The gradual adaptation and possibility of divergence have been the two main obstacles in the efficient implementation of conventional adaptive active noise control (ANC) to a wider range of applications. Selective ANC (SANC) has been proposed to rapidly r

Learning A Representation For Cover Song Identification Using Convolutional Neural Network

00:11:43

0 views

Cover song identification is a challenging task in the field of Music Information Retrieval (MIR) due to complex musical variations between query tracks and cover versions. Previous works typically utilize hand-crafted features and alignment algorithms fo

Reduced-Complexity Singular Value Decomposition For Tucker Decomposition: Algorithm And Hardware

00:13:31

0 views

Tensors, as the multidimensional generalization of matrices, are naturally suited for representing and processing high dimensional data. To date, tensors have been widely adopted in various data-intensive applications, such as machine learning and big dat

Counting Dense Objects In Remote Sensing Images

00:13:13

0 views

Accurately estimating the number of objects of interest in a given image is a challenging yet important task. Significant efforts have been made to address this problem and great progress has been achieved, yet counting the number of ground objects from remote sensing image

EDNFC-Net: Convolutional Neural Network With Nested Feature Concatenation For Nuclei-Instance Segmentation

00:13:28

0 views

Accurate nuclei identification is an important step in diagnosis of several diseases. The problem is complex due to heterogeneity in structure, color, and texture among the different categories of cells. The problem is further complicated due to overlappe

A Novel Saliency-Driven Oil Tank Detection Method For Synthetic Aperture Radar Images

00:13:46

0 views

Synthetic aperture radar (SAR) imaging systems play an important role in earth observation research. This leads to the significance of target detection in SAR images. In this paper, we propose a novel saliency-driven oil tank detection method (SDD) for SAR

Proximal Distance Algorithm For Nonconvex QCQP With Beamforming Applications

00:14:21

0 views

This paper studies nonconvex quadratically constrained quadratic program (QCQP), which is known to be NP-hard in general. In the past decades, various approximate approaches have been developed to tackle the QCQP, including semidefinite relaxation (SDR),

Audio Sound Determination Using Feature Space Attention Based Convolution Recurrent Neural Network

00:16:18

0 views

The classification framework has been popularly adopted to perform sound event detection. However, the existing neural-network-based classification approaches treat each feature dimension equally and the varying influence of feature dimensions has n

A Return To Dereverberation In The Frequency Domain Using A Joint Learning Approach

00:12:53

0 views

Dereverberation is often performed in the time-frequency domain using mostly deep learning approaches. Time-frequency domain processing, however, may not be necessary when reverberation is modeled by the convolution operation. In this paper, we investigat

Adaptation Of RNN Transducer With Text-To-Speech Technology For Keyword Spotting

00:13:57

1 view

With the advent of the recurrent neural network transducer (RNN-T) model, the performance of keyword spotting (KWS) systems has greatly improved. However, the KWS systems, employed for wake-word detection, still rely on the availability of keyword specific tr

Regression Before Classification For Temporal Action Detection

00:12:01

0 views

Action classification combined with location regression is a widely-utilized mechanism in existing temporal action detection methods. However, there exists an inconsistency problem between locations and categories of action instances in this mechanism. Mo

Resource Management In The Multibeam NOMA-Based Satellite Downlink

00:15:41

0 views

A beam-free approach to channel allocation in a multi-beam four-color satellite coverage area is taken. Non-Orthogonal Multiple Access (NOMA) and Orthogonal Multiple Access (OMA) are compared as methods to serve users not necessarily located on the refere

IQ-STAN: Image Quality Guided Spatio-Temporal Attention Network For License Plate Recognition

00:13:55

0 views

License plate recognition (LPR) is one of the essential components in intelligent transportation systems. Although the image processing algorithms for LPR have been extensively studied in the past several years, the recognition performance is still not sa

A Unified Sequence-To-Sequence Front-End Model For Mandarin Text-To-Speech Synthesis

00:15:40

0 views

In a Mandarin text-to-speech (TTS) system, the front-end text processing module significantly influences the intelligibility and naturalness of synthesized speech. Building a typical pipeline-based front-end which consists of multiple individual components

Unsupervised Key Hand Shape Discovery Of Sign Language Videos With Correspondence Sparse Autoencoders

00:12:03

0 views

Recognition of sign language is a difficult task which often requires tedious annotations by sign language experts. End-to-end learning attempts that bypass frame level annotations have achieved some success in limited datasets, but it has been shown that

Self-Supervised Learning For Audio-Visual Speaker Diarization

00:12:23

0 views

Speaker diarization, which is to find the speech segments of specific speakers, has been widely used in human-centered applications such as video conferences or human-computer interaction systems. In this paper, we propose a self-supervised audio-video sy

Balanced Binary Neural Networks With Gated Residual

00:12:16

0 views

Binary neural networks have attracted considerable attention in recent years. However, mainly due to the information loss stemming from the biased binarization, how to preserve the accuracy of networks still remains a critical issue. In this paper, we attempt

Robust Speaker Recognition Using Unsupervised Adversarial Invariance

00:11:47

0 views

In this paper, we address the problem of speaker recognition in challenging acoustic conditions using a novel method to extract robust speaker-discriminative speech representations. We adopt a recently proposed unsupervised adversarial invariance architec

Text Adaptation For Speaker Verification With Speaker-Text Factorized Embeddings

00:12:02

1 view

Text mismatch between pre-collected data, either training data or enrollment data, and the actual test data can significantly hurt text-dependent speaker verification (SV) system performance. Although this problem can be solved by carefully collecting dat

Multi-View Clustering Via Mixed Embedding Approximation

00:12:21

0 views

This paper tackles multi-view clustering by proposing a novel mixed embedding approximation (MEA) method. Formally, we aim to learn a uniform orthogonal embedding based on the orthogonal pre-embeddings of each view. At first, we hope that the uniform emb

Multilinear Generalized Singular Value Decomposition (ML-GSVD) With Application To Coordinated Beamforming In Multi-User MIMO Systems

00:14:31

0 views

In this paper, we propose a new Multilinear Generalized Singular Value Decomposition (ML-GSVD) which makes it possible to jointly factorize a set of matrices with one common dimension. The ML-GSVD is an extension of the Generalized Singular Value Decomposition (GSVD
