IEEE ICASSP 2020 Virtual Conference May 2020

Thu, 16 July, 2020

Showing 1351 - 1400 of 1951

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Time-Frequency Analysis Of Unimodal Sensory Processing In Autism Spectrum Disorder

00:13:11

0 views

This work summarizes the results of a time-frequency analysis of sensory processing in young adults with Autism Spectrum Disorder via continuous wavelet transform. The sensory tasks consisted of two blocks of unimodal sensory stimuli of the same type (i.e

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Roimix: Proposal-Fusion Among Multiple Images For Underwater Object Detection

00:12:25

0 views

Generic object detection algorithms have proven their excellent performance in recent years. However, object detection on underwater datasets is still less explored. In contrast to generic datasets, underwater images usually have color shift and low contr

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Duration Robust Weakly Supervised Sound Event Detection

00:14:19

1 view

Task 4 of the DCASE2018 challenge demonstrated that substantially more research is needed for a real-world application of sound event detection. Analyzing the challenge results it can be seen that most successful models are biased towards predicting long

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Federated Learning With Mutually Cooperating Devices: A Consensus Approach Towards Server-Less Model Optimization

00:15:01

0 views

Abstract Federated learning (FL) is emerging as a new paradigm for training a machine learning model in cooperative networks. The model parameters are optimized collectively by large populations of interconnected devices, acting as cooperative learners th

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Visually Guided Self Supervised Learning Of Speech Representations

00:14:44

0 views

Self supervised representation learning has recently attracted a lot of research interest for both the audio and visual modalities. However, most works typically focus on a particular modality or feature alone and there has been very limited work that stu

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Improving Proper Noun Recognition In End-To-End Asr By Customization Of The Mwer Loss Criterion

00:12:25

0 views

Proper nouns present a challenge for end-to-end (E2E) automatic speech recognition (ASR) systems in that a particular name may appear only rarely during training, and may have a pronunciation similar to that of a more common word. Unlike conventional ASR

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Mining Effective Negative Training Samples For Keyword Spotting

00:14:46

0 views

Max-pooling neural network architectures have been proven to be useful for keyword spotting (KWS), but standard training methods suffer from a class-imbalance problem when using all frames from negative utterances. To address the problem, we propose an in

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Target Parameter Estimation Via One-Bit Pmcw Radar

00:13:32

0 views

We consider the problem of phase modulated continuous wave (PMCW) radar signal processing when the receiver utilizes one-bit sampling with known time-varying thresholds. We formulate the target parameter estimation problem as a sparse signal recovery prob

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Attention-Based Asr With Lightweight And Dynamic Convolutions

00:13:25

0 views

End-to-end (E2E) automatic speech recognition (ASR) with sequence- to-sequence models has gained attention because of its simple model training compared with conventional hidden Markov model based ASR. Recently, several studies report the state-of-the-art

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Transmit Beamforming Design With Received-Interference Power Constraints: The Zero-Forcing Relaxation

00:16:08

1 view

The use of multi-antenna transmitters is emerging as an essential technology of the future wireless communication systems. While Zero-Forcing Beamforming (ZFB) has become the most popular low-complexity transmit beamforming design, it has some drawbacks b

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Bp-Vb-Ep Based Static And Dynamic Sparse Bayesian Learning With Kronecker Structured Dictionaries

00:12:04

0 views

In many applications such as massive multi-input multi-output (MIMO) radar, massive MIMO channel estimation, speech processing, image and video processing, the received signals are tensors. In such applications, utilizing techniques from tensor algebra ca

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Spatio-Temporal And Geometry Constrained Network For Automobile Visual Odometry

00:14:31

0 views

Visual odometry (VO) is an essence of vision-based localization and mapping system where existing learning-based approaches utilize CNN and RNN to model camera motion and gain promising results. However, these methods lack full use of the relationship bet

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Demystifying Tasnet: A Dissecting Approach

00:12:59

0 views

In recent years time domain speech separation has excelled over frequency domain separation in single channel scenarios and noise-free environments. In this paper we dissect the gains of the time-domain audio separation network (TasNet) approach by gradua

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Bayesian Estimation Of Plda With Noisy Training Labels, With Applications To Speaker Verification

00:12:05

0 views

This paper proposes a method for Bayesian estimation of probabilistic linear discriminant analysis (PLDA) when training labels are noisy. Label errors can be expected during e.g. large or distributed data collections, or for crowd-sourced data labeling. B

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Speaker Diarization With Region Proposal Network

00:13:09

0 views

Speaker diarization is an important pre-processing step for many speech applications, and it aims to solve the "who spoke when" problem. The standard diarization systems can achieve satisfactory results in various scenarios, but they are composed of sever

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Weighted Null Vector Initialization And Its Application To Phase Retrieval

00:15:56

0 views

Phase retrieval problem is an nonlinear inverse problem of recovering real- or complex-valued signal from quadratic measurements, which arises in various applications. The best-known algorithms for solving this problem are non-convex methods starting with

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Power Spectrum Optimization For Capacity Of The Extended Spectrum Hybrid Fiber Coax Network

00:13:10

0 views

Capacity requirements of the fixed access network keep increasing towards multi-gigabit connections. For Hybrid Fiber Coaxial (HFC) networks, aggregated rates around 30 Gbit/s can be achieved by increasing the DOCSIS spectrum to 3GHz, assuming a spectral

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Generative Pre-Training For Speech With Autoregressive Predictive Coding

00:14:31

0 views

Learning meaningful and general representations from unannotated speech that are applicable to a wide range of tasks remains challenging. In this paper we propose to use autoregressive predictive coding (APC), a recently proposed self-supervised objective

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

An Attention-Based Joint Acoustic And Text On-Device End-To-End Model

00:18:09

0 views

Recently, we introduced a two-pass on-device end-to-end (E2E) speech recognition model, which runs RNN-T in the first-pass and then rescores/redecodes the result using a noncausal Listen, Attend and Spell (LAS) decoder. This on-device model obtained simil

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Non-Gaussian Ble-Based Indoor Localization Via Gaussian Sum Filtering Coupled With Wasserstein Distance

00:14:46

0 views

With recent breakthroughs in signal processing, communication and networking systems, we are more and more surrounded by smart connected devices empowered by the Internet of Thing (IoT). Bluetooth Low Energy (BLE) is considered as the main-stream technolo

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

An Efficient Augmented Lagrangian-Based Method For Linear Equality-Constrained Lasso

00:11:20

0 views

Variable selection is one of the most important tasks in statistics and machine learning. To incorporate more prior information about the regression coefficients, various constrained Lasso models have been proposed in the literature. Compared with the cla

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Pixel-Wise Linear/Nonlinear Nonnegative Matrix Factorization For Unmixing Of Hyperspectral Data

00:13:07

0 views

Nonlinear spectral unmixing is a challenging and important task in hyperspectral image analysis. The kernel-based bi-objective nonnegative matrix factorization (Bi-NMF) has shown its usefulness in nonlinear unmixing; However, it suffers several issues tha

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Atomic Norm Denoising In Blind Two-Dimensional Super-Resolution

00:14:59

3 views

In this work, we develop a new framework for denoising in blind two-dimensional (2D) super-resolution that recovers a set of 2D continuous parameters as well as unknown waveforms from noisy samples. We apply the atomic norm to denoise a weighted sum of ti

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Phoneme Boundary Detection Using Learnable Segmental Features

00:14:12

0 views

Phoneme boundary detection plays an essential first step for a variety of speech processing applications such as speaker diarization, speech science, keyword spotting, etc. In this work, we propose a neural architecture coupled with a parameterized struct

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

The Role Of Annotation Fusion Methods In The Study Of Human-Reported Emotion Experience During Music Listening

00:14:21

0 views

Music is a universally-enjoyed art form, but listeners often respond to it in tremendously different ways. The same song can bring one person great joy and another deep sorrow. This paper focuses on modeling human music experience at the group level. In t

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Talker-Independent Speaker Separation In Reverberant Conditions

00:14:44

0 views

Speaker separation refers to the task of separating a mixture signal comprising two or more speakers. Impressive advances have been made recently in deep learning based talker-independent speaker separation. But such advances are achieved in anechoic cond

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Robust And Computationally-Efficient Anomaly Detection Using Powers-Of-Two Networks

00:13:07

0 views

Robust and computationally efficient anomaly detection in videos is a problem in video surveillance systems. We propose a technique to increase robustness and reduce computational complexity in a Convolutional Neural Network (CNN) based anomaly detector t

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Dataset For Measuring Reading Levels In India At Scale

00:10:51

0 views

One out of four children in India are leaving grade eight without basic reading skills. Measuring the reading levels in a vast country like India poses significant hurdles. Recent advances in machine learning opens up the possibility of automating this ta

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement

[2 Videos ]

Recent studies have highlighted adversarial examples as ubiquitous threats to the deep neural network (DNN) based speech recognition systems. In this work, we present a U-Net based attention model, U-Net$_{At}$, to enhance adversarial speech signals. Spec

Show videos in this product

Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement

00:14:18

2 views

Recent studies have highlighted adversarial examples as ubiquitous threats to the deep neural network (DNN) based speech recognition systems. In this work, we present a U-Net based attention model, U-Net$_{At}$, to enhance adversarial speech signals. Spec
Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement

00:14:01

0 views

Recent studies have highlighted adversarial examples as ubiquitous threats to the deep neural network (DNN) based speech recognition systems. In this work, we present a U-Net based attention model, U-Net$_{At}$, to enhance adversarial speech signals. Spec

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Anomaly Detection With Training Data In Hyperspectral Imagery

00:13:31

0 views

In this paper, we investigate the anomaly detection problem for multi-pixel targets in hyperspectral imagery when training data are available. We derive the generalized likelihood ratio test and obtain its analytical expressions of the probability of fals

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Classifying Anomalies For Network Security

00:14:15

1 view

Detecting and classifying anomalous behaviors in computer networks remains a formidable challenge. This work outlines a machine learning technique that uses deep neural networks to detect and classify a variety of network attacks. Our approach is based on

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Resilient To Byzantine Attacks Finite-Sum Optimization Over Networks

00:13:44

0 views

This contribution deals with distributed finite-sum optimization for learning over networks in the presence of malicious Byzantine attacks. To cope with such attacks, resilient approaches so far combine stochastic gradient descent (SGD) with different rob

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

A Comparative Study Of Western And Chinese Classical Music Based On Soundscape Models

00:14:57

0 views

Whether literally or suggestively, the concept of soundscape is alluded in both modern and ancient music. In this study, we examine whether we can analyze and compare Western and Chinese classical music based on soundscape models. We addressed this questi

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Pitch Estimation Via Self-Supervision

00:10:40

0 views

We present a method to estimate the fundamental frequency in monophonic audio, often referred to as pitch estimation. In contrast to existing methods, our neural network can be fully trained only on unlabeled data, using self-supervision. A tiny amount of

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Load Management With Predictions Of Solar Energy Production For Cloud Data Centers

00:14:01

0 views

Power supply of big infrastructures is today a tremendous operational cost for providers and the expected growth of Internet traffic and services will lead to a further expansion of the computing and networking infrastructures and this, in its turn, raise

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Enhanced Method Of Audio Coding Using Cnn-Based Spectral Recovery With Adaptive Structure

00:12:29

0 views

A process of spectral recovery can enhance the performance of transform-based audio coding by transmitting only a portion of spectral data and recovering the missing spectral data in the decoder. This study proposes an enhanced method of audio coding base

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Urtis: A Small 3D Imaging Sonar Sensor For Robotic Applications

00:12:00

0 views

State-of-the-art autonomous vehicles mainly rely on optical sensors to perceive their environment. However, the performance of these sensors worsens dramatically in environments where airborne particles are present. Sonar sensors rely on acoustic waves wh

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Near Capacity Rcqd Constellations For Papr Reduction Of Ofdm Systems

00:12:37

0 views

We investigate an optimized blind SeLected Mapping (SLM) algorithm to reduce the Peak-to-Average Power Ratio (PAPR) for Orthogonal Frequency Division Multiplexing (OFDM) systems with Signal Space Diversity (SSD). Several phase sequences based on two Rotat

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Acu-Net: A 3D Attention Context U-Net For Multiple Sclerosis Lesion Segmentation

[2 Videos ]

Multiple Sclerosis (MS) lesion segmentation from MR images is important for neuroimaging analysis. MS is diffuse, multifocal, and tend to involve peripheral brain structures such as the white matter, corpus callosum, and brainstem. Recently, U-Net has mad

Show videos in this product

Acu-Net: A 3D Attention Context U-Net For Multiple Sclerosis Lesion Segmentation

00:12:46

0 views

Multiple Sclerosis (MS) lesion segmentation from MR images is important for neuroimaging analysis. MS is diffuse, multifocal, and tend to involve peripheral brain structures such as the white matter, corpus callosum, and brainstem. Recently, U-Net has mad
Acu-Net: A 3D Attention Context U-Net For Multiple Sclerosis Lesion Segmentation

00:12:46

0 views

Multiple Sclerosis (MS) lesion segmentation from MR images is important for neuroimaging analysis. MS is diffuse, multifocal, and tend to involve peripheral brain structures such as the white matter, corpus callosum, and brainstem. Recently, U-Net has mad

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

End-To-End Speech Translation With Self-Contained Vocabulary Manipulation

00:12:32

0 views

In machine translation, vocabulary manipulation is a way to reduce the target vocabulary based on the source sentence and the word dictionary, which is effective to lower latency during inference for text translation in industrial application. But vocabul

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Effect Of Choice Of Probability Distribution, Randomness, And Search Methods For Alignment Modeling In Sequence-To-Sequence Text-To-Speech Synthesis Using Hard Alignment

00:13:56

0 views

Sequence-to-sequence text-to-speech (TTS) is dominated by soft-attention-based methods. Recently, hard-attention-based methods have been proposed to prevent fatal alignment errors, but their sampling method of discrete alignment is poorly investigated. Th

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Ecg Heartbeat Classification Based On Multi-Scale Wavelet Convolutional Neural Networks

00:14:57

0 views

This paper proposes a novel Deep Learning technique for ECG beats classification. Unlike the traditional Deep Learning models, a new Multi-Scale Wavelet Convolutional Neural Networks (MS-WCNN) is proposed to recognize automatically various cardiac arrhyth

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Convergence-Guaranteed Independent Positive Semidefinite Tensor Analysis Based On Student's T Distribution

00:13:30

0 views

In this paper, we address a blind source separation (BSS) problem and propose a new extended framework of independent positive semidefinite tensor analysis (IPSDTA). IPSDTA is a state-of-the-art BSS method that enables us to take interfrequency correlatio

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Nearest Kronecker Product Decomposition Based Normalized Least Mean Square Algorithm

00:13:48

0 views

Recently, nearest Kronecker product (NKP) decomposition based Wiener filter and Recursive Least Squares (RLS) have been proposed and was found to be a good candidate for system identification and echo cancellation and was shown to offer better tracking pe

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Accounting For Microprosody In Modeling Intonation

00:13:55

0 views

Intonation models are often used for the generation of fundamental frequency (f0) contours in speech synthesis. Current intonation models only represent the intentional f0 components that are related to the phonological structure of the utterance. However

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Time-Predictable Software-Defined Architecture With Sdf-Based Compiler Flow For 5G Baseband Processing

00:20:12

0 views

The advent of 5G networks motivates the need for high-performance, low-power, time-predictable hardware that can handle the aggressive real-time latency and throughput requirements of baseband processing. With newer generations like 5G, programmable hardw

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Data Selection Kernel Conjugate Gradient Algorithm

00:15:00

0 views

In recent years, the interest in kernel methods has increased exponentially, mainly due to applications including phenomena that cannot be well modeled by linear systems. Furthermore, the demand for high-speed communications and improvement in computer ca

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Learning Spatio-Temporal Convolutional Network For Real-Time Object Tracking

00:12:49

0 views

Siamese series of tracking networks have shown great potentials in achieving balanced accuracy and beyond real-time speed. However, most of existing siamese trackers only consider appearance features of first frame, and hardly benefit from interframe info

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Improving Language Identification For Multilingual Speakers

00:12:38

0 views

Spoken language identification (LID) technologies have improved in recent years from discriminating largely distinct languages to discriminating highly similar languages or even dialects of the same language. One aspect that has been mostly neglected, how

All Channels page: Communities submenu block

Communities

All Channels page: Societies submenu block

Societies

Events Showcase: ES submenu block

Event showcases

Recently Added Speakers

Events Hub Submenu block

Education: Education submenu block

Education Activity

2020 EAB AWARDS

2020 EAB AWARDS

IEEE ICASSP 2020 Virtual Conference May 2020