Showing 251 - 300 of 1951
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Multi-Task Learning In Autonomous Driving Scenarios Via Adaptive Feature Refinement Networks
Many deep learning applications benefit from multi-task learning with several related objectives. In autonomous driving scenarios, being able to accurately infer motion and spatial information is essential for scene understanding. In this paper, we combin
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Anti-Jamming Routing For Internet Of Satellites: A Reinforcement Learning Approach
The anti-jamming routing for the Internet of Satellites (IoS) has drawn increasing attentions due to the unknown interrupts, unexpected congestion and smart jamming. This paper investigates anti-jamming routing scheme for heterogeneous IoS, with the aim o
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Frequency And Temporal Convolutional Attention For Text-Independent Speaker Recognition
Majority of the recent approaches for text-independent speaker recognition apply attention or similar techniques for aggregation of frame-level feature descriptors generated by a deep neural network (DNN) front-end. In this paper, we propose methods of co
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Admm-Based One-Bit Quantized Signal Detection For Massive Mimo Systems With Hardware Impairments
This paper considers signal detection in massive multiple-input multiple-output (MIMO) systems with general additive hardware impairments and one-bit quantization. First, we present the quantization-unaware and Bussgang decomposition-based linear receiver
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Back-To-Back Butterfly Network, An Adaptive Permutation Network For New Communication Standards
In this paper, we introduce an adaptive Back-to-Back Butterfly Network (B?BN) dedicated to next communication standards. It can perform any kind of permutation, and its architecture is based on a concatenation of basic networks. However for a set of permu
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Low-Rank Tensor Ring Model For Completing Missing Visual Data
Low rank tensor factorization can be viewed as a higher order generalization of low-rank matrix factorization, both of which have been used for image and video representation and reconstruction from compressive measurements. In this paper, we present an a
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Robust Unsupervised Audio-Visual Speech Enhancement Using A Mixture Of Variational Autoencoders
Recently, an audio-visual speech generative model based on variational autoencoder (VAE) has been proposed, which is combined with a non-negative matrix factorization (NMF) model for noise variance to perform unsupervised speech enhancement. When visual d
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Coded Illumination And Multiplexing For Lensless Imaging
Mask-based lensless cameras offer an alternative option to conventional cameras. Compared to conventional cameras, lensless cameras can be extremely thin, flexible, and light-weight. Despite these advantages, the quality of images recovered from the lensl
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Detecting Multiple Speech Disfluencies Using A Deep Residual Network With Bidirectional Long Short-Term Memory
Stuttering is a speech impediment affecting tens of millions of people on an everyday basis. Even with its commonality, there is minimal data and research on the identification and classification of stuttered speech. This paper tackles the problem of dete
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Attention Mechanism Enhanced Kernel Prediction Networks For Denoising Of Burst Images
Deep learning based image denoising methods have been extensively investigated. In this paper, attention mechanism enhanced kernel prediction networks (AME-KPNs) are proposed for burst image denoising, in which, nearly cost-free attention modules are adop
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
A Dsp Acceleration Framework For Software-Defined Radios On X86_64
This paper presents a DSP acceleration and assessment framework targeting SDR platforms on x86_64 architectures. Driven by the potential of rapid prototyping and evaluation of breakthrough concepts that these platforms provide, our work builds upon the we
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Constant Envelope Massive Mimo-Ofdm Precoding: An Improved Formulation And Solution
Constant Envelope (CE) precoding is an efficient technique for systems based on massive antenna arrays since the constant amplitude of the transmit signal facilitates the use of power efficient non-linear transmitter circuitry, such as power amplifiers (P
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
A Cross-Task Transfer Learning Approach To Adapting Deep Speech Enhancement Models To Unseen Background Noise Using Paired Senone Classifiers
We propose an environment adaptation approach that improves deep speech enhancement models via minimizing the Kullback- Leibler divergence between posterior probabilities produced by a multi-condition senone classifier (teacher) fed with noisy speech feat
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Optimum Kernel Particle Filter For Asymmetric Laplace Noise
In this paper we present on-line Bayesian filtering methods for time series models corrupted by asymmetric Laplace noise. An optimum kernel particle filter is designed for the general asymmetric case, and its performance is compared to that of a tradition
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Aipnet: Generative Adversarial Pre-Training Of Accent-Invariant Networks For End-To-End Speech Recognition
As one of the major sources in speech variability, accents have posed a grand challenge to the robustness of speech recognition systems. In this paper, our goal is to build a unified end-to-end speech recognition system that generalizes well across accent
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Mobility-Aware Beam Steering In Metasurface-Based Programmable Wireless Environments
Programmable wireless environments (PWEs) utilize electromagnetic metasurfaces to transform wireless propagation into a software-controlled resource. In this work we study the effects of user device mobility on the efficiency of PWEs. An analytical model
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Knowledge Enhanced Latent Relevance Mining For Question Answering
Answer selection which aims to select the most appropriate answer from a pre-selected candidate pool has become increasingly important in a variety of practical applications. Previous work tends to use complex attention mechanisms to capture contextual re
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
On Network Science And Mutual Information For Explaining Deep Neural Networks
In this paper, we present a new approach to interpreting deep learning models. By coupling mutual information with network science, we explore how information flows through feedforward networks. We show that efficiently approximating mutual information al
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
A Memory Augmented Architecture For Continuous Speaker Identification In Meetings
We introduce and analyze a novel approach to the problem of speaker identification in multi-party recorded meetings. Given a speech segment and a set of available candidate profiles, a data-driven approach is proposed learning the distance relations betwe
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
New Metrics For Evaluating The Accuracy Of Fundamental Frequency Estimation Approaches In Musical Signals
This paper demonstrates the importance of assessing the performance of fundamental frequency estimation algorithms on note-level descriptors in addition to frame-level accuracy. Note-level descriptors provide a better description of the human experience o
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Adversarial Mixup Synthesis Training For Unsupervised Domain Adaptation
Domain adversarial training is a popular approach for Unsupervised Domain Adaptation~(DA). However, the transferability of adversarial training framework may drop greatly on the adaptation tasks with a large distribution divergence between source and targ
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Compare Learning: Bi-Attention Network For Few-Shot Learning
Learning with few labeled data is a key challenge for visual recognition, as deep neural networks tend to overfit using a few samples only. One of the Few-shot learning methods called metric learning addresses this challenge by first learning a deep dista
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Robust Frequency-Domain Recursive Least M-Estimate Adaptive Filter For Acoustic System Identification
To identify acoustic systems in non-Gaussian and Gaussian noises, a robust frequency-domain recursive least M-estimate (FRLM) adaptive filtering algorithm is proposed. The cost function of the adaptive filter is defined by using a robust time-domain M-est
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Vggsound: A Large-Scale Audio-Visual Dataset
Our goal is to collect a large-scale audio-visual dataset with low label noise from videos `in the wild' using computer vision techniques. The resulting dataset can be used for training and evaluating audio recognition models. We make three contributions.
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
A Simple And Efficient Iterative Method For Toa Localization
This paper develops a simple and efficient method for source localization using signal time-of-arrival (TOA) measurements. There exist many TOA localization algorithms, most of which require matrix inversions. Their complexity often makes them unsuitable
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Automatic And Simultaneous Adjustment Of Learning Rate And Momentum For Stochastic Gradient-Based Optimization Methods
Stochastic gradient-based methods are prominent for training machine learning and deep learning models. The performance of these techniques depends on their hyperparameter tuning over time and varies for different models and problems. Manual adjustment of
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Guided Learning For Weakly-Labeled Semi-Supervised Sound Event Detection
We propose a simple but efficient method termed Guided Learning for weakly-labeled semi-supervised sound event detection (SED). There are two sub-targets implied in weakly-labeled SED: audio tagging and boundary detection. Instead of designing a single mo
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Transmit Beampattern Shaping Via Waveform Design In Cognitive Mimo Radar
This paper is focused on designing a set of constant modulus waveform for cognitive Multiple-Input Multiple-Output (MIMO) radar systems. The aim is to shape the beampattern in transmitter to minimize the Integrated Side-lobe Level (ISL) in spatial domain
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Power Optimization Using Embedded Automatic Gain Control Algorithm With Photoplethysmography Signal Quality Classification
This paper presents the design and implementation of an Automatic Gain Control (AGC) embedded algorithm for photoplethysmographic (PPG) sensors. We use a number of statistical and spectral characteristics of the raw and filtered PPG signals, referred to a
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Fir Filter Design And Implementation For Phase-Based Processing
Complex steerable pyramid (CSP) is widely used to decompose images into muti-scale and oriented subbands for phase-based processing, such as video magnification, frame interpolation, and view synthesis. The conventional implementation is based on frequenc
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Low-Tubal-Rank Tensor Recovery From One-Bit Measurements
This paper focuses on the recovery of low-tubal-rank tensors from binary measurements under the frame of tensor Singular Value Decomposition. We show that the direction of a tubal-rank-$r$ tensor $m{mathcal{X}}in R^{n_1 imes n_2 imes n_3}$ can be a
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Cooperative Learning Via Federated Distillation Over Fading Channels
Cooperative training methods for distributed machine learning are typically based on the exchange of local gradients or local model parameters. The latter approach is known as Federated Learning (FL). An alternative solution with reduced communication ove
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Complex Transformer: A Framework For Modeling Complex-Valued Sequence
While deep learning has received a surge of interest in a variety of fields in recent years, major deep learning models barely use complex numbers. However, speech, signal and audio data are naturally complex-valued after Fourier Transform, and studies ha
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Robust Matrix Completion Via Lp-Greedy Pursuits
A novel $ell_p$-greedy pursuit (GP) algorithm for robust matrix completion, i.e., recovering a low-rank matrix from only a subset of its noisy and outlier-contaminated entries, is devised. The $ell_p$-GP uses the strategy of sequential rank-one update.
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Robust Tdoa Indoor Tracking Using Constrained Measurement Filtering And Grid-Based Filtering
This paper considers exploiting the time difference of arrival (TDOA) measurements from a ultra wideband (UWB) indoor positioning system to locate a moving point target. In indoor environments, measured TDOAs are subject to large errors due to multipath a
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Sequence-Level Consistency Training For Semi-Supervised End-To-End Automatic Speech Recognition
This paper presents a novel semi-supervised end-to-end automatic speech recognition (ASR) method that employs consistency training with the use of unlabeled data. In consistency training, unlabeled data can be utilized for constraining a model such that i
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Cross-Speaker Silent-Speech Command Word Recognition Using Electro-Optical Stomatography
Speech recognition based on articulatory movements instead of the acoustic signal is of growing interest in the community. In this work, we present the results of a study using a novel measurement technology called Electro-Optical Stomatography to capture
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Learning Semi-Supervised Anonymized Representations By Mutual Information
This paper addresses the problem of removing from a set of data (here images) a given private information, while still allowing other utilities on the processed data. This is obtained by training concurrently a GAN-like discriminator and an autoencoder. T
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Decoding 5G-Nr Communications Via Deep Learning
Upcoming modern communications are based on 5G specifications and aim at providing solutions for novel vertical industries. One of the major changes of the physical layer is the use of Low-Density Parity-Check (LDPC) code for channel coding. Although LDPC
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Teaching Signals And Systems - A First Course In Signal Processing
Signals and systems is a well known fundamental course in signal processing. How this course is taught to a student can spell the difference between whether s/he pursues a career in this field or not. Giving due consideration to this matter, this paper re
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Looking Enhances Listening: Recovering Missing Speech Using Images
Speech is understood better by using visual context; for this reason, there have been many attempts to use images to adapt automatic speech recognition (ASR) systems. Current work, however, has shown that visually adapted ASR models only use images as a r
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
A Hierarchical Model For Dialog Act Recognition Considering Acoustic And Lexical Context Information
Dialog act recognition (DAR) is important to capture speakers' intention in a dialog system. Traditional methods commonly use the lexical information from transcripts, acoustic information from speech, and dialog context information to do DAR. However, in
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
A Low-Complexity Map Detector For Distributed Networks
This work describes a generalization of our previous maximum likelihood (ML) detector to a maximum a posteriori (MAP) detector in distributed networks using the diffusion LMS algorithm. Nodes in the network must decide between two concurrent hypotheses co
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Non-Parametric Community Change-Points Detection In Streaming Graph Signals
Detecting changes in network-structured time series data is of utmost importance in critical applications as diverse as detecting denial of service attacks against online service providers or monitoring energy and water supplies. The aim of this paper is
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Detect Insider Attacks Using Cnn In Decentralized Optimization
This paper studies the security issue of a gossip-based distributed projected gradient (DPG) algorithm, when it is applied for solving a decentralized multi-agent optimization. It is known that the gossip-based DPG algorithm is vulnerable to insider attac
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
What Is Best For Spoken Language Understanding: Small But Task-Dependant Embeddings Or Huge But Out-Of-Domain Embeddings?
Word embeddings are shown to be a great asset for several Natural Language and Speech Processing tasks. While they are already evaluated on various NLP tasks, their evaluation on spoken or natural language understanding (SLU) is less studied. The goal of