Showing 1351 - 1400 of 1951
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Time-Frequency Analysis Of Unimodal Sensory Processing In Autism Spectrum Disorder
This work summarizes the results of a time-frequency analysis of sensory processing in young adults with Autism Spectrum Disorder via continuous wavelet transform. The sensory tasks consisted of two blocks of unimodal sensory stimuli of the same type (i.e
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Roimix: Proposal-Fusion Among Multiple Images For Underwater Object Detection
Generic object detection algorithms have proven their excellent performance in recent years. However, object detection on underwater datasets is still less explored. In contrast to generic datasets, underwater images usually have color shift and low contr
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Duration Robust Weakly Supervised Sound Event Detection
Task 4 of the DCASE2018 challenge demonstrated that substantially more research is needed for a real-world application of sound event detection. Analyzing the challenge results it can be seen that most successful models are biased towards predicting long
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Federated Learning With Mutually Cooperating Devices: A Consensus Approach Towards Server-Less Model Optimization
Abstract Federated learning (FL) is emerging as a new paradigm for training a machine learning model in cooperative networks. The model parameters are optimized collectively by large populations of interconnected devices, acting as cooperative learners th
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Visually Guided Self Supervised Learning Of Speech Representations
Self supervised representation learning has recently attracted a lot of research interest for both the audio and visual modalities. However, most works typically focus on a particular modality or feature alone and there has been very limited work that stu
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Improving Proper Noun Recognition In End-To-End Asr By Customization Of The Mwer Loss Criterion
Proper nouns present a challenge for end-to-end (E2E) automatic speech recognition (ASR) systems in that a particular name may appear only rarely during training, and may have a pronunciation similar to that of a more common word. Unlike conventional ASR
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Mining Effective Negative Training Samples For Keyword Spotting
Max-pooling neural network architectures have been proven to be useful for keyword spotting (KWS), but standard training methods suffer from a class-imbalance problem when using all frames from negative utterances. To address the problem, we propose an in
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Target Parameter Estimation Via One-Bit Pmcw Radar
We consider the problem of phase modulated continuous wave (PMCW) radar signal processing when the receiver utilizes one-bit sampling with known time-varying thresholds. We formulate the target parameter estimation problem as a sparse signal recovery prob
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Attention-Based Asr With Lightweight And Dynamic Convolutions
End-to-end (E2E) automatic speech recognition (ASR) with sequence- to-sequence models has gained attention because of its simple model training compared with conventional hidden Markov model based ASR. Recently, several studies report the state-of-the-art
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Transmit Beamforming Design With Received-Interference Power Constraints: The Zero-Forcing Relaxation
The use of multi-antenna transmitters is emerging as an essential technology of the future wireless communication systems. While Zero-Forcing Beamforming (ZFB) has become the most popular low-complexity transmit beamforming design, it has some drawbacks b
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Bp-Vb-Ep Based Static And Dynamic Sparse Bayesian Learning With Kronecker Structured Dictionaries
In many applications such as massive multi-input multi-output (MIMO) radar, massive MIMO channel estimation, speech processing, image and video processing, the received signals are tensors. In such applications, utilizing techniques from tensor algebra ca
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Spatio-Temporal And Geometry Constrained Network For Automobile Visual Odometry
Visual odometry (VO) is an essence of vision-based localization and mapping system where existing learning-based approaches utilize CNN and RNN to model camera motion and gain promising results. However, these methods lack full use of the relationship bet
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Demystifying Tasnet: A Dissecting Approach
In recent years time domain speech separation has excelled over frequency domain separation in single channel scenarios and noise-free environments. In this paper we dissect the gains of the time-domain audio separation network (TasNet) approach by gradua
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Bayesian Estimation Of Plda With Noisy Training Labels, With Applications To Speaker Verification
This paper proposes a method for Bayesian estimation of probabilistic linear discriminant analysis (PLDA) when training labels are noisy. Label errors can be expected during e.g. large or distributed data collections, or for crowd-sourced data labeling. B
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Speaker Diarization With Region Proposal Network
Speaker diarization is an important pre-processing step for many speech applications, and it aims to solve the "who spoke when" problem. The standard diarization systems can achieve satisfactory results in various scenarios, but they are composed of sever
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Weighted Null Vector Initialization And Its Application To Phase Retrieval
Phase retrieval problem is an nonlinear inverse problem of recovering real- or complex-valued signal from quadratic measurements, which arises in various applications. The best-known algorithms for solving this problem are non-convex methods starting with
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Power Spectrum Optimization For Capacity Of The Extended Spectrum Hybrid Fiber Coax Network
Capacity requirements of the fixed access network keep increasing towards multi-gigabit connections. For Hybrid Fiber Coaxial (HFC) networks, aggregated rates around 30 Gbit/s can be achieved by increasing the DOCSIS spectrum to 3GHz, assuming a spectral
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Generative Pre-Training For Speech With Autoregressive Predictive Coding
Learning meaningful and general representations from unannotated speech that are applicable to a wide range of tasks remains challenging. In this paper we propose to use autoregressive predictive coding (APC), a recently proposed self-supervised objective
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
An Attention-Based Joint Acoustic And Text On-Device End-To-End Model
Recently, we introduced a two-pass on-device end-to-end (E2E) speech recognition model, which runs RNN-T in the first-pass and then rescores/redecodes the result using a noncausal Listen, Attend and Spell (LAS) decoder. This on-device model obtained simil
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Non-Gaussian Ble-Based Indoor Localization Via Gaussian Sum Filtering Coupled With Wasserstein Distance
With recent breakthroughs in signal processing, communication and networking systems, we are more and more surrounded by smart connected devices empowered by the Internet of Thing (IoT). Bluetooth Low Energy (BLE) is considered as the main-stream technolo
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
An Efficient Augmented Lagrangian-Based Method For Linear Equality-Constrained Lasso
Variable selection is one of the most important tasks in statistics and machine learning. To incorporate more prior information about the regression coefficients, various constrained Lasso models have been proposed in the literature. Compared with the cla
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Pixel-Wise Linear/Nonlinear Nonnegative Matrix Factorization For Unmixing Of Hyperspectral Data
Nonlinear spectral unmixing is a challenging and important task in hyperspectral image analysis. The kernel-based bi-objective nonnegative matrix factorization (Bi-NMF) has shown its usefulness in nonlinear unmixing; However, it suffers several issues tha
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Atomic Norm Denoising In Blind Two-Dimensional Super-Resolution
In this work, we develop a new framework for denoising in blind two-dimensional (2D) super-resolution that recovers a set of 2D continuous parameters as well as unknown waveforms from noisy samples. We apply the atomic norm to denoise a weighted sum of ti
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Phoneme Boundary Detection Using Learnable Segmental Features
Phoneme boundary detection plays an essential first step for a variety of speech processing applications such as speaker diarization, speech science, keyword spotting, etc. In this work, we propose a neural architecture coupled with a parameterized struct
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
The Role Of Annotation Fusion Methods In The Study Of Human-Reported Emotion Experience During Music Listening
Music is a universally-enjoyed art form, but listeners often respond to it in tremendously different ways. The same song can bring one person great joy and another deep sorrow. This paper focuses on modeling human music experience at the group level. In t
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Talker-Independent Speaker Separation In Reverberant Conditions
Speaker separation refers to the task of separating a mixture signal comprising two or more speakers. Impressive advances have been made recently in deep learning based talker-independent speaker separation. But such advances are achieved in anechoic cond
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Robust And Computationally-Efficient Anomaly Detection Using Powers-Of-Two Networks
Robust and computationally efficient anomaly detection in videos is a problem in video surveillance systems. We propose a technique to increase robustness and reduce computational complexity in a Convolutional Neural Network (CNN) based anomaly detector t
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
A Dataset For Measuring Reading Levels In India At Scale
One out of four children in India are leaving grade eight without basic reading skills. Measuring the reading levels in a vast country like India poses significant hurdles. Recent advances in machine learning opens up the possibility of automating this ta
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Anomaly Detection With Training Data In Hyperspectral Imagery
In this paper, we investigate the anomaly detection problem for multi-pixel targets in hyperspectral imagery when training data are available. We derive the generalized likelihood ratio test and obtain its analytical expressions of the probability of fals
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Classifying Anomalies For Network Security
Detecting and classifying anomalous behaviors in computer networks remains a formidable challenge. This work outlines a machine learning technique that uses deep neural networks to detect and classify a variety of network attacks. Our approach is based on
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Resilient To Byzantine Attacks Finite-Sum Optimization Over Networks
This contribution deals with distributed finite-sum optimization for learning over networks in the presence of malicious Byzantine attacks. To cope with such attacks, resilient approaches so far combine stochastic gradient descent (SGD) with different rob
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
A Comparative Study Of Western And Chinese Classical Music Based On Soundscape Models
Whether literally or suggestively, the concept of soundscape is alluded in both modern and ancient music. In this study, we examine whether we can analyze and compare Western and Chinese classical music based on soundscape models. We addressed this questi
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Pitch Estimation Via Self-Supervision
We present a method to estimate the fundamental frequency in monophonic audio, often referred to as pitch estimation. In contrast to existing methods, our neural network can be fully trained only on unlabeled data, using self-supervision. A tiny amount of
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Load Management With Predictions Of Solar Energy Production For Cloud Data Centers
Power supply of big infrastructures is today a tremendous operational cost for providers and the expected growth of Internet traffic and services will lead to a further expansion of the computing and networking infrastructures and this, in its turn, raise
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Enhanced Method Of Audio Coding Using Cnn-Based Spectral Recovery With Adaptive Structure
A process of spectral recovery can enhance the performance of transform-based audio coding by transmitting only a portion of spectral data and recovering the missing spectral data in the decoder. This study proposes an enhanced method of audio coding base
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Urtis: A Small 3D Imaging Sonar Sensor For Robotic Applications
State-of-the-art autonomous vehicles mainly rely on optical sensors to perceive their environment. However, the performance of these sensors worsens dramatically in environments where airborne particles are present. Sonar sensors rely on acoustic waves wh
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Near Capacity Rcqd Constellations For Papr Reduction Of Ofdm Systems
We investigate an optimized blind SeLected Mapping (SLM) algorithm to reduce the Peak-to-Average Power Ratio (PAPR) for Orthogonal Frequency Division Multiplexing (OFDM) systems with Signal Space Diversity (SSD). Several phase sequences based on two Rotat
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
End-To-End Speech Translation With Self-Contained Vocabulary Manipulation
In machine translation, vocabulary manipulation is a way to reduce the target vocabulary based on the source sentence and the word dictionary, which is effective to lower latency during inference for text translation in industrial application. But vocabul
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Effect Of Choice Of Probability Distribution, Randomness, And Search Methods For Alignment Modeling In Sequence-To-Sequence Text-To-Speech Synthesis Using Hard Alignment
Sequence-to-sequence text-to-speech (TTS) is dominated by soft-attention-based methods. Recently, hard-attention-based methods have been proposed to prevent fatal alignment errors, but their sampling method of discrete alignment is poorly investigated. Th
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Ecg Heartbeat Classification Based On Multi-Scale Wavelet Convolutional Neural Networks
This paper proposes a novel Deep Learning technique for ECG beats classification. Unlike the traditional Deep Learning models, a new Multi-Scale Wavelet Convolutional Neural Networks (MS-WCNN) is proposed to recognize automatically various cardiac arrhyth
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Convergence-Guaranteed Independent Positive Semidefinite Tensor Analysis Based On Student's T Distribution
In this paper, we address a blind source separation (BSS) problem and propose a new extended framework of independent positive semidefinite tensor analysis (IPSDTA). IPSDTA is a state-of-the-art BSS method that enables us to take interfrequency correlatio
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Nearest Kronecker Product Decomposition Based Normalized Least Mean Square Algorithm
Recently, nearest Kronecker product (NKP) decomposition based Wiener filter and Recursive Least Squares (RLS) have been proposed and was found to be a good candidate for system identification and echo cancellation and was shown to offer better tracking pe
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Accounting For Microprosody In Modeling Intonation
Intonation models are often used for the generation of fundamental frequency (f0) contours in speech synthesis. Current intonation models only represent the intentional f0 components that are related to the phonological structure of the utterance. However
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Time-Predictable Software-Defined Architecture With Sdf-Based Compiler Flow For 5G Baseband Processing
The advent of 5G networks motivates the need for high-performance, low-power, time-predictable hardware that can handle the aggressive real-time latency and throughput requirements of baseband processing. With newer generations like 5G, programmable hardw
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Data Selection Kernel Conjugate Gradient Algorithm
In recent years, the interest in kernel methods has increased exponentially, mainly due to applications including phenomena that cannot be well modeled by linear systems. Furthermore, the demand for high-speed communications and improvement in computer ca
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Learning Spatio-Temporal Convolutional Network For Real-Time Object Tracking
Siamese series of tracking networks have shown great potentials in achieving balanced accuracy and beyond real-time speed. However, most of existing siamese trackers only consider appearance features of first frame, and hardly benefit from interframe info
- IEEE MemberUS $11.00
- Society MemberUS $0.00
- IEEE Student MemberUS $11.00
- Non-IEEE MemberUS $15.00
Improving Language Identification For Multilingual Speakers
Spoken language identification (LID) technologies have improved in recent years from discriminating largely distinct languages to discriminating highly similar languages or even dialects of the same language. One aspect that has been mostly neglected, how