Articles

Sort by
Previous Page Page 1 of 12 Next Page
  1. Research

    Efficient music identification using ORB descriptors of the spectrogram image

    Audio fingerprinting has been an active research field typically used for music identification. Robust audio fingerprinting technology is used to successfully perform content-based audio identification regardl...

    Dominic Williams, Akash Pooransingh and Jesse Saitoo

    EURASIP Journal on Audio, Speech, and Music Processing 2017 2017:17

    Published on: 11 July 2017

  2. Research

    Autocorrelation-based noise subtraction method with smoothing, overestimation, energy, and cepstral mean and variance normalization for noisy speech recognition

    Autocorrelation domain is a proper domain for clean speech signal and noise separation. In this paper, a method is proposed to decrease effects of noise on the clean speech signal, autocorrelation-based noise ...

    Gholamreza Farahani

    EURASIP Journal on Audio, Speech, and Music Processing 2017 2017:13

    Published on: 21 June 2017

  3. Research

    Two-layer similarity fusion model for cover song identification

    Various musical descriptors have been developed for Cover Song Identification (CSI). However, different descriptors are based on various assumptions, designed for representing distinct characteristics of music...

    Ning Chen, Mingyu Li and Haidong Xiao

    EURASIP Journal on Audio, Speech, and Music Processing 2017 2017:12

    Published on: 22 May 2017

  4. Research

    A computational study of auditory models in music recognition tasks for normal-hearing and hearing-impaired listeners

    The benefit of auditory models for solving three music recognition tasks—onset detection, pitch estimation, and instrument recognition—is analyzed. Appropriate features are introduced which enable the use of s...

    Klaus Friedrichs, Nadja Bauer, Rainer Martin and Claus Weihs

    EURASIP Journal on Audio, Speech, and Music Processing 2017 2017:7

    Published on: 2 March 2017

  5. Research

    Context-dependent factored language models

    The incorporation of grammatical information into speech recognition systems is often used to increase performance in morphologically rich languages. However, this introduces demands for sufficiently large tra...

    Gregor Donaj and Zdravko Kačič

    EURASIP Journal on Audio, Speech, and Music Processing 2017 2017:6

    Published on: 28 February 2017

  6. Research

    Efficiency of chosen speech descriptors in relation to emotion recognition

    This research paper presents parametrization of emotional speech using a pool of common features utilized in emotion recognition such as fundamental frequency, formants, energy, MFCC, PLP, and LPC coefficients. T...

    Dorota Kamińska, Tomasz Sapiński and Gholamreza Anbarjafari

    EURASIP Journal on Audio, Speech, and Music Processing 2017 2017:3

    Published on: 20 February 2017

  7. Research

    Cantor Digitalis: chironomic parametric synthesis of singing

    Cantor Digitalis is a performative singing synthesizer that is composed of two main parts: a chironomic control interface and a parametric voice synthesizer. The control interface is based on a pen/touch graph...

    Lionel Feugère, Christophe d’Alessandro, Boris Doval and Olivier Perrotin

    EURASIP Journal on Audio, Speech, and Music Processing 2017 2017:2

    Published on: 23 January 2017

  8. Research

    New approach for determining the QoS of MP3-coded voice signals in IP networks

    Present-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion th...

    Tadeus Uhl, Stefan Paulsen and Krzysztof Nowicki

    EURASIP Journal on Audio, Speech, and Music Processing 2017 2017:1

    Published on: 19 January 2017

  9. Research

    Structure of pauses in speech in the context of speaker verification and classification of speech type

    Statistics of pauses appearing in Polish as a potential source of biometry information for automatic speaker recognition were described. The usage of three main types of acoustic pauses (silent, filled and bre...

    Magdalena Igras-Cybulska, Bartosz Ziółko, Piotr Żelasko and Marcin Witkowski

    EURASIP Journal on Audio, Speech, and Music Processing 2016 2016:18

    Published on: 9 November 2016

  10. Research

    Fast fundamental frequency determination via adaptive autocorrelation

    We present an algorithm for the estimation of fundamental frequencies in voiced audio signals. The method is based on an autocorrelation of a signal with a segment of the same signal. During operation, frequen...

    Michael Staudacher, Viktor Steixner, Andreas Griessner and Clemens Zierhofer

    EURASIP Journal on Audio, Speech, and Music Processing 2016 2016:17

    Published on: 24 October 2016

  11. Research

    Single microphone speech separation by diffusion-based HMM estimation

    We present a novel non-iterative and rigorously motivated approach for estimating hidden Markov models (HMMs) and factorial hidden Markov models (FHMMs) of high-dimensional signals. Our approach utilizes the a...

    Yochay R. Yeminy, Yosi Keller and Sharon Gannot

    EURASIP Journal on Audio, Speech, and Music Processing 2016 2016:16

    Published on: 18 October 2016

  12. Research

    A hybrid input-type recurrent neural network for LVCSR language modeling

    Substantial amounts of resources are usually required to robustly develop a language model for an open vocabulary speech recognition system as out-of-vocabulary (OOV) words can hurt recognition accuracy. In th...

    Vataya Chunwijitra, Ananlada Chotimongkol and Chai Wutiwiwatchai

    EURASIP Journal on Audio, Speech, and Music Processing 2016 2016:15

    Published on: 8 August 2016

  13. Research

    Voice activity detection algorithm based on long-term pitch information

    A new voice activity detection algorithm based on long-term pitch divergence is presented. The long-term pitch divergence not only decomposes speech signals with a bionic decomposition but also makes full use ...

    Xu-Kui Yang, Liang He, Dan Qu and Wei-Qiang Zhang

    EURASIP Journal on Audio, Speech, and Music Processing 2016 2016:14

    Published on: 7 July 2016

Previous Page Page 1 of 12 Next Page

Funding your APC

​​​​​​​Open access funding and policy support by SpringerOpen​​

​​​​We offer a free open access support service to make it easier for you to discover and apply for article-processing charge (APC) funding. Learn more here