<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<item>
  <id>05362683</id>
  <dt>a</dt>
  <an>05362683</an>
  <augroup>
    <au>Butko, Taras</au>
    <au>Temko, Andrey</au>
    <au>Nadeu, Climent</au>
    <au>Canton, Cristian</au>
  </augroup>
  <ti>Inclusion of video information for detection of acoustic events using the fuzzy integral.</ti>
  <so>Popescu-Belis, Andrei (ed.) et al., Machine learning for multimodal interaction. 5th international workshop, MLMI 2008, Utrecht, The Netherlands, September 8--10, 2008. Proceedings. Berlin: Springer (ISBN 978-3-540-85852-2/pbk). Lecture Notes in Computer Science 5237, 74-85 (2008).</so>
  <py>2008</py>
  <pu>Berlin: Springer</pu>
  <lagroup>
    <la>EN</la>
  </lagroup>
  <ccgroup>
  </ccgroup>
  <utgroup>
    <ut>acoustic event detection</ut>
    <ut>fuzzy integral</ut>
    <ut>multimodality</ut>
    <ut>support vector machines</ut>
    <ut>hidden Markov models</ut>
    <ut>video 3D tracking</ut>
  </utgroup>
  <cigroup>
  </cigroup>
  <ligroup>
    <li>doi:10.1007/978-3-540-85853-9_7</li>
  </ligroup>
  <abgroup>
    <ab>Summary: When applied to interactive seminars, the detection of acoustic events from only audio information shows a large amount of errors, which are mostly due to the temporal overlaps of sounds. Video signals may be a useful additional source of information to cope with that problem for particular events. In this work, we aim at improving the detection of steps by using two audio-based Acoustic Event Detection (AED) systems, with SVM and HMM, and a video-based AED system, which employs the output of a 3D video tracking algorithm. The fuzzy integral is used to fuse the outputs of the three detection systems. Experimental results using the CLEAR 2007 evaluation data show that video information can be successfully used to improve the results of audio-based AED.</ab>
    <rv></rv>
  </abgroup>
</item>