The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. ex. Some numerals are expressed as "XNUMX".
Copyrights notice
The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. Copyrights notice
주파수 영역 바이노럴 모델(FDBM)은 이전에 여러 음원의 위치를 파악하기 위해 제안되었습니다. 이 방법은 두 개의 입력 신호만 필요하고 머리에서 발생하는 회절에 따른 양이간 위상 및 레벨 차이를 이용하기 때문에 머리를 물체로 간주할 때 적용 유연성이 매우 높습니다. 두 개의 마이크를 기준으로 물체가 대칭인 경우 중앙면의 대칭으로 인해 인간이 전후 혼란을 겪게 되어 음원 위치 파악 성능이 저하됩니다. 본 논문에서는 FDBM을 이용한 마이크 쌍 출력 조합을 통해 음원 위치 파악 성능 저하를 줄이는 방안을 제안한다. 제안한 방법을 보안 카메라 시스템에 적용하여 평가한 결과, 혼란스러운 원뿔 수가 줄어들어 음원 위치 파악 성능이 향상되는 것으로 나타났습니다.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
부
Yoshifumi CHISAKI, Toshimichi TAKADA, Masahiro NAGANISHI, Tsuyoshi USAGAWA, "Azimuthal and Elevation Localization Using Inter-Channel Phase and Level Differences for a Hemispheric Object" in IEICE TRANSACTIONS on Fundamentals,
vol. E91-A, no. 10, pp. 3059-3062, October 2008, doi: 10.1093/ietfec/e91-a.10.3059.
Abstract: The frequency domain binaural model (FDBM) has been previously proposed to localize multiple sound sources. Since the method requires only two input signals and uses interaural phase and level differences caused by the diffraction generated by the head, flexibility in application is very high when the head is considered as an object. When an object is symmetric with respect to the two microphones, the performance of sound source localization is degraded, as a human being has front-back confusion due to the symmetry in a median plane. This paper proposes to reduce the degradation of performance on sound source localization by a combination of the microphone pair outputs using the FDBM. The proposed method is evaluated by applying to a security camera system, and the results showed performance improvement in sound source localization because of reducing the number of cones of confusion.
URL: https://global.ieice.org/en_transactions/fundamentals/10.1093/ietfec/e91-a.10.3059/_p
부
@ARTICLE{e91-a_10_3059,
author={Yoshifumi CHISAKI, Toshimichi TAKADA, Masahiro NAGANISHI, Tsuyoshi USAGAWA, },
journal={IEICE TRANSACTIONS on Fundamentals},
title={Azimuthal and Elevation Localization Using Inter-Channel Phase and Level Differences for a Hemispheric Object},
year={2008},
volume={E91-A},
number={10},
pages={3059-3062},
abstract={The frequency domain binaural model (FDBM) has been previously proposed to localize multiple sound sources. Since the method requires only two input signals and uses interaural phase and level differences caused by the diffraction generated by the head, flexibility in application is very high when the head is considered as an object. When an object is symmetric with respect to the two microphones, the performance of sound source localization is degraded, as a human being has front-back confusion due to the symmetry in a median plane. This paper proposes to reduce the degradation of performance on sound source localization by a combination of the microphone pair outputs using the FDBM. The proposed method is evaluated by applying to a security camera system, and the results showed performance improvement in sound source localization because of reducing the number of cones of confusion.},
keywords={},
doi={10.1093/ietfec/e91-a.10.3059},
ISSN={1745-1337},
month={October},}
부
TY - JOUR
TI - Azimuthal and Elevation Localization Using Inter-Channel Phase and Level Differences for a Hemispheric Object
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 3059
EP - 3062
AU - Yoshifumi CHISAKI
AU - Toshimichi TAKADA
AU - Masahiro NAGANISHI
AU - Tsuyoshi USAGAWA
PY - 2008
DO - 10.1093/ietfec/e91-a.10.3059
JO - IEICE TRANSACTIONS on Fundamentals
SN - 1745-1337
VL - E91-A
IS - 10
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - October 2008
AB - The frequency domain binaural model (FDBM) has been previously proposed to localize multiple sound sources. Since the method requires only two input signals and uses interaural phase and level differences caused by the diffraction generated by the head, flexibility in application is very high when the head is considered as an object. When an object is symmetric with respect to the two microphones, the performance of sound source localization is degraded, as a human being has front-back confusion due to the symmetry in a median plane. This paper proposes to reduce the degradation of performance on sound source localization by a combination of the microphone pair outputs using the FDBM. The proposed method is evaluated by applying to a security camera system, and the results showed performance improvement in sound source localization because of reducing the number of cones of confusion.
ER -