• Login
    View Item 
    •   DSpace@RPI Home
    • Rensselaer Libraries
    • RPI Theses Online (Complete)
    • View Item
    •   DSpace@RPI Home
    • Rensselaer Libraries
    • RPI Theses Online (Complete)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Development and evaluation of spherical microphone array-enabled systems for immersive multi-user environments

    Author
    Mathews, Jonathan
    View/Open
    Mathews_rpi_0185E_11966.pdf (85.78Mb)
    Other Contributors
    Braasch, Jonas; Krueger, Ted (Theodore Edward), 1954-; Radke, Richard J., 1974-; Xiang, Ning;
    Date Issued
    2021-12
    Subject
    Architectural sciences
    Degree
    PhD;
    Terms of Use
    This electronic version is a licensed copy owned by Rensselaer Polytechnic Institute (RPI), Troy, NY. Copyright of original work retained by author.;
    Metadata
    Show full item record
    URI
    https://hdl.handle.net/20.500.13015/6144
    Abstract
    As viable applications for human-scale immersive spaces continue to be discovered, the need for reactive elements to enable effective sensing becomes increasingly prevalent. In this thesis, a spherical microphone array, or higher-order Ambisonic (HOA) array, is examined in the context of immersive, multi-user environments, to assess feasibility of real-time operation for sound-source tracking, isolation, and telematic broadcast. The design and operation of a prototype HOA array is reviewed, including static and adaptive beamforming methods for sound source isolation. Several contemporary techniques for direction of arrival (DOA) estimation are explored, with their benefits and drawbacks examined. A method for DOA estimation which addresses the shortcomings of existing techiques, the Sparse Iterative Search (SIS), is designed and evaluated against existing techniques, using a computational model and laboratory trials. This method is extended using a clustering method to demonstrate acoustic source identification. To avoid limitations with acoustic-only source identification techniques, a multiple-sensor paradigm is designed to identify multiple users, associate voice activity to individual users, and isolate voice activity using the fusion of streaming data from an HOA array and array of depth-sensing cameras with low latency. This sensor modality is tested using a computational model and tests in a lab environment. This document records the theory, process, and execution of this research study.;
    Description
    December 2021; School of Architecture
    Department
    School of Architecture;
    Publisher
    Rensselaer Polytechnic Institute, Troy, NY
    Relationships
    Rensselaer Theses and Dissertations Online Collection;
    Access
    Restricted to current Rensselaer faculty, staff and students in accordance with the Rensselaer Standard license. Access inquiries may be directed to the Rensselaer Libraries.;
    Collections
    • RPI Theses Online (Complete)

    Browse

    All of DSpace@RPICommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    Login

    DSpace software copyright © 2002-2023  DuraSpace
    Contact Us | Send Feedback
    DSpace Express is a service operated by 
    Atmire NV