
Kirigami: Lightweight Speech Filtering for Privacy-Preserving Activity Recognition using Audio

[paper (IMWUT 2024)]

Authors: [Sudershan Boovaraghavan] [Haozhe Zhou] [Mayank Goel] [Yuvraj Agarwal]

Abstract: Audio-based human activity recognition (HAR) is very popular due to many human activities having unique sound signatures that can be detected by machine learning (ML) approaches under various environmental contexts without re-training. These audio-based ML HAR pipelines often use common featurization techniques, such as extracting various statistical and spectral features by converting time domain signals to the frequency domain (using an FFT) and building ML models on top of them. Some of these approaches also claim privacy benefits by preventing the identification of human speech. However, recent deep learning-based automatic speech recognition (ASR) models pose new privacy challenges to these featurization techniques. In this paper, we aim to systematically characterize various proposed featurization approaches for audio data in terms of their privacy risks using a set of metrics for speech intelligibility (PER and WER) as well as the utility tradeoff in terms of the resulting accuracy of ML-based activity recognition. Our results demonstrate the vulnerability of several of these approaches to recent ASR models, particularly when subjected to re-tuning or retraining, with fine-tuned ASR models achieving an average Phoneme Error Rate (PER) of 39.99% and Word Error Rate (WER) of 44.43% in speech recognition for these approaches. We then propose Kirigami, a lightweight machine learning-based audio speech filter that removes human speech segments, reducing the efficacy of various ASR techniques (70.48% PER and 101.40% WER) while also preventing sounds for HAR tasks from being filtered, thereby maintaining HAR accuracy (76.0% accuracy). We show that Kirigami can be implemented on common edge microcontrollers with limited computational capabilities and memory, providing a path to deployment on IoT devices. Finally, we conducted a real-world user study and showed the robustness of Kirigami on a laptop and an ARM Cortex-M4F microcontroller under three different background noises.
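At its core, Kirigami works frame by frame: it featurizes each short audio frame with an FFT, classifies the frame as speech or background, and suppresses the frames flagged as speech before the audio reaches downstream HAR models. A minimal sketch of this idea (the frame size, log-magnitude features, and zeroing strategy here are illustrative assumptions, not the repository's exact implementation):

import numpy as np

FRAME = 512  # samples per frame at 16 kHz (~32 ms); an assumed value

def featurize(frames):
    # Log-magnitude FFT features for each row of an (n_frames, FRAME) array.
    return np.log1p(np.abs(np.fft.rfft(frames, axis=1)))

def kirigami_filter(audio, clf):
    # Split audio into frames and zero out those the classifier flags as speech.
    n = len(audio) // FRAME
    frames = audio[: n * FRAME].reshape(n, FRAME).copy()
    is_speech = clf.predict(featurize(frames)).astype(bool)
    frames[is_speech] = 0.0
    return frames.reshape(-1)

Here clf is any scikit-learn-style classifier trained to separate speech frames from background frames (see Filter Training below).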

Reference

Download Paper Here

BibTeX Reference:

@article{boovaraghavan2024kirigami,
  title={Kirigami: Lightweight speech filtering for privacy-preserving activity recognition using audio},
  author={Boovaraghavan, Sudershan and Zhou, Haozhe and Goel, Mayank and Agarwal, Yuvraj},
  journal={Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies},
  volume={8},
  number={1},
  pages={1--28},
  year={2024},
  publisher={ACM New York, NY, USA}
}

Installation:

1. Clone (or Fork!) this repository

git clone https://github.com/synergylabs/kirigami.git

2. Install portaudio

Install the prerequisite portaudio (https://files.portaudio.com/download.html) library:

macOS

brew install portaudio

GNU/Linux

Use your distribution's package manager to install PyAudio. For example, on Debian-based systems:

sudo apt install python3-pyaudio

3. Create a virtual environment

We recommend using Conda (https://docs.anaconda.com/miniconda/#miniconda-latest-installer-links). Tested on Ubuntu 22.04 with Python 3.10.

conda create --name kirigami_env python=3.10 pip
conda activate kirigami_env

4. Install python packages

python -m pip install -r requirements.txt

Caution: If you run into issues installing pyaudio on macOS, follow the steps below:

pip uninstall pyaudio
python3 -m pip install pyaudio --global-option="build_ext" --global-option="-I/opt/homebrew/include" --global-option="-L/opt/homebrew/lib"
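
A quick way to confirm that pyaudio installed correctly and can see your audio devices:

python -c "import pyaudio; p = pyaudio.PyAudio(); print(p.get_device_count(), 'audio devices'); p.terminate()"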

Usage:

Live Visualization

1. Run the streaming visualization of Kirigami locally:

python live_vis.py
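
Under the hood, a live filter of this kind reads one short frame at a time from the microphone and gates it before visualizing or forwarding it. A rough sketch of such a capture loop with PyAudio (the sample rate, frame size, and the commented filter call are illustrative assumptions, not a transcript of live_vis.py):

import numpy as np
import pyaudio

RATE, FRAME = 16000, 512  # assumed sample rate and frame size

p = pyaudio.PyAudio()
stream = p.open(format=pyaudio.paFloat32, channels=1, rate=RATE,
                input=True, frames_per_buffer=FRAME)
try:
    while True:
        raw = stream.read(FRAME, exception_on_overflow=False)
        frame = np.frombuffer(raw, dtype=np.float32)
        # filtered = kirigami_filter(frame, clf)  # gate speech frames (see sketch above)
        # ...visualize or forward the filtered frame here...
except KeyboardInterrupt:
    stream.stop_stream()
    stream.close()
    p.terminate()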

Filter Training:

1. Download and prepare the required datasets as described in init_dataset.py.

We will need three datasets: TIMIT, which you must download yourself, and ESC-50 and MS-SNSD, which the script downloads automatically.

python init_dataset.py

2. Explore training your own Kirigami filters.

You can follow the steps in the experiments notebooks to train basic Logistic Regression models to detect speech, or customize your own models; a condensed training sketch follows the list below.

  • experiments/speech_detector.ipynb
  • experiments/background_detector.ipynb
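
At their core, both notebooks come down to fitting a frame-level classifier on labeled speech and background frames. A condensed sketch of that recipe with scikit-learn (random arrays stand in for real frames cut from TIMIT, ESC-50, and MS-SNSD; the featurization is an assumption):

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

FRAME = 512

def featurize(frames):
    # Log-magnitude FFT features per frame, as in the filtering sketch above.
    return np.log1p(np.abs(np.fft.rfft(frames, axis=1)))

# Stand-ins for framed 16 kHz audio: replace these with frames cut from
# TIMIT (speech) and ESC-50 / MS-SNSD (background) recordings.
speech = np.random.randn(2000, FRAME)
background = np.random.randn(2000, FRAME)

X = featurize(np.vstack([speech, background]))
y = np.r_[np.ones(len(speech)), np.zeros(len(background))]
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print("frame accuracy:", clf.score(X_te, y_te))

The resulting clf can be dropped into the filtering sketch above.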

Updates:

  • 2024-09-13: Initial release of Kirigami live visualization and filter training scripts.

Contact:

For more information, please contact [email protected] or [email protected].