project-bch-als-msg-banking

Boston Children's Hospital ALS Message Banking Project (Double Dipping Audio Editor)

Speech impairment is common in patients with ALS. The ALS Message Banking Project aims to preserve their voice and assist them with their needs. The platform allows the user to login to their own dashboard where they can upload and retrieve audio files securely. The project currently supports three implemented audio processing features (deadspace trimming, file splitting, and audio normalization). The ultimate goal is to have several fully functional filters to enhance sound quality so parts of the audio clips can be reused to produce a new "voice" for the user.

Technical Architectures

Frontend:
- React.js
Backend:
- Flask Python
- Audio Processing:
  - PyDub Library
  - WebRTC VAD Library
Database:
- Firestore to store user information + file names
- Google Cloud Storage for audio files

Running/Deploying the Application

Follow the steps below to run and deploy the application

Frontend

cd client

Run Frontend Locally

In client/src/components, update line 1 in settings.js so that module.exports equals 'http://localhost:8080' (change it back to 'https://api-dev-z2scpwkwva-uc.a.run.app' when pushing/deploying)

npm install

npm start

To Deploy Frontend

Push to main or create a PR to main to see the deployed website. CI/CD deploys to Firebase Hosting. The frontend is accessible at als-message-banking.web.app/ right now.

Backend

cd backend

Run Backend Locally

Setup GOOGLE_APPLICATION_CREDENTIALS as described here - https://cloud.google.com/docs/authentication/getting-started
Place the generated JSON key into the backend folder and rename it to 'serviceaccount.json"

docker build -t als-backend .
docker run --env GOOGLE_APPLICATION_CREDENTIALS='./serviceaccount.json' -p 8080:8080 als-backend

Note

Modify the Cloud Firestore DB rules, as requests are set to reject after a certain date

To Deploy Backend

Note: Incomplete GAR workflow file needs edits for functional CD, needs to provide serviceaccount.json access to docker container somehow
Backend is deployed to GCP on the Google Artifact Registry using a Docker repo
Set up authentication with docker using the gcloud credential helper as described here - https://cloud.google.com/artifact-registry/docs/docker/authentication

run this config command when deploying for the first time.

gcloud auth configure-docker us-docker.pkg.dev

then run the following commands to deploy the API.

docker build -t us-docker.pkg.dev/als-message-banking/docker/api-dev:latest .
docker push us-docker.pkg.dev/als-message-banking/docker/api-dev:latest
gcloud run deploy api-dev \
--image=us-docker.pkg.dev/als-message-banking/docker/api-dev:latest \
--platform=managed \
--region=us-central1 \
--project=als-message-banking

Additional Info (Audio Processing)

Resources Tried/General Notes:

The currently implemented processing functions only work with files converted to mono (use pyDub)
Deadspace trimmer/splitter:
- uses VAD which relies on the clarity of the patient's voice in the audio file
- "padding_duration_ms" value in the "vad_collector" function call decides how much silence is allowed between each word in a sentence
  - greater values mean more leniency is given for pauses between words
Background noise reduction:
- Attempted to use python's noisereduce library for background noise, but the library distorsts the voice in the audio file, which affects the dead space trimmer/splitter function.
- Dolby API has superior background noise reduction capabilities, but requires funding. Client is deciding whether they would like to use it or not.

Known Bugs and Future Steps

Issues

Click Here to view the list of bugs and feature suggestions.

Next Steps

Develop additional audio processing features:
- Background noise reduction
- Improve the clarity of slurred subject voice
Other features:
- Folder layout in processed files pages
- User is able to delete processed files
- User input in trimming/splitting process

Name		Name	Last commit message	Last commit date
Latest commit History 170 Commits
.github/workflows		.github/workflows
backend		backend
client		client
.firebaserc		.firebaserc
.gitignore		.gitignore
COLLABORATORS		COLLABORATORS
LICENSE		LICENSE
README.md		README.md
arch1.png		arch1.png
arch2.png		arch2.png
firebase.json		firebase.json
firestore.indexes.json		firestore.indexes.json
firestore.rules		firestore.rules

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

project-bch-als-msg-banking

Technical Architectures

Running/Deploying the Application

Frontend

Run Frontend Locally

To Deploy Frontend

Backend

Run Backend Locally

Note

To Deploy Backend

Additional Info (Audio Processing)

Resources Tried/General Notes:

Known Bugs and Future Steps

Issues

Next Steps

About

Releases 2

Packages

Contributors 10

Languages

License

BU-Spark/se-bch-als-msg-banking

Folders and files

Latest commit

History

Repository files navigation

project-bch-als-msg-banking

Technical Architectures

Running/Deploying the Application

Frontend

Run Frontend Locally

To Deploy Frontend

Backend

Run Backend Locally

Note

To Deploy Backend

Additional Info (Audio Processing)

Resources Tried/General Notes:

Known Bugs and Future Steps

Issues

Next Steps

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 10

Languages

Packages