Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ibus-engines.stt: init at unstable-2017-12-30 #173847

Draft
wants to merge 4 commits into
base: master
Choose a base branch
from

Conversation

jtojnar
Copy link
Member

@jtojnar jtojnar commented May 21, 2022

Description of changes

Accidentally found a Hackaday article mentioning ibus-deepspeech in the bookmarks of my RSS feed reader so decided to try it out. Updated stt while at it.

I managed to make the example from gst-deepspeech work with upstream audio samples but not with my microphone – perhaps. The ibus engine unfortunately does not seem to do anything for me either.

Edit: It must be some issue with the gstreamer pipeline since when I record my voice and play it, it works :

$ nix-shell -I nixpkgs=$HOME/Projects/nixpkgs -p gst_all_1.gstreamer -p gst-deepspeech --run 'gst-launch-1.0 --messages filesrc location=$PWD/fox.flac ! decodebin ! audioconvert ! audiorate ! audioresample ! deepspeech silence-length=20 ! fakesink | grep -E deepspeech.+text'
TensorFlow: v2.8.0-8-g06c8fea58fd
 Coqui STT: v1.3.0-0-g148fa743
INFO: Created TensorFlow Lite XNNPACK delegate for CPU.
Got message #169 from element "deepspeech0" (element): deepspeech, timestamp=(guint64)18446744073709551615, stream-time=(guint64)18446744073709551615, running-time=(guint64)18446744073709551615, intermediate=(boolean)false, text=(string)"to\ be\ bound\ to\ jumped\ over\ the\ lazy\ dog";

Whereas the pipeline from the ibus engine is way too off:

$ nix-shell -I nixpkgs=$HOME/Projects/nixpkgs -p gst_all_1.gstreamer -p gst-deepspeech --run 'gst-launch-1.0 --messages pulsesrc ! audioconvert ! audiorate ! audioresample ! deepspeech silence-length=20 ! fakesink | grep -E deepspeech.+text'
TensorFlow: v2.8.0-8-g06c8fea58fd
 Coqui STT: v1.3.0-0-g148fa743
INFO: Created TensorFlow Lite XNNPACK delegate for CPU.
Got message #62 from element "deepspeech0" (element): deepspeech, timestamp=(guint64)18446744073709551615, stream-time=(guint64)18446744073709551615, running-time=(guint64)18446744073709551615, intermediate=(boolean)true, text=(string)"for\ the\ ";
Got message #63 from element "deepspeech0" (element): deepspeech, timestamp=(guint64)18446744073709551615, stream-time=(guint64)18446744073709551615, running-time=(guint64)18446744073709551615, intermediate=(boolean)true, text=(string)"for\ the\ wise";
Got message #64 from element "deepspeech0" (element): deepspeech, timestamp=(guint64)18446744073709551615, stream-time=(guint64)18446744073709551615, running-time=(guint64)18446744073709551615, intermediate=(boolean)true, text=(string)"for\ the\ wise";
Got message #65 from element "deepspeech0" (element): deepspeech, timestamp=(guint64)18446744073709551615, stream-time=(guint64)18446744073709551615, running-time=(guint64)18446744073709551615, intermediate=(boolean)true, text=(string)"for\ the\ wise";
Things done
  • Built on platform(s)
    • x86_64-linux
    • aarch64-linux
    • x86_64-darwin
    • aarch64-darwin
  • For non-Linux: Is sandbox = true set in nix.conf? (See Nix manual)
  • Tested, as applicable:
  • Tested compilation of all packages that depend on this change using nix-shell -p nixpkgs-review --run "nixpkgs-review rev HEAD". Note: all changes have to be committed, also see nixpkgs-review usage
  • Tested basic functionality of all binary files (usually in ./result/bin/)
  • 22.05 Release Notes (or backporting 21.11 Release notes)
    • (Package updates) Added a release notes entry if the change is major or breaking
    • (Module updates) Added a release notes entry if the change is significant
    • (Module addition) Added a release notes entry if adding a new NixOS module
    • (Release notes changes) Ran nixos/doc/manual/md-to-db.sh to update generated release notes
  • Fits CONTRIBUTING.md.

@jtojnar jtojnar added the 6.topic: accessibility Make things in Nixpkgs usable by as many people of varying ability as possible label May 21, 2022
@jtojnar jtojnar requested a review from davidak May 21, 2022 08:13
@jtojnar jtojnar changed the title stt: 0.9.3 → 1.3.0 ibus-engines.stt: init at unstable-2017-12-30 May 21, 2022
@ofborg ofborg bot added the 8.has: package (new) This PR adds a new package label May 21, 2022
@ofborg ofborg bot requested a review from rvolosatovs May 21, 2022 08:28
@ofborg ofborg bot added 10.rebuild-darwin: 0 This PR does not cause any packages to rebuild on Darwin 10.rebuild-linux: 1-10 labels May 21, 2022
@jtojnar
Copy link
Member Author

jtojnar commented May 21, 2022

There is also some work to make stt plug-in part of gstreamer but not ready, AFACT: Elleo/gst-deepspeech#19 (comment) https://gitlab.freedesktop.org/gstreamer/gstreamer/-/merge_requests/1749

@jtojnar jtojnar marked this pull request as draft May 21, 2022 09:37
@nixos-discourse
Copy link

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/speech-to-text-virtual-keyboard-in-nixos/22684/2

@wegank wegank added the 2.status: merge conflict This PR has merge conflicts with the target branch label Mar 20, 2024
@wegank wegank added the 2.status: stale https://github.com/NixOS/nixpkgs/blob/master/.github/STALE-BOT.md label Jul 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2.status: merge conflict This PR has merge conflicts with the target branch 2.status: stale https://github.com/NixOS/nixpkgs/blob/master/.github/STALE-BOT.md 6.topic: accessibility Make things in Nixpkgs usable by as many people of varying ability as possible 8.has: package (new) This PR adds a new package 10.rebuild-darwin: 0 This PR does not cause any packages to rebuild on Darwin 10.rebuild-linux: 1-10
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants