Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

labels_dict is considering files which are not actually labels. #1

Open
meetgandhi123 opened this issue Dec 5, 2022 · 1 comment
Open

Comments

@meetgandhi123
Copy link

The labels_dict is considering some folders which are not really useful. By considering all the folders in the Speech_Command_V0.02 folder like the following:

  • background_noise
  • testing_list.txt
  • validation_list.txt.....

The labels index are not just 0-34 but from the range 0-40.

From:

labels_dict=os.listdir(train_audio_path)

output:
labels_dict = ['tree', 'cat', 'go', 'left', 'yes', '.DS_Store', 'sheila', 'learn', 'stop', 'backward', 'seven', 'follow', 'zero', 'three', 'down', 'no', 'up', 'six', 'four', 'nine', 'LICENSE', 'happy', 'validation_list.txt', 'background_noise', 'wow', 'visual', 'house', 'README.md', 'off', 'five', 'dog', 'one', 'eight', 'testing_list.txt', 'on', 'two', 'marvin', 'bird', 'forward', 'right', 'bed']

To:

labels_dict=list(set(labels))
labels_dict = ['tree', 'cat', 'go', 'left', 'yes', 'sheila', 'learn', 'stop', 'backward', 'seven', 'follow', 'zero', 'three', 'down', 'no', 'up', 'six', 'four', 'nine', 'happy', 'wow', 'visual', 'house', 'off', 'five', 'dog', 'one', 'eight', 'on', 'two', 'marvin', 'bird', 'forward', 'right', 'bed']

@aminul-huq
Copy link
Owner

Hi,

Yes. You are correct. This code does have that issue. When I wrote it I actually removed those files manually at the beginning but never mentioned it in the repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants