
[MRG] Work on standalone model decoding function #197

Merged: 54 commits merged into persephone-tools:master on Sep 24, 2018

Conversation

@shuttle1987 (Member) commented Sep 9, 2018

This is for use with the web API to decode audio files using existing trained models.

@shuttle1987 (Member Author)

I'm thinking it might be good to provide a parameter to the persephone.model.decode function to specify the names of the inputs.

Right now we have the following:

        # TODO These placeholder names should be a backup if names from a newer
        # naming scheme aren't present. Otherwise this won't generalize to
        # different architectures.
        feed_dict = {"Placeholder:0": batch_x,
                     "Placeholder_1:0": batch_x_lens}

        dense_decoded = sess.run("SparseToDense:0", feed_dict=feed_dict)

I propose that we create parameters for:

  • batch_x name: the tensor name where the batch data is fed in
  • batch_x_lens name: the tensor name where the lengths of the batch data are fed in

We can default both of these to their current placeholder values if they are not specified.
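
For illustration, here is a minimal sketch of what that could look like. The parameter names and the graph-restoring boilerplate are assumptions for this example (TensorFlow 1.x session API), not the final signature:

    import tensorflow as tf  # TF 1.x graph/session API, as in the snippet above

    def decode(metagraph_prefix, batch_x, batch_x_lens,
               batch_x_name="Placeholder:0",
               batch_x_lens_name="Placeholder_1:0",
               output_name="SparseToDense:0"):
        """Decode a batch with a restored graph.

        The tensor names default to the current placeholder names but can be
        overridden for models built with a different naming scheme.
        """
        with tf.Session() as sess:
            # Restore the graph structure and the trained weights.
            saver = tf.train.import_meta_graph(metagraph_prefix + ".meta")
            saver.restore(sess, metagraph_prefix)
            feed_dict = {batch_x_name: batch_x, batch_x_lens_name: batch_x_lens}
            return sess.run(output_name, feed_dict=feed_dict)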

@shuttle1987 (Member Author)

I successfully did a run through with this and it works. I would very much like to create a test case for this.

@shuttle1987 (Member Author)

Just putting in the work to get a test case that can run this through in its entirety. There are some bugs that appear to be getting uncovered by doing this. At this point there are a couple of failing test cases, but this might just be due to the tests not being right.

@oadams (Collaborator) commented Sep 14, 2018

The test case written in ff6d958 actually exposes a rather nasty edge case. If the model never satisfies this conditional during training:

                    if valid_ler < best_valid_ler:

Then it never saves a checkpoint file and the evaluation will fail at the end of the Model.train method.

Just initialize best_valid_ler to a large number so that a model always gets saved. It's currently 2.0.
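
To make the failure mode concrete, here is a small self-contained sketch (the LER values and checkpoint path are made up for illustration):

    best_valid_ler = 2.0      # initial value mentioned above
    saved_model_path = None   # only set when a checkpoint is written

    for valid_ler in [2.5, 2.3, 2.1]:   # a run where the validation LER never drops below 2.0
        if valid_ler < best_valid_ler:
            best_valid_ler = valid_ler
            saved_model_path = "/tmp/model_best.ckpt"  # stand-in for saver.save(...)

    assert saved_model_path is None  # no checkpoint saved, so evaluation at the end of train() fails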

In 5b08f7e I added the following exception in the edge case where evaluation cannot occur:

                # Check we actually saved a checkpoint
                if not self.saved_model_path:
                    raise PersephoneException(
                        "No checkpoint was saved so model evaluation cannot be performed. "
                        "This can happen if the validaion LER never converges.")
                # Finally, run evaluation on the test set.
                self.eval(restore_model_path=self.saved_model_path)

This will give more feedback if there was a problem with the model. Unfortunately the test isn't passing just yet because of the same issue; we will likely need a better test corpus.

What corpus are you currently using? If the sine wave test is correct I imagine it should pretty immediately get 100% accuracy.

@shuttle1987 (Member Author) commented Sep 14, 2018

What corpus are you currently using? If the sine wave test is correct I imagine it should pretty immediately get 100% accuracy.

A bad one, where the following letters represent the frequencies of piano notes:

  • test: A
  • validation: B
  • training: C

With no overlap between any of these, I suspect the results ended up being a degenerate case.
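
For reference, a minimal sketch of how such a toy corpus could be generated so that the same notes appear in every split (the file layout and frequencies are assumptions for this example, not the actual test fixture):

    import math
    import struct
    import wave

    def write_sine_wav(path, freq_hz, seconds=1.0, rate=16000):
        """Write a mono 16-bit PCM sine tone to use as a stand-in utterance."""
        with wave.open(path, "w") as w:
            w.setnchannels(1)
            w.setsampwidth(2)
            w.setframerate(rate)
            frames = b"".join(
                struct.pack("<h", int(30000 * math.sin(2 * math.pi * freq_hz * i / rate)))
                for i in range(int(seconds * rate)))
            w.writeframes(frames)

    # Reuse the same piano notes across all splits so the model can learn the mapping,
    # unlike the disjoint test/validation/training setup described above.
    notes = [("A4", 440.0), ("B4", 493.88), ("C5", 523.25)]
    for split in ("train", "valid", "test"):
        for name, freq in notes:
            write_sine_wav(f"{split}_{name}.wav", freq)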

@shuttle1987 (Member Author)

Just initialize best_valid_ler to a large number so that a model always gets saved. It's currently 2.0.

This is something I didn't consider when writing the test case; it might be a good way to reduce the unit test time in this case.

@shuttle1987 shuttle1987 requested a review from oadams September 14, 2018 17:38
@shuttle1987 shuttle1987 changed the title [WIP] Work on standalone model decoding function [MRG] Work on standalone model decoding function Sep 14, 2018
@shuttle1987 (Member Author)

There are quite a few significant changes in this PR, and it will unblock work on the web API. As such, it would be good to release a new version after this is merged.

shuttle1987 and others added 2 commits September 15, 2018 04:02
- Added TODO
- Minor syntax changes

@oadams (Collaborator) left a comment

Nice. Before merging, we need to verify that decoding is working. I've tried doing this in ac03ded, but it's not working, so I either need to change the training params, make the training set bigger still, or just test on a WAV seen in training.

The last option, just testing on a WAV seen in training, may be best for now.

Also still running the tutorial test.

@shuttle1987 (Member Author)

I think I'd just test on a WAV seen in training; the purpose of these tests was just to get coverage over the parts of the code outside the TensorFlow model without having to run the much slower experiment test suite that uses real data. That said, if there's a way of making this sample-data test exercise the model more meaningfully, that could be valuable for a separate test case. The value of a good test of the model is somewhat orthogonal to tests of the code around the model.

@shuttle1987 (Member Author)

I had considered fixing #153 in this PR, but I hesitate to make more changes here, as this PR is in a somewhat stable state and is already over 50 commits. Seeing as this PR will unblock work on the web API, I would like to focus on finalizing anything that's required to merge it.

As for integration testing times, is this something a GPU-based install could help with, or do we specifically need to test on CPU as well?

@oadams (Collaborator) commented Sep 17, 2018

I appreciate that this isn't about model testing, but there was already a test_na.py::test_fast() test that used a couple of utterances of Na and tested basically the same surrounding architecture, except for decoding from a saved model. The tests for decode here should do more than just assert that the decoding output is not None.
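
As a rough illustration, a stronger decode test could check the content of the hypotheses rather than just their presence (the fixture names here are hypothetical, and the decode() arguments are only assumed, not the signature defined in this PR):

    def test_decode_produces_nonempty_hypotheses(trained_model_path, test_wav_paths):
        from persephone import model
        # Arguments are illustrative; the real signature is whatever this PR settles on.
        hyps = model.decode(trained_model_path, test_wav_paths)
        assert hyps, "decoding returned no hypotheses"
        assert all(len(h) > 0 for h in hyps), "decoding returned empty label sequences"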

In any case, I've put test wavs in training and decoding still outputs some empty lists.

There are two integration tests. One tests the tutorial; you can run this on a CPU pretty quickly, within an hour on the server. The other tests a full model, basically replicating the results from the paper, and takes about a day. I always use a GPU for the latter but have only run it once or twice: if the first one works, it's extremely unlikely the second one would break.

@shuttle1987 (Member Author) commented Sep 17, 2018

In any case, I've put test wavs in training and decoding still outputs some empty lists.

That's unfortunate if that isn't working; I'd hoped to get some sensible output here.

@oadams (Collaborator) commented Sep 17, 2018

Note, it doesn't always output empty lists, but it's clear that a better corpus and training procedure is needed. I can fix this sometime. The best thing to do would be to add a test for the decode() function to the integration test that tests the tutorial; that way we can confirm it's working. We can go ahead and merge this in and release, to forge ahead with the web API.

@shuttle1987 (Member Author)

The best thing to do would be to add a test for the decode() function to the integration test that tests the tutorial; that way we can confirm it's working

I agree that this is the best way forward; if this is something you could do, that would be great.

Would a JSON-formatted output of the model description help with this? If so, I can put those changes into this PR as well.

@oadams (Collaborator) commented Sep 17, 2018

Yeah, I'll do it.

Yes to JSON if it's a super quick job. Something formatted better than the current text files would be good; that said, this is for human readability, and those text files do already do the job.
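
For concreteness, a small sketch of what a JSON model description could contain (the field names and values are purely illustrative, not an agreed schema):

    import json

    model_description = {
        "model_checkpoint": "exp/1/model/model_best.ckpt",  # hypothetical path
        "num_layers": 3,
        "hidden_size": 250,
        "label_set": ["a", "b", "c"],
    }
    print(json.dumps(model_description, indent=2))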

@oadams oadams merged commit 1999ca5 into persephone-tools:master Sep 24, 2018
Successfully merging this pull request may close these issues:

  • Create minimal test corpus
  • Filesystem artifacts need tests
  • Add type signatures to just about everything